Abstract:
Modelling normal data is one of the major challenges in outlier detection. Deep learning has been proven to be effective in modelling underlying distributions of input tr...Show MoreMetadata
Abstract:
Modelling normal data is one of the major challenges in outlier detection. Deep learning has been proven to be effective in modelling underlying distributions of input training data. However, the existing deep learning-based methods normally focus on how to alleviate the negative impact from the presence of outliers when they model the full dataset. Besides, insufficient size of training data also leads to unsatisfactory training of deep networks. This paper proposes a WGAN-empowered deep Autoencoder-based Outlier detection approach (GAEO for short) which presents a novel way to directly and effectively model reliable normal data. GAEO first obtains initial normal data by a proposed initial distance-based outlier scoring function, then constructs a WGAN network to perform normal data augmentation to obtain adequate training samples for subsequent normal data modelling. The deep autoencoder is then trained to discover and yield distributions and patterns of normal data, and the reconstruction error is defined as outlierness of each data object. In our experiments, we investigate the effectiveness of our method GAEO compared with three state-of-the-art outlier detectors on ten real-world datasets, and discuss the impact of the parameters in GAEO. We show that our method GAEO significantly outperforms its contenders by 4% to 20% AUC improvement.
Published in: 2019 IEEE 9th International Conference on Electronics Information and Emergency Communication (ICEIEC)
Date of Conference: 12-14 July 2019
Date Added to IEEE Xplore: 05 August 2019
ISBN Information: