An Anomaly Detection Method Combining Mutual Information Estimation with Adversarial Autoencoder

被引:0
|
作者
Huo W.-G. [1 ,2 ]
Wang X. [2 ]
Liang R. [2 ]
机构
[1] Information Security Evaluation Center of Civil Aviation, Civil Aviation University of China, Tianjin
[2] School of Computer Science and Technology, Civil Aviation University of China, Tianjin
来源
Beijing Youdian Daxue Xuebao/Journal of Beijing University of Posts and Telecommunications | 2021年 / 44卷 / 05期
关键词
Adversarial autoencoder; Anomaly detection; Deep generative mode; Mutual information estimation; Semi-supervised learning;
D O I
10.13190/j.jbupt.2021-009
中图分类号
学科分类号
摘要
According to the information theory, the training objective of the unsupervised deep learning networks can be interpreted as maximizing the mutual information between the training samples and their representations. Adversarial autoencoder (AAE) learns the distribution of the training samples by the generative adversarial method. So the semi-supervised anomaly detection model based on the normal sample sets can be established using AAE. However, AAE cannot maximize the mutual information between the normal samples and their representations explicitly. A semi-supervised anomaly detection method based on mutual information estimation network and AAE (IAAE) is proposed. Firstly, the encoder and decoder of the AAE are trained to minimize the reconstruction error. Then, in the adversarial regularization stage, the aggregated posterior of the normal sample's representations are matched to the arbitrary prior distribution, and the mutual information between normal samples and their representations is maximized. Finally, the mutual information between normal samples and their representations are estimated by fully connected neural network. The reconstruction error of the test sample and its mode divergence in the hidden space are used to calculate the abnormal score. The experimental results on public datasets show that the IAAE has better performance than the existing typical deep anomaly detection models in terms of F1 values. © 2021, Editorial Department of Journal of Beijing University of Posts and Telecommunications. All right reserved.
引用
收藏
页码:28 / 34
页数:6
相关论文
共 17 条
  • [1] Pang Guansong, Shen Chunhua, Cao Longbing, Et al., Deep learning for anomaly detection: a review, ACM Computing Surveys, 54, 2, pp. 1-38, (2021)
  • [2] Ribeiro M, Lazzaretti A E, Lopes H S., A study of deep convolutional auto-encoders for anomaly detection in videos, Pattern Recognition Letters, 105, pp. 13-22, (2018)
  • [3] Zhou Chong, Paffenroth R C., Anomaly detection with robust deep autoencoders, 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 665-674, (2017)
  • [4] Zong Bo, Song Qi, Min M R, Et al., Deep autoencoding Gaussian mixture model for unsupervised anomaly detection, International Conference on Learning Representations, pp. 1-19, (2018)
  • [5] An J, Cho S., Variational autoencoder based anomaly detection using reconstruction probability, Special Lecture on IE, 2, 1, pp. 1-18, (2015)
  • [6] Yao Rong, Liu Chongdang, Zhang Linxuan, Et al., Unsupervised anomaly detection using variational auto-encoder based feature extraction, 2019 IEEE International Conference on Prognostics and Health Management (ICPHM), pp. 1-7, (2019)
  • [7] Schlegl T, Seebock P, Waldstein S M, Et al., Unsupervised anomaly detection with generative adversarial networks to guide marker discovery, International Conference on Information Processing in Medical Imaging, pp. 146-157, (2017)
  • [8] Zenati H, Romain M, Foo C S, Et al., Adversarially learned anomaly detection, 2018 IEEE International Conference on Data Mining (ICDM), pp. 727-736, (2018)
  • [9] Li Chunyuan, Liu Hao, Chen Changyou, Et al., ALICE: towards understanding adversarial learning for joint distribution matching, Advances in Neural Information Processing Systems, pp. 5495-5503, (2017)
  • [10] Pidhorskyi S, Almohsen R, Doretto G., Generative pro-babilistic novelty detection with adversarial autoencoders, Advances in Neural Information Processing Systems, pp. 6822-6833, (2018)