Unsupervised abnormal detection using VAE with memory

被引：8

作者：

Xie, Xin ^{[1
]}

Li, Xinlei ^{[1
]}

Wang, Bin ^{[1
]}

Wan, Tiancheng ^{[1
]}

Xu, Lei ^{[1
]}

Li, Huiping ^{[2
]}

机构：

[1] East China Jiaotong Univ, Sch Informat Engn, Nanchang, Jiangxi, Peoples R China

[2] Univ Elect Sci & Technol China, Sch Informat & Commun Engn, Chengdu, Peoples R China

来源：

SOFT COMPUTING | 2022年 / 26卷 / 13期

基金：

中国国家自然科学基金;

关键词：

Anomaly detection; Unsupervised learning; Variational autoencoder; Generative adversarial networks; MHMA; ANOMALY DETECTION; NOVELTY DETECTION;

D O I：

10.1007/s00500-022-07140-6

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Anomaly detection based on generative models usually uses the reconstruction loss of samples for anomaly discrimination. However, there are two problems in semi-supervised or unsupervised learning. One is that the generalizing ability of the generator is too strong, which may reduce the reconstruction loss of some outliers. The other is that the background statistics will interfere with the reconstruction loss of outliers. Both of them will reduce the effectiveness of anomaly detection. In this paper, we propose an anomaly detection method called MHMA (Multi-headed Memory Autoencoder). The variational autoencoder is used as the generation model, and the vector in potential space is limited by the memory module, which increases the reconstruction error of abnormal samples. Moreover, the MHMA uses the multi-head structure to divide the last layer of the decoder into multiple branches to learn and generate a diverse sample distribution, which keeps the generalization capability of the model within a reasonable range. In the process of calculating outliers, a likelihood ratio method is employed to obtain correct background statistics according to the background model, thus enhancing the specific features in the reconstructed samples. The effectiveness and universality of MHMA are tested on different types of datasets, and the results show that the model achieves 99.5% recall, 99.9% precision, 99.69% F1 and 98.12% MCC on the image dataset and it achieves 98.61% recall, 98.73% precision, 98.67% F1 and 95.82% MCC on the network security dataset.

引用

页码：6219 / 6231

页数：13

共 36 条

[21]

LeCun Y., 1998, MNIST handwritten digit database

[22]

Leglaive S, 2018, IEEE 28 INT WORKSHOP, P16

[23]

Lichman M., 2013, UCI MACHINE LEARNING

[24] Isolation Forest [J].

Liu, Fei Tony ;

Ting, Kai Ming ;

Zhou, Zhi-Hua .

ICDM 2008: EIGHTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2008, :413-+

[25] Novelty detection: a review - part 2: neural network based approaches [J].

Markou, M ;

Singh, S .

SIGNAL PROCESSING, 2003, 83 (12) :2499-2521

[26] Novelty detection: a review - part 1: statistical approaches [J].

Markou, M ;

Singh, S .

SIGNAL PROCESSING, 2003, 83 (12) :2481-2497

[27] A review of novelty detection [J].

Pimentel, Marco A. F. ;

Clifton, David A. ;

Clifton, Lei ;

Tarassenko, Lionel .

SIGNAL PROCESSING, 2014, 99 :215-249

[28]

Schlegl T., 2017, LECT NOTES COMPUTER, P146, DOI [10.1007/978-3-319-59050-9_12, DOI 10.1007/978-3-319-59050-9_12]

[29]

Schlkopf B, 2000, ADV NEURAL INFORM PR

[30]

Tan S. C., 2011, PROC INT JOINT C ART, P1511

← 1 2 3 4 →