An irrelevant attributes resistant approach to anomaly detection in high-dimensional space using a deep hypersphere structure

被引：12

作者：

Zheng, Jian ^{[1
]}

Qu, Hongchun ^{[1
,2
]}

Li, Zhaoni ^{[1
,3
]}

Li, Lin ^{[1
]}

Tang, Xiaoming ^{[2
]}

机构：

[1] Chongqing Univ Posts & Telecommun, Coll Comp Sci & Technol, Chongqing 400065, Peoples R China

[2] Chongqing Univ Posts & Telecommun, Coll Automat, Chongqing 400065, Peoples R China

[3] Qinghai Normal Univ, Coll Comp, Xining 810008, Qinghai, Peoples R China

来源：

APPLIED SOFT COMPUTING | 2022年 / 116卷

关键词：

Anomaly; Deep networks; Hypersphere; High dimension; Irrelevant attributes; SUPPORT VECTOR MACHINE; GENERATIVE ADVERSARIAL NETWORKS; OUTLIER DETECTION; OPTIMAL TRANSPORT; BARYCENTERS; CLASSIFIER; POINT; TSVH; SVM;

D O I：

10.1016/j.asoc.2021.108301

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

It is a grand challenge to detect anomalies existing in subspaces from a high-dimensional space. Most existing state-of-the-art methods implicitly or explicitly rely on distances. Since the contrast, e.g., distances, between data objects in a high-dimensional space becomes more and more similar. Moreover, high-dimensional spaces may include many irrelevant attributes masking anomalies (if the prior probability for a class remains unchanged regardless of the value observed for attribute att, att is said to be irrelevant to a class, i.e., att is an irrelevant attribute). Obviously, anomalies can exist in any of subspaces, so it is difficult to select subspaces that highlight the relevant attributes in an exponential searching space. To address this issue, we proposed a hybrid method consisting of a deep network and a hypersphere to detect anomalies. The deep network in the proposed method is used as a feature extractor to capture the low-dimensional features from the background space. Then, anomalies are separated by using the hypersphere in the feature space reconstructed by probability distribution. To prevent irrelevant attributes from being mistaken for anomalies during mining anomalies, the upper of the number of anomalies is estimated by the Chebyshev theorem. Finally, the proposed method was verified on synthetic datasets and real-world datasets. Experimental results show that the proposed method outperforms the existing state-of-the-art detection methods in regard to the accuracy of mining anomalies and the ability of noise resistance. We find that feature extractors can improve the ability of noise resistance for anomalous detection methods. In the feature space reconstructed by probability distribution, anomalous features are easily identified from irrelevant features and normal features. We also indicate that irrelevant attributes increase the complexity of the feature space, through calculating the probability distribution of data in the background space, the layered features can be extracted to distinguish anomaly classes, normal classes, and irrelevant attribute classes. (C) 2021 Elsevier B.V. All rights reserved.

引用

页数：20

共 91 条

[31] Reducing the dimensionality of data with neural networks
Hinton, G. E.
Salakhutdinov, R. R.
[J]. SCIENCE, 2006, 313 (5786) : 504 - 507
[32] Anomaly Detection for a Water Treatment System Using Unsupervised Machine Learning
Inoue, Jun
Yamagata, Yoriyuki
Chen, Yuqi
Poskitt, Christopher M.
Sun, Jun
[J]. 2017 17TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW 2017), 2017, : 1058 - 1065
[33] Unmasking the abnormal events in video
Ionescu, Radu Tudor
Smeureanu, Sorina
Alexe, Bogdan
Popescu, Marius
[J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 2914 - 2922
[34] Javaid A., 2016, P 9 EAI INT C BIOINS, DOI DOI 10.4108/EAI.3-12-2015.2262516
[35] Optimizing Over Radial Kernels on Compact Manifolds
Jayasumana, Sadeep
Hartley, Richard
Salzmann, Mathieu
Li, Hongdong
Harandi, Mehrtash
[J]. 2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 3802 - 3809
[36] Jerone T.A., 2016, INT J MACH LEARN COM, V6
[37] Multi-Class Supervised Novelty Detection
Jumutc, Vilen
Suykens, Johan A. K.
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2014, 36 (12) : 2510 - 2523
[38] Kantorovich Leonid Vitalevich, 1948, USP MAT NAUK, P225, DOI [10.1007/s10958-006-0050-9, DOI 10.1007/S10958-006-0050-9]
[39] Deep learning with support vector data description
Kim, Sangwook
Choi, Yonghwa
Lee, Minho
[J]. NEUROCOMPUTING, 2015, 165 : 111 - 117
[40] Kingma D P., 2014, P INT C LEARN REPR

← 1 2 3 4 5 6 7 8 9 10 →