An irrelevant attributes resistant approach to anomaly detection in high-dimensional space using a deep hypersphere structure

被引:12
作者
Zheng, Jian [1 ]
Qu, Hongchun [1 ,2 ]
Li, Zhaoni [1 ,3 ]
Li, Lin [1 ]
Tang, Xiaoming [2 ]
机构
[1] Chongqing Univ Posts & Telecommun, Coll Comp Sci & Technol, Chongqing 400065, Peoples R China
[2] Chongqing Univ Posts & Telecommun, Coll Automat, Chongqing 400065, Peoples R China
[3] Qinghai Normal Univ, Coll Comp, Xining 810008, Qinghai, Peoples R China
关键词
Anomaly; Deep networks; Hypersphere; High dimension; Irrelevant attributes; SUPPORT VECTOR MACHINE; GENERATIVE ADVERSARIAL NETWORKS; OUTLIER DETECTION; OPTIMAL TRANSPORT; BARYCENTERS; CLASSIFIER; POINT; TSVH; SVM;
D O I
10.1016/j.asoc.2021.108301
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
It is a grand challenge to detect anomalies existing in subspaces from a high-dimensional space. Most existing state-of-the-art methods implicitly or explicitly rely on distances. Since the contrast, e.g., distances, between data objects in a high-dimensional space becomes more and more similar. Moreover, high-dimensional spaces may include many irrelevant attributes masking anomalies (if the prior probability for a class remains unchanged regardless of the value observed for attribute att, att is said to be irrelevant to a class, i.e., att is an irrelevant attribute). Obviously, anomalies can exist in any of subspaces, so it is difficult to select subspaces that highlight the relevant attributes in an exponential searching space. To address this issue, we proposed a hybrid method consisting of a deep network and a hypersphere to detect anomalies. The deep network in the proposed method is used as a feature extractor to capture the low-dimensional features from the background space. Then, anomalies are separated by using the hypersphere in the feature space reconstructed by probability distribution. To prevent irrelevant attributes from being mistaken for anomalies during mining anomalies, the upper of the number of anomalies is estimated by the Chebyshev theorem. Finally, the proposed method was verified on synthetic datasets and real-world datasets. Experimental results show that the proposed method outperforms the existing state-of-the-art detection methods in regard to the accuracy of mining anomalies and the ability of noise resistance. We find that feature extractors can improve the ability of noise resistance for anomalous detection methods. In the feature space reconstructed by probability distribution, anomalous features are easily identified from irrelevant features and normal features. We also indicate that irrelevant attributes increase the complexity of the feature space, through calculating the probability distribution of data in the background space, the layered features can be extracted to distinguish anomaly classes, normal classes, and irrelevant attribute classes. (C) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页数:20
相关论文
共 91 条
  • [1] Aggarwal C. C., 2013, OUTLIER ANAL, DOI [DOI 10.1007/978-1-4614-6396-2, 10.1007/978-1-4614-6396-2]
  • [2] Aggarwal C. C., 2016, Outlier analysis, V2nd
  • [3] BARYCENTERS IN THE WASSERSTEIN SPACE
    Agueh, Martial
    Carlier, Guillaume
    [J]. SIAM JOURNAL ON MATHEMATICAL ANALYSIS, 2011, 43 (02) : 904 - 924
  • [4] Ai Qing, 2018, 2018 IEEE 7 DAT DRIV
  • [5] GANomaly: Semi-supervised Anomaly Detection via Adversarial Training
    Akcay, Samet
    Atapour-Abarghouei, Amir
    Breckon, Toby P.
    [J]. COMPUTER VISION - ACCV 2018, PT III, 2019, 11363 : 622 - 637
  • [6] A fixed-point approach to barycenters in Wasserstein space
    Alvarez-Esteban, Pedro C.
    del Barrio, E.
    Cuesta-Albertos, J. A.
    Matran, C.
    [J]. JOURNAL OF MATHEMATICAL ANALYSIS AND APPLICATIONS, 2016, 441 (02) : 744 - 762
  • [7] Data outlier detection using the Chebyshev theorem
    Amidan, Brett G.
    Ferryman, Thomas A.
    Cooley, Scott K.
    [J]. 2005 IEEE Aerospace Conference, Vols 1-4, 2005, : 3814 - 3819
  • [8] Discrete Wasserstein barycenters: optimal transport for discrete data
    Anderes, Ethan
    Borgwardt, Steffen
    Miller, Jacob
    [J]. MATHEMATICAL METHODS OF OPERATIONS RESEARCH, 2016, 84 (02) : 389 - 409
  • [9] Andrews J.T.A., 2016, Int. J. Mach. Learn. Comput., V6, P21, DOI 10.18178/ijmlc.2016. 6.1.565
  • [10] [Anonymous], 2016, P INT JOINT C ART IN