An irrelevant attributes resistant approach to anomaly detection in high-dimensional space using a deep hypersphere structure

Cited by: 12
Authors
Zheng, Jian [1]
Qu, Hongchun [1, 2]
Li, Zhaoni [1, 3]
Li, Lin [1]
Tang, Xiaoming [2]
Affiliations
[1] Chongqing Univ Posts & Telecommun, Coll Comp Sci & Technol, Chongqing 400065, Peoples R China
[2] Chongqing Univ Posts & Telecommun, Coll Automat, Chongqing 400065, Peoples R China
[3] Qinghai Normal Univ, Coll Comp, Xining 810008, Qinghai, Peoples R China
Keywords
Anomaly; Deep networks; Hypersphere; High dimension; Irrelevant attributes; SUPPORT VECTOR MACHINE; GENERATIVE ADVERSARIAL NETWORKS; OUTLIER DETECTION; OPTIMAL TRANSPORT; BARYCENTERS; CLASSIFIER; POINT; TSVH; SVM
DOI
10.1016/j.asoc.2021.108301
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Detecting anomalies that exist in subspaces of a high-dimensional space is a grand challenge. Most existing state-of-the-art methods rely, implicitly or explicitly, on distances, yet the contrast (e.g., the distance) between data objects diminishes as dimensionality grows, so objects become more and more similar. Moreover, high-dimensional spaces may include many irrelevant attributes that mask anomalies (an attribute att is said to be irrelevant to a class if the prior probability for that class remains unchanged regardless of the value observed for att). Because anomalies can exist in any subspace, it is difficult to select the subspaces that highlight the relevant attributes from an exponentially large search space. To address this issue, we propose a hybrid method consisting of a deep network and a hypersphere to detect anomalies. The deep network serves as a feature extractor that captures low-dimensional features from the background space. Anomalies are then separated by the hypersphere in a feature space reconstructed from the probability distribution. To prevent irrelevant attributes from being mistaken for anomalies during anomaly mining, the upper bound on the number of anomalies is estimated using Chebyshev's theorem. Finally, the proposed method was verified on synthetic and real-world datasets. Experimental results show that it outperforms existing state-of-the-art detection methods in both the accuracy of mining anomalies and noise resistance. We find that feature extractors can improve the noise resistance of anomaly detection methods. In the feature space reconstructed from the probability distribution, anomalous features are easily distinguished from irrelevant features and normal features. We also show that irrelevant attributes increase the complexity of the feature space; by calculating the probability distribution of the data in the background space, layered features can be extracted to distinguish anomaly classes, normal classes, and irrelevant attribute classes. © 2021 Elsevier B.V. All rights reserved.
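
The hypersphere separation and the Chebyshev cap mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the deep feature extractor is omitted, the hypersphere is simply centered at the feature mean, and the names `chebyshev_anomaly_cap`, `flag_anomalies`, and the parameter `k` are hypothetical. The only fact it relies on is Chebyshev's inequality, P(|X - mu| >= k * sigma) <= 1 / k^2, which limits the number of points lying at least k standard deviations from the mean to n / k^2.

```python
import math

import numpy as np


def chebyshev_anomaly_cap(n_points: int, k: float) -> int:
    """Upper bound on how many of n_points can lie >= k std devs from the mean.

    Follows from Chebyshev's inequality: P(|X - mu| >= k * sigma) <= 1 / k**2.
    """
    if k <= 0:
        raise ValueError("k must be positive")
    return min(n_points, math.floor(n_points / k**2))


def flag_anomalies(features: np.ndarray, k: float = 3.0) -> np.ndarray:
    """Return indices of points that fall outside a mean-centered hypersphere.

    Assumptions (not taken from the paper): `features` are already
    low-dimensional, the center is the feature mean, and the radius is
    mean distance + k * population std of the distances.
    """
    center = features.mean(axis=0)
    dist = np.linalg.norm(features - center, axis=1)
    radius = dist.mean() + k * dist.std()
    return np.flatnonzero(dist > radius)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    normal = rng.normal(0.0, 1.0, size=(200, 5))  # synthetic normal class
    outliers = np.full((3, 5), 8.0)               # three planted anomalies
    data = np.vstack([normal, outliers])

    flagged = flag_anomalies(data, k=3.0)
    cap = chebyshev_anomaly_cap(len(data), k=3.0)
    print("flagged indices:", flagged)
    print("Chebyshev cap:", cap)
```

With the radius chosen this way, the number of flagged points can never exceed the cap. A detector that draws its boundary differently could compare its output against the cap to spot over-flagging, which is roughly the role the abstract assigns to the Chebyshev estimate; how the paper applies the bound in detail is not stated here.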
Pages: 20