Semi-Supervised Learning-Based Method for Unknown Anomaly Detection

被引:0
作者
Cheng, Yudong [1 ]
Zhou, Fang [1 ]
机构
[1] School of Data Science and Engineering, East China Normal University, Shanghai
来源
Jisuanji Yanjiu yu Fazhan/Computer Research and Development | 2024年 / 61卷 / 07期
关键词
anomaly detection; out-of-distribution detection; semi-supervised learning; tabular data; unseen anomalies;
D O I
10.7544/issn1000-1239.202330627
中图分类号
学科分类号
摘要
Anomaly detection aims to identify data that deviates from expected behavior patterns. Despite the potential of semi-supervised anomaly detection methods in enhancing detection accuracy by utilizing a limited amount of labeled data as prior knowledge, the labeled anomalies (i.e., seen anomalies) acquired are unlikely to cover all types of anomalies. In real-world scenarios, novel types of anomalies (i.e., unseen anomalies) often emerge, which may exhibit distinct characteristics from the known anomalies, thereby rendering them challenging to detect using existing semi-supervised anomaly detection methods. To address this issue, we propose a semi-supervised unknown anomaly detection (SSUAD) method, aimed at simultaneously identifying both known and unseen anomalies. This method utilizes a closed-set classifier for the classification of known anomalies and normal instances, and an unknown anomaly detector for the detection of unseen anomalies. Moreover, considering the extreme imbalance between anomalies and normal instances in the anomaly detection scenario, we design an effective data augmentation strategy to increase the number of anomaly samples. Experiments are conducted on UNSW-NB15 and KDDCUP99 datasets, as well as a real-world dataset SQB. The results reveal that, compared with existing anomaly detection methods, SSUAD exhibits significant improvement in the anomaly detection performance metrics AUC-ROC and AUC-PR, thereby verifying the effectiveness and reasonableness of the proposed method. © 2024 Science Press. All rights reserved.
引用
收藏
页码:1670 / 1680
页数:10
相关论文
共 28 条
  • [1] Dal Pozzolo A, Boracchi G, Caelen O, Et al., Credit card fraud detection: A realistic modeling and a novel learning strategy[J], IEEE Transactions on Neural Networks and Learning Systems, 29, 8, pp. 3784-3797, (2017)
  • [2] Liao H J, Lin C H R, Lin Y C, Et al., Intrusion detection system: A comprehensive review[J], Journal of Network and Computer Applications, 36, 1, (2013)
  • [3] Fernandes G, Rodrigues J J P C, Carvalho L F, Et al., A comprehensive survey on network anomaly detection[J], Telecommunication Systems, 70, 3, (2019)
  • [4] Guansong Pang, Chunhua Shen, Longbing Cao, Et al., Deep learning for anomaly detection: A review[J], ACM Computing Surveys, 54, 2, pp. 1-38, (2021)
  • [5] Ding Kaize, Zhou Qinghai, Tong Hanghang, Et al., Few-shot network anomaly detection via cross-network meta-learning[C], Proc of the 30th Int Conf on World Wide Web, pp. 2448-2456, (2021)
  • [6] Pang Guansong, Cao Longbing, Chen Ling, Et al., Learning representations of ultrahigh-dimensional data for random distance-based outlier detection[C], Proc of the 24th Int Conf on Knowledge Discovery and Data Mining, pp. 2041-2050, (2018)
  • [7] Huang Junkai, Fang Chaowei, Chen Weikai, Et al., Trash to treasure: Harvesting OOD data with cross-modal matching for open-set semi-supervised learning, Proc of the 18th IEEE/CVF Int Conf on Computer Vision(ICCV), pp. 8310-8319, (2021)
  • [8] Li C L, Sohn K, Yoon J, Et al., Cutpaste: Self-supervised learning for anomaly detection and localization[C], Proc of the IEEE/CVF Conf on Computer Vision and Pattern Recognition, pp. 9664-9674, (2021)
  • [9] Meng Debin, Peng Xiaojiang, Wang Kai, Et al., Frame attention networks for facial expression recognition in videos[C], Proc of the 28th IEEE Int Conf on Image Processing (ICIP), pp. 3866-3870, (2019)
  • [10] Campos G O, Zimek A, Sander J, Et al., On the evaluation of unsupervised outlier detection: Measures, datasets, and an empirical study[J], Data Mining and Knowledge Discovery, 30, 4, pp. 891-927, (2016)