A Semi-supervised Approach for Industrial Anomaly Detection via Self-adaptive Clustering

Cited by: 5
Authors
Ma, Xiaoxue [1]
Keung, Jacky [1]
He, Pinjia [2]
Xiao, Yan [3]
Yu, Xiao [4]
Li, Yishu [1]
Affiliations
[1] City Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R China
[2] Chinese Univ Hong Kong, Sch Data Sci, Shenzhen 518172, Peoples R China
[3] Sun Yat Sen Univ, Sch Cyber Sci & Technol, Shenzhen 518107, Peoples R China
[4] Wuhan Univ Technol, Sch Comp Sci & Artificial Intelligence, Wuhan 430070, Peoples R China
Keywords
Intelligent anomaly detection; clustering; transformer; deep learning
DOI
10.1109/TII.2023.3280246
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
With the rapid development of the Industrial Internet of Things (IIoT), log-based anomaly detection has become vital for smart industrial construction, prompting many researchers to contribute. For detecting anomalies in log data, semi-supervised approaches stand out from supervised and unsupervised approaches because they require only a portion of the data to be labeled and are relatively stable. However, state-of-the-art semi-supervised approaches still suffer from two main problems: manual parameter setting and unsatisfactory performance with high false positives. We propose AdaLog, an integrated semi-supervised approach based on self-adaptive clustering, for industrial anomaly detection. In particular, the clustering step performs automatic label probability estimation by distinguishing twelve situations, so that the label probability of each unlabeled sample can be carefully calculated, leading to high accuracy. In addition, AdaLog employs a pre-trained model to learn contextual information comprehensively and a transformer-based model to detect anomalies efficiently. To alleviate class imbalance, an undersampling method is incorporated. Results on three popular datasets demonstrate that AdaLog significantly outperforms three state-of-the-art semi-supervised approaches by 17.8%-2,489.8% on average in terms of F1-score, and is even superior to two supervised approaches in most cases, with average improvements of 10.9%-23.8%.
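The abstract mentions undersampling to alleviate class imbalance and reports results as F1-scores, but it does not say which undersampling method AdaLog uses. As a minimal illustration only, the sketch below assumes plain random undersampling and the standard binary F1 definition; the function names `undersample` and `f1_score` are hypothetical and are not taken from the paper.

```python
import random


def undersample(samples, labels, seed=0):
    """Randomly drop samples so every class keeps as many items as the rarest class.

    Assumption: generic random undersampling, not necessarily AdaLog's method.
    """
    rng = random.Random(seed)
    by_class = {}
    for x, y in zip(samples, labels):
        by_class.setdefault(y, []).append(x)
    n_min = min(len(xs) for xs in by_class.values())
    balanced = []
    for y, xs in by_class.items():
        # Draw n_min items without replacement from each class.
        for x in rng.sample(xs, n_min):
            balanced.append((x, y))
    rng.shuffle(balanced)
    return balanced


def f1_score(y_true, y_pred, positive=1):
    """Harmonic mean of precision and recall for the positive (anomaly) class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    # Toy data: 5 anomalous log sequences (label 1) among 95 normal ones (label 0).
    samples = list(range(100))
    labels = [1] * 5 + [0] * 95
    balanced = undersample(samples, labels)
    print(len(balanced))
    print(f1_score([1, 1, 0, 0], [1, 0, 1, 0]))
```

Because F1 depends on both precision and recall for the anomaly class, a high false-positive rate, the problem the abstract attributes to earlier semi-supervised approaches, lowers precision and therefore lowers F1.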
Pages: 1687-1697
Number of pages: 11
Related papers
50 in total
  • [31] A Semi-Supervised Learning Approach for Network Anomaly Detection in Fog Computing
    Xu, Shengjie
    Qian, Yi
    Hu, Rose Qingyang
    ICC 2019 - 2019 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2019,
  • [32] Self-adaptive Local Fisher Discriminant Analysis for semi-supervised image recognition
    Liu, Zhonghua
    Wang, Jingyan
    Man, Jiaju
    Li, Yongping
    You, Xinge
    Wang, Chao
    INTERNATIONAL JOURNAL OF BIOMETRICS, 2012, 4 (04) : 338 - 356
  • [33] Self-adaptive Local Fisher Discriminant Analysis for semi-supervised image recognition
    Li, Y. (ypli@sinap.ac.cn), 1600, Inderscience Enterprises Ltd. (04):
  • [34] Semi-Supervised Isolation Forest for Anomaly Detection
    Stradiotti, Luca
    Perini, Lorenzo
    Davis, Jesse
    PROCEEDINGS OF THE 2024 SIAM INTERNATIONAL CONFERENCE ON DATA MINING, SDM, 2024, : 670 - 678
  • [35] Domain adaptive deep semi-supervised transfer learning for anomaly detection in OpenWiFi
    Kuili, Samhita
    Kantarci, Burak
    Chenier, Marcel
    Erol-Kantarci, Melike
    Herscovici, Bernard
    20TH INTERNATIONAL WIRELESS COMMUNICATIONS & MOBILE COMPUTING CONFERENCE, IWCMC 2024, 2024, : 774 - 779
  • [36] A Semi-supervised Fuzzy Clustering Approach via Modifications of the DBSCAN Algorithm
    Bedalli, Erind
    Mancellari, Enea
    Rada, Rexhep
    10TH INTERNATIONAL CONFERENCE ON THEORY AND APPLICATION OF SOFT COMPUTING, COMPUTING WITH WORDS AND PERCEPTIONS - ICSCCW-2019, 2020, 1095 : 229 - 236
  • [37] On semi-supervised clustering via multiobjective optimization
    Handl, Julia
    Knowles, Joshua
    GECCO 2006: GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, VOL 1 AND 2, 2006, : 1465 - +
  • [38] A Semi-supervised Clustering via Orthogonal Projection
    Cui Peng
    Zhang Ru-bo
    2009 ISECS INTERNATIONAL COLLOQUIUM ON COMPUTING, COMMUNICATION, CONTROL, AND MANAGEMENT, VOL II, 2009, : 356 - 359
  • [39] Simultaneously Learning Adaptive Neighbors and Clustering Label via Semi-Supervised NMF
    Cai, Hao
    Liu, Bo
    Xiao, Yanshan
    Lin, Luyue
    Zhou, Sujuan
    Che, Zhiyong
    PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND APPLICATION ENGINEERING (CSAE2019), 2019,
  • [40] FldtMatch: Improving Unbalanced Data Classification via Deep Semi-Supervised Learning with Self-Adaptive Dynamic Threshold
    Wu, Xin
    Xu, Jingjing
    Li, Kuan
    Yin, Jianping
    Xiong, Jian
    MATHEMATICS, 2025, 13 (03)