A Semi-supervised Approach for Industrial Anomaly Detection via Self-adaptive Clustering

被引:5
|
作者
Ma, Xiaoxue [1 ]
Keung, Jacky [1 ]
He, Pinjia [2 ]
Xiao, Yan [3 ]
Yu, Xiao [4 ]
Li, Yishu [1 ]
机构
[1] City Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R China
[2] Chinese Univ Hong Kong, Sch Data Sci, Shenzhen 518172, Peoples R China
[3] Sun Yat Sen Univ, Sch Cyber Sci & Technol, Shenzhen 518107, Peoples R China
[4] Wuhan Univ Technol, Sch Comp Sci & Artificial Intelligence, Wuhan 430070, Peoples R China
关键词
Intelligent anomaly detection; clustering; transformer; deep learning;
D O I
10.1109/TII.2023.3280246
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the rapid development of the Industrial Internet of Things (IIoT), log-based anomaly detection has become vital for smart industrial construction that has prompted many researchers to contribute. To detect anomalies based on log data, semi-supervised approaches stand out from supervised and unsupervised approaches because they only require a portion of labeled data and are relatively stable. However, the state-of-the-art semi-supervised approaches still suffer from two main problems: manual parameter setting and unsatisfactory performance with high false positives. We propose AdaLog, an integrated semi-supervised approach based on self-adaptive clustering, for industrial anomaly detection. In particular, the clustering step performs automatic label probability estimation by distinguishing twelve situations so that the label probability of each unlabeled data can be carefully calculated, leading to high accuracy. In addition, AdaLog employs a pre-trained model to learn contextual information comprehensively and a transformer-based model to detect anomalies efficiently. To alleviate class imbalance, an undersampling method is incorporated. The results on three popular datasets demonstrate that AdaLog significantly outperforms three state-of-the-art semi-supervised approaches by 17.8%-2,489.8% on average in terms of F1-score, and is even superior to two supervised approaches in most cases with average improvements of 10.9%-23.8%.
引用
收藏
页码:1687 / 1697
页数:11
相关论文
共 50 条
  • [41] A genetic algorithm approach for semi-supervised clustering
    Demiriz, Ayhan
    Bennett, Kristin P.
    Embrechts, Mark J.
    International Journal of Smart Engineering System Design, 2002, 4 (01): : 21 - 30
  • [42] Semi-supervised graph clustering: a kernel approach
    Brian Kulis
    Sugato Basu
    Inderjit Dhillon
    Raymond Mooney
    Machine Learning, 2009, 74 : 1 - 22
  • [43] Semi-supervised graph clustering: a kernel approach
    Kulis, Brian
    Basu, Sugato
    Dhillon, Inderjit
    Mooney, Raymond
    MACHINE LEARNING, 2009, 74 (01) : 1 - 22
  • [44] Semi-supervised anomaly traffic detection via multi-frequency reconstruction
    Lian, Xinglin
    Zheng, Yu
    Dang, Zhangxuan
    Peng, Chunlei
    Gao, Xinbo
    PATTERN RECOGNITION, 2025, 161
  • [45] Adaptive Betweenness Clustering for Semi-Supervised Domain Adaptation
    Li, Jichang
    Li, Guanbin
    Yu, Yizhou
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 5580 - 5594
  • [46] Adaptive and structured graph learning for semi-supervised clustering
    Chen, Long
    Zhong, Zhi
    INFORMATION PROCESSING & MANAGEMENT, 2022, 59 (04)
  • [47] A Semi-Supervised Bayesian Anomaly Detection Technique for Diagnosing Faults in Industrial IoT Systems
    De Vita, Fabrizio
    Bruneo, Dario
    Das, Sajal K.
    2021 IEEE INTERNATIONAL CONFERENCE ON SMART COMPUTING (SMARTCOMP 2021), 2021, : 31 - 38
  • [48] SAKMR: Industrial control anomaly detection based on semi-supervised hybrid deep learning
    Shijie Tang
    Yong Ding
    Meng Zhao
    Huiyong Wang
    Peer-to-Peer Networking and Applications, 2024, 17 : 612 - 623
  • [49] Adaptive safety-aware semi-supervised clustering
    Gan, Haitao
    Yang, Zhi
    Zhou, Ran
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 212
  • [50] An interpretable semi-supervised system for detecting cyberattacks using anomaly detection in industrial scenarios
    Gomez, Angel Luis Perales
    Maimo, Lorenzo Fernandez
    Celdran, Alberto Huertas
    Clemente, Felix J. Garcia
    IET INFORMATION SECURITY, 2023, 17 (04) : 553 - 566