A Semi-supervised Approach for Industrial Anomaly Detection via Self-adaptive Clustering

Cited by: 5
Authors
Ma, Xiaoxue [1 ]
Keung, Jacky [1 ]
He, Pinjia [2 ]
Xiao, Yan [3 ]
Yu, Xiao [4 ]
Li, Yishu [1 ]
Affiliations
[1] City Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R China
[2] Chinese Univ Hong Kong, Sch Data Sci, Shenzhen 518172, Peoples R China
[3] Sun Yat Sen Univ, Sch Cyber Sci & Technol, Shenzhen 518107, Peoples R China
[4] Wuhan Univ Technol, Sch Comp Sci & Artificial Intelligence, Wuhan 430070, Peoples R China
Keywords
Intelligent anomaly detection; clustering; transformer; deep learning;
DOI
10.1109/TII.2023.3280246
Chinese Library Classification number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812 ;
Abstract
With the rapid development of the Industrial Internet of Things (IIoT), log-based anomaly detection has become vital for smart industrial construction and has attracted considerable research attention. For detecting anomalies in log data, semi-supervised approaches stand out from supervised and unsupervised ones because they require only a portion of the data to be labeled and are relatively stable. However, state-of-the-art semi-supervised approaches still suffer from two main problems: manual parameter setting and unsatisfactory performance with many false positives. We propose AdaLog, an integrated semi-supervised approach based on self-adaptive clustering, for industrial anomaly detection. In particular, the clustering step performs automatic label probability estimation by distinguishing twelve situations, so that the label probability of each unlabeled sample can be carefully calculated, leading to high accuracy. In addition, AdaLog employs a pre-trained model to learn contextual information comprehensively and a transformer-based model to detect anomalies efficiently. To alleviate class imbalance, an undersampling method is incorporated. Results on three popular datasets demonstrate that AdaLog significantly outperforms three state-of-the-art semi-supervised approaches by 17.8%-2,489.8% on average in terms of F1-score, and is even superior to two supervised approaches in most cases, with average improvements of 10.9%-23.8%.
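The abstract mentions undersampling as the remedy for class imbalance but does not say which variant AdaLog uses. The sketch below shows plain random undersampling only as an illustration of the general idea, not as the authors' method. The function name `random_undersample`, the toy sequence names, and the 2-to-8 class split are all hypothetical.

```python
import random
from collections import Counter


def random_undersample(samples, labels, seed=0):
    """Randomly drop samples so every class is as small as the rarest class.

    Illustrative assumption: this is generic random undersampling, not the
    specific procedure used by AdaLog, which the abstract does not describe.
    """
    rng = random.Random(seed)

    # Group the samples by their class label.
    by_class = {}
    for sample, label in zip(samples, labels):
        by_class.setdefault(label, []).append(sample)

    # Every class is reduced to the size of the smallest class.
    target = min(len(group) for group in by_class.values())

    balanced = []
    for label, group in by_class.items():
        kept = rng.sample(group, target)  # sampling without replacement
        balanced.extend((sample, label) for sample in kept)

    # Shuffle so the classes are not stored in contiguous runs.
    rng.shuffle(balanced)
    xs = [sample for sample, _ in balanced]
    ys = [label for _, label in balanced]
    return xs, ys


# Hypothetical toy data: 2 anomalous (1) and 8 normal (0) log sequences.
log_sequences = [f"seq_{i}" for i in range(10)]
labels = [1 if i < 2 else 0 for i in range(10)]

balanced_x, balanced_y = random_undersample(log_sequences, labels)
print(Counter(balanced_y))
```

After the call, both classes have two samples each: the two anomalous sequences are always kept, and two of the eight normal sequences are chosen at random. Fixing `seed` makes the selection repeatable across runs.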
Pages: 1687-1697
Number of pages: 11
Related papers
50 in total
  • [1] Industrial Pumps Anomaly Detection and Semi-supervised Anomalies Labeling Through a Cascaded Clustering Approach
    Duan, Qiang
    Jiang, Zhihang
    Li, Wei
    Jiang, Kai
    Jin, Weiduo
    Yu, Ling
    Jiang, Mengmeng
    Zhao, Jing
    Li, Rui
    Zhang, Hui
    PROCEEDINGS OF SEVENTH INTERNATIONAL CONGRESS ON INFORMATION AND COMMUNICATION TECHNOLOGY, ICICT 2022, VOL. 2, 2023, 448 : 363 - 373
  • [2] Network anomaly detection based on semi-supervised clustering
    Wei Xiaotao
    Huang Houkuan
    Tian Shengfeng
    NEW ADVANCES IN SIMULATION, MODELLING AND OPTIMIZATION (SMO '07), 2007, : 440 - +
  • [3] Semi-Supervised Anomaly Detection Via Neural Process
    Zhou, Fan
    Wang, Guanyu
    Zhang, Kunpeng
    Liu, Siyuan
    Zhong, Ting
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (10) : 10423 - 10435
  • [4] Semi-Supervised Statistical Approach for Network Anomaly Detection
    Aissa, Naila Belhadj
    Guerroumia, Mohamed
    7TH INTERNATIONAL CONFERENCE ON AMBIENT SYSTEMS, NETWORKS AND TECHNOLOGIES (ANT 2016) / THE 6TH INTERNATIONAL CONFERENCE ON SUSTAINABLE ENERGY INFORMATION TECHNOLOGY (SEIT-2016) / AFFILIATED WORKSHOPS, 2016, 83 : 1090 - 1095
  • [5] SSCL: Semi-supervised Contrastive Learning for Industrial Anomaly Detection
    Cai, Wei
    Gao, Jiechao
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT IV, 2024, 14428 : 100 - 112
  • [6] A Semi-supervised Self-Adaptive Classifier over Opinionated Streams
    Zimmermann, Max
    Ntoutsi, Eirini
    Spiliopoulou, Myra
    2014 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOP (ICDMW), 2014, : 425 - 432
  • [7] Self-adaptive and dynamic clustering for online anomaly detection
    Lee, Seungmin
    Kim, Gisung
    Kim, Sehun
    EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (12) : 14891 - 14898
  • [8] Incremental Clustering for Semi-Supervised Anomaly Detection applied on Log Data
    Wurzenberger, Markus
    Skopik, Florian
    Landauer, Max
    Greitbauer, Philipp
    Fiedler, Roman
    Kastner, Wolfgang
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON AVAILABILITY, RELIABILITY AND SECURITY (ARES 2017), 2017,
  • [9] An adaptive semi-supervised clustering approach via multiple density-based information
    Yang, Yun
    Li, Zongze
    Wang, Wei
    Tao, Dapeng
    NEUROCOMPUTING, 2017, 257 : 193 - 205
  • [10] GANomaly: Semi-supervised Anomaly Detection via Adversarial Training
    Akcay, Samet
    Atapour-Abarghouei, Amir
    Breckon, Toby P.
    COMPUTER VISION - ACCV 2018, PT III, 2019, 11363 : 622 - 637