A Semi-supervised Approach for Industrial Anomaly Detection via Self-adaptive Clustering

被引：5

作者：

Ma, Xiaoxue ^{[1
]}

Keung, Jacky ^{[1
]}

He, Pinjia ^{[2
]}

Xiao, Yan ^{[3
]}

Yu, Xiao ^{[4
]}

Li, Yishu ^{[1
]}

机构：

[1] City Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R China

[2] Chinese Univ Hong Kong, Sch Data Sci, Shenzhen 518172, Peoples R China

[3] Sun Yat Sen Univ, Sch Cyber Sci & Technol, Shenzhen 518107, Peoples R China

[4] Wuhan Univ Technol, Sch Comp Sci & Artificial Intelligence, Wuhan 430070, Peoples R China

来源：

IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS | 2024年 / 20卷 / 02期

关键词：

Intelligent anomaly detection; clustering; transformer; deep learning;

D O I：

10.1109/TII.2023.3280246

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

With the rapid development of the Industrial Internet of Things (IIoT), log-based anomaly detection has become vital for smart industrial construction that has prompted many researchers to contribute. To detect anomalies based on log data, semi-supervised approaches stand out from supervised and unsupervised approaches because they only require a portion of labeled data and are relatively stable. However, the state-of-the-art semi-supervised approaches still suffer from two main problems: manual parameter setting and unsatisfactory performance with high false positives. We propose AdaLog, an integrated semi-supervised approach based on self-adaptive clustering, for industrial anomaly detection. In particular, the clustering step performs automatic label probability estimation by distinguishing twelve situations so that the label probability of each unlabeled data can be carefully calculated, leading to high accuracy. In addition, AdaLog employs a pre-trained model to learn contextual information comprehensively and a transformer-based model to detect anomalies efficiently. To alleviate class imbalance, an undersampling method is incorporated. The results on three popular datasets demonstrate that AdaLog significantly outperforms three state-of-the-art semi-supervised approaches by 17.8%-2,489.8% on average in terms of F1-score, and is even superior to two supervised approaches in most cases with average improvements of 10.9%-23.8%.

引用

页码：1687 / 1697

页数：11

共 50 条

[41] A genetic algorithm approach for semi-supervised clustering
Demiriz, Ayhan
Bennett, Kristin P.
Embrechts, Mark J.
International Journal of Smart Engineering System Design, 2002, 4 (01): : 21 - 30
[42] Semi-supervised graph clustering: a kernel approach
Brian Kulis
Sugato Basu
Inderjit Dhillon
Raymond Mooney
Machine Learning, 2009, 74 : 1 - 22
[43] Semi-supervised graph clustering: a kernel approach
Kulis, Brian
Basu, Sugato
Dhillon, Inderjit
Mooney, Raymond
MACHINE LEARNING, 2009, 74 (01) : 1 - 22
[44] Semi-supervised anomaly traffic detection via multi-frequency reconstruction
Lian, Xinglin
Zheng, Yu
Dang, Zhangxuan
Peng, Chunlei
Gao, Xinbo
PATTERN RECOGNITION, 2025, 161
[45] Adaptive Betweenness Clustering for Semi-Supervised Domain Adaptation
Li, Jichang
Li, Guanbin
Yu, Yizhou
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 5580 - 5594
[46] Adaptive and structured graph learning for semi-supervised clustering
Chen, Long
Zhong, Zhi
INFORMATION PROCESSING & MANAGEMENT, 2022, 59 (04)
[47] A Semi-Supervised Bayesian Anomaly Detection Technique for Diagnosing Faults in Industrial IoT Systems
De Vita, Fabrizio
Bruneo, Dario
Das, Sajal K.
2021 IEEE INTERNATIONAL CONFERENCE ON SMART COMPUTING (SMARTCOMP 2021), 2021, : 31 - 38
[48] SAKMR: Industrial control anomaly detection based on semi-supervised hybrid deep learning
Shijie Tang
Yong Ding
Meng Zhao
Huiyong Wang
Peer-to-Peer Networking and Applications, 2024, 17 : 612 - 623
[49] Adaptive safety-aware semi-supervised clustering
Gan, Haitao
Yang, Zhi
Zhou, Ran
EXPERT SYSTEMS WITH APPLICATIONS, 2023, 212
[50] An interpretable semi-supervised system for detecting cyberattacks using anomaly detection in industrial scenarios
Gomez, Angel Luis Perales
Maimo, Lorenzo Fernandez
Celdran, Alberto Huertas
Clemente, Felix J. Garcia
IET INFORMATION SECURITY, 2023, 17 (04) : 553 - 566

← 1 2 3 4 5 →