Self-supervised anomaly pattern detection for large scale industrial data

Cited by: 14
Authors
Tang, Xiaoyue [1]
Zeng, Shan [1]
Yu, Fang [2]
Yu, Wei [2]
Sheng, Zhongyin [1]
Kang, Zhen [1]
Affiliations
[1] Wuhan Polytechn Univ, Sch Math & Comp Sci, Wuhan 430023, Hubei, Peoples R China
[2] Wuhan Univ, Sch Comp Sci, Wuhan 430072, Hubei, Peoples R China
Keywords
Data augmentation; Anomaly detection; Industrial data
DOI
10.1016/j.neucom.2022.09.069
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Detecting anomalies in large amounts of high-dimensional data is a challenging task. In the Industry 4.0 environment, large-scale, high-dimensional monitoring data exhibits complex patterns with high-level semantics. To provide enterprise-wide monitoring solutions, it is necessary to identify the high-level semantic patterns of anomalies in these data without splitting them. Existing end-to-end deep neural networks for time series can recognize high-level semantics in natural language or speech signals, but they are rarely applied to real-time anomaly detection in industrial data because of their high time cost. In this paper, we leverage self-supervised contrastive learning and propose a Composite Semantic Augmentation Encoder (CSAE) that provides an appropriate representation of industrial data and enables fast anomaly detection in industrial application environments. CSAE is a non-sequential deep neural network with two augmentation layers and a mandatory layer. The two data-augmentation layers expand the number of samples of both low-level and high-level semantic anomalies, which enables CSAE to discover diverse anomalies and improves its accuracy in high-level semantic pattern recognition. The mandatory layer compresses and preserves the temporal information in the industrial data to accelerate anomaly detection. As a non-sequential contrastive learning model, CSAE therefore converges faster in training than typical sequence models. Experimental results verify that CSAE achieves higher prediction accuracy with less time consumption than existing machine learning models on high-dimensional anomaly pattern detection tasks. (C) 2022 Elsevier B.V. All rights reserved.
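The abstract does not specify CSAE's layers, augmentations, or loss, so the following is only a generic illustration of the self-supervised contrastive idea it refers to, not the authors' method. It is a minimal sketch assuming an NT-Xent-style loss (as commonly used in contrastive learning), a hypothetical `jitter` augmentation that adds Gaussian noise to a sensor window, and an identity "encoder" (the raw window is used as its own embedding). All names, window shapes, and parameter values are assumptions made for illustration.

```python
import math
import random


def jitter(window, sigma, rng):
    """Hypothetical augmentation: add small Gaussian noise to each reading."""
    return [x + rng.gauss(0.0, sigma) for x in window]


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v + 1e-12)


def nt_xent(views_a, views_b, temperature=0.5):
    """NT-Xent loss; views_a[i] and views_b[i] are treated as a positive pair."""
    z = views_a + views_b
    n = len(views_a)
    total = 0.0
    for i in range(2 * n):
        pos = (i + n) % (2 * n)  # index of the paired view
        logits = [cosine(z[i], z[k]) / temperature
                  for k in range(2 * n) if k != i]
        pos_logit = cosine(z[i], z[pos]) / temperature
        # Numerically stable log-sum-exp over all other views.
        m = max(logits)
        log_denom = m + math.log(sum(math.exp(l - m) for l in logits))
        total += log_denom - pos_logit
    return total / (2 * n)


# Synthetic "sensor windows" (assumed data): four distinct periodic patterns.
rng = random.Random(0)
windows = [[1.0 if t % 4 == i else 0.1 for t in range(16)] for i in range(4)]

views_a = [jitter(w, 0.05, rng) for w in windows]
views_b = [jitter(w, 0.05, rng) for w in windows]

# Correctly paired views versus deliberately mismatched pairs.
loss_matched = nt_xent(views_a, views_b)
loss_shuffled = nt_xent(views_a, views_b[1:] + views_b[:1])

print(f"matched pairs:  {loss_matched:.4f}")
print(f"shuffled pairs: {loss_shuffled:.4f}")
```

In this toy setup the loss is lower when each window is paired with its own augmented view than when the pairs are mismatched, which is the signal a contrastive encoder is trained to exploit. A real system would replace the identity embedding with a learned network and use domain-appropriate augmentations.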
Pages: 1-12
Number of pages: 12