Deep probabilistic graphical modeling for robust multivariate time series anomaly detection with missing data

被引:18
作者
Yang, Jingyu [1 ]
Yue, Zuogong [1 ]
Yuan, Ye [1 ]
机构
[1] Huazhong Univ Sci & Technol, Sch Artificial Intelligence & Automation, Key Lab Image Proc & Intelligent Control, Minist Educ, Wuhan 430074, Peoples R China
关键词
Multivariate time series; Anomaly detection; Missing data; Probabilistic graphical model; Expectation maximization; NETWORK;
D O I
10.1016/j.ress.2023.109410
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Multivariate time series anomaly detection with missing data is one of the most pending issues for industrial monitoring. Due to scarcity of labeled anomalies, most advanced data-driven anomaly detection approaches fall in the unsupervised learning paradigm. As a premise in the presence of missing data, one needs to improve the data quality through data imputation with a separate model. Our concern lies in the consistency between data imputation and unsupervised learning for robust anomaly detection, regarding accurately discovering the spatiotemporal dependence among multiple variables over time. However, the existing practice tends to overlook this consistency and decouple the training process for these two closely linked tasks. This article novelly proposes a probabilistic multivariate time series anomaly detection framework that unifies data imputation and unsupervised learning. A deep probabilistic graphical model abbreviated SCNF is first devised for unsupervised density estimation. A tailored expectation maximization-based optimization scheme is then developed to achieve the joint training of data imputation and unsupervised learning with missing data. The efficacy is experimentally corroborated in several industrial applications, including chemical process, water treatment and network traffic. Briefly, the joint training framework enhances the AUROC of SCNF by averagely 6.34% for three applications under 50% data missing rate.
引用
收藏
页数:13
相关论文
共 66 条
[1]  
Aggarwal CC, 2014, CH CRC DATA MIN KNOW, P1
[2]  
[Anonymous], 2017, Proc.Int. J. Hum. Factors Ergon
[3]  
Botes FH, 2017, 16 EUROPEAN C CYBER, P53
[4]   LOF: Identifying density-based local outliers [J].
Breunig, MM ;
Kriegel, HP ;
Ng, RT ;
Sander, J .
SIGMOD RECORD, 2000, 29 (02) :93-104
[5]   Bayesian Networks in Fault Diagnosis [J].
Cai, Baoping ;
Huang, Lei ;
Xie, Min .
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2017, 13 (05) :2227-2240
[6]  
Cao W, 2018, ADV NEUR IN, V31
[7]   Identification of Two-Dimensional ausal Systems With Missing Output Data via Expectation-Maximization Algorithm [J].
Chen, Jing ;
Huang, Biao ;
Ding, Feng .
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2021, 17 (08) :5185-5196
[8]  
Chen WC, 2022, PR MACH LEARN RES
[9]   A Multimodal Anomaly Detector for Robot-Assisted Feeding Using an LSTM-Based Variational Autoencoder [J].
Park, Daehyung ;
Hoshi, Yuuna ;
Kemp, Charles C. .
IEEE Robotics and Automation Letters, 2018, 3 (03) :1544-1551
[10]  
Dai E, 2022, INT C LEARNING REPRE