Deep probabilistic graphical modeling for robust multivariate time series anomaly detection with missing data

被引:18
作者
Yang, Jingyu [1 ]
Yue, Zuogong [1 ]
Yuan, Ye [1 ]
机构
[1] Huazhong Univ Sci & Technol, Sch Artificial Intelligence & Automation, Key Lab Image Proc & Intelligent Control, Minist Educ, Wuhan 430074, Peoples R China
关键词
Multivariate time series; Anomaly detection; Missing data; Probabilistic graphical model; Expectation maximization; NETWORK;
D O I
10.1016/j.ress.2023.109410
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Multivariate time series anomaly detection with missing data is one of the most pending issues for industrial monitoring. Due to scarcity of labeled anomalies, most advanced data-driven anomaly detection approaches fall in the unsupervised learning paradigm. As a premise in the presence of missing data, one needs to improve the data quality through data imputation with a separate model. Our concern lies in the consistency between data imputation and unsupervised learning for robust anomaly detection, regarding accurately discovering the spatiotemporal dependence among multiple variables over time. However, the existing practice tends to overlook this consistency and decouple the training process for these two closely linked tasks. This article novelly proposes a probabilistic multivariate time series anomaly detection framework that unifies data imputation and unsupervised learning. A deep probabilistic graphical model abbreviated SCNF is first devised for unsupervised density estimation. A tailored expectation maximization-based optimization scheme is then developed to achieve the joint training of data imputation and unsupervised learning with missing data. The efficacy is experimentally corroborated in several industrial applications, including chemical process, water treatment and network traffic. Briefly, the joint training framework enhances the AUROC of SCNF by averagely 6.34% for three applications under 50% data missing rate.
引用
收藏
页数:13
相关论文
共 66 条
[41]   Model-Based Techniques for Data Reliability in Wireless Sensor Networks [J].
Mukhopadhyay, Shoubhik ;
Schurgers, Curt ;
Panigrahi, Debashis ;
Dey, Sujit .
IEEE TRANSACTIONS ON MOBILE COMPUTING, 2009, 8 (04) :528-543
[42]   Bayesian inference for Common cause failure rate based on causal inference with missing data [J].
Nguyen, H. D. ;
Gouno, E. .
RELIABILITY ENGINEERING & SYSTEM SAFETY, 2020, 197
[43]   Deep Learning for Anomaly Detection: A Review [J].
Pang, Guansong ;
Shen, Chunhua ;
Cao, Longbing ;
Van den Hengel, Anton .
ACM COMPUTING SURVEYS, 2021, 54 (02)
[44]   Compressive Sensing-Based Missing-Data-Tolerant Fault Detection for Remote Condition Monitoring of Wind Turbines [J].
Peng, Yayu ;
Qiao, Wei ;
Qu, Liyan .
IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2022, 69 (02) :1937-1947
[45]  
RUBIN DB, 1976, BIOMETRIKA, V63, P581, DOI 10.1093/biomet/63.3.581
[46]  
Ruff L, 2018, PR MACH LEARN RES, V80
[47]   Denoising Autoencoder-Based Missing Value Imputation for Smart Meters [J].
Ryu, Seunghyoung ;
Kim, Minsoo ;
Kim, Hongseok .
IEEE ACCESS, 2020, 8 :40656-40666
[48]   Fault diagnosis based on extremely randomized trees in wireless sensor networks [J].
Saeed, Umer ;
Jan, Sana Ullah ;
Lee, Young-Doo ;
Koo, Insoo .
RELIABILITY ENGINEERING & SYSTEM SAFETY, 2021, 205
[49]  
Schölkopf B, 2000, ADV NEUR IN, V12, P582
[50]   Support vector data description [J].
Tax, DMJ ;
Duin, RPW .
MACHINE LEARNING, 2004, 54 (01) :45-66