FuseAD: Unsupervised Anomaly Detection in Streaming Sensors Data by Fusing Statistical and Deep Learning Models

被引:68
作者
Munir, Mohsin [1 ,2 ]
Siddiqui, Shoaib Ahmed [1 ,2 ]
Chattha, Muhammad Ali [1 ,2 ,3 ]
Dengel, Andreas [1 ,2 ]
Ahmed, Sheraz [1 ]
机构
[1] German Res Ctr Artificial Intelligence DFKI GmbH, D-67663 Kaiserslautern, Germany
[2] Tech Univ Kaiserslautern, Fachbereich Informat, D-67663 Kaiserslautern, Germany
[3] NUST, SEECS, Islamabad 44000, Pakistan
关键词
time-series analysis; anomaly detection; deep neural networks; statistical models; model fusion; sensor data; TIME; OUTLIERS;
D O I
10.3390/s19112451
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
The need for robust unsupervised anomaly detection in streaming data is increasing rapidly in the current era of smart devices, where enormous data are gathered from numerous sensors. These sensors record the internal state of a machine, the external environment, and the interaction of machines with other machines and humans. It is of prime importance to leverage this information in order to minimize downtime of machines, or even avoid downtime completely by constant monitoring. Since each device generates a different type of streaming data, it is normally the case that a specific kind of anomaly detection technique performs better than the others depending on the data type. For some types of data and use-cases, statistical anomaly detection techniques work better, whereas for others, deep learning-based techniques are preferred. In this paper, we present a novel anomaly detection technique, FuseAD, which takes advantage of both statistical and deep-learning-based approaches by fusing them together in a residual fashion. The obtained results show an increase in area under the curve (AUC) as compared to state-of-the-art anomaly detection methods when FuseAD is tested on a publicly available dataset (Yahoo Webscope benchmark). The obtained results advocate that this fusion-based technique can obtain the best of both worlds by combining their strengths and complementing their weaknesses. We also perform an ablation study to quantify the contribution of the individual components in FuseAD, i.e., the statistical ARIMA model as well as the deep-learning-based convolutional neural network (CNN) model.
引用
收藏
页数:15
相关论文
共 48 条
[1]  
Adams Ryan Prescott, 2007, Bayesian online changepoint detection
[2]  
Aggarwal C C., 2016, OUTLIER ANAL, P1
[3]   Unsupervised real-time anomaly detection for streaming data [J].
Ahmad, Subutai ;
Lavin, Alexander ;
Purdy, Scott ;
Agha, Zuha .
NEUROCOMPUTING, 2017, 262 :134-147
[4]   Green IoT: An Investigation on Energy Saving Practices for 2020 and Beyond [J].
Arshad, Rushan ;
Zahoor, Saman ;
Shah, Munam Ali ;
Wahid, Abdul ;
Yu, Hongnian .
IEEE ACCESS, 2017, 5 :15667-15681
[5]  
Beghi A, 2015, IEEE INTL CONF CONTR, P966, DOI 10.1109/CCA.2015.7320737
[6]   LOF: Identifying density-based local outliers [J].
Breunig, MM ;
Kriegel, HP ;
Ng, RT ;
Sander, J .
SIGMOD RECORD, 2000, 29 (02) :93-104
[7]   DeepAD: A Generic Framework Based on Deep Learning for Time Series Anomaly Detection [J].
Buda, Teodora Sandra ;
Caglayan, Bora ;
Assem, Haytham .
ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2018, PT I, 2018, 10937 :577-588
[8]   Data analytics for occupancy pattern learning to reduce the energy consumption of HVAC systems in office buildings [J].
Capozzoli, Alfonso ;
Piscitelli, Marco Savino ;
Gorrino, Alice ;
Ballarini, Ilaria ;
Corrado, Vincenzo .
SUSTAINABLE CITIES AND SOCIETY, 2017, 35 :191-208
[9]  
Chalapathy R, 2019, ARXIV190103407
[10]  
Chauhan V, 2015, 2015 INTERNATIONAL CONFERENCE ON RECENT DEVELOPMENTS IN CONTROL, AUTOMATION AND POWER ENGINEERING (RDCAPE), P10, DOI 10.1109/RDCAPE.2015.7281361