Unsupervised Anomaly Detection Approach for Time-Series in Multi-Domains Using Deep Reconstruction Error

Times cited: 44
Authors
Amarbayasgalan, Tsatsral [1]
Van Huy Pham [2]
Theera-Umpon, Nipon [3,4]
Keun Ho Ryu [2,4]
Affiliations
[1] Chungbuk Natl Univ, Sch Elect & Comp Engn, Database & Bioinformat Lab, Cheongju 28644, South Korea
[2] Ton Duc Thang Univ, Fac Informat Technol, Ho Chi Minh City 700000, Vietnam
[3] Chiang Mai Univ, Fac Engn, Dept Elect Engn, Chiang Mai 50200, Thailand
[4] Chiang Mai Univ, Biomed Engn Inst, Chiang Mai 50200, Thailand
Source
SYMMETRY-BASEL | 2020 / Vol. 12 / Issue 08
Funding
National Research Foundation of Singapore;
Keywords
anomaly detection; density-based clustering; deep autoencoder; reconstruction error; unsupervised technique; SELECTION; PCA;
DOI
10.3390/sym12081251
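Chinese Library Classification (CLC) number
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences];
Discipline classification code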
07 ; 0710 ; 09 ;
Abstract
Automatic anomaly detection for time-series is critical in a variety of real-world domains such as fraud detection, fault diagnosis, and patient monitoring. Current anomaly detection methods correctly detect only a remarkably low proportion of the actual abnormalities. Furthermore, most datasets do not provide labels and therefore require unsupervised approaches. To address these problems, we propose a novel deep learning-based unsupervised anomaly detection approach (RE-ADTS) for time-series data, which is applicable to both batch and real-time anomaly detection. RE-ADTS consists of two modules: the time-series reconstructor and the anomaly detector. The time-series reconstructor module uses an autoregressive (AR) model to find an optimal window width and prepares subsequences of that width for further analysis. It then uses a deep autoencoder (AE) model to learn the data distribution, which is used to reconstruct a time-series close to the normal one. For anomalies, the reconstruction error (RE) is higher than that of normal data. This module outputs the RE and a compressed representation of each subsequence. The anomaly detector module then labels the corresponding time-series as normal or anomalous using an RE-based anomaly threshold. For batch anomaly detection, a combination of a density-based clustering technique and the anomaly threshold is employed. For real-time anomaly detection, only the anomaly threshold is used, without the clustering process. We conducted two types of experiments, covering batch and real-time anomaly detection, on a total of 52 publicly available time-series benchmark datasets. Experimental results show that the proposed RE-ADTS outperformed state-of-the-art publicly available anomaly detection methods in most cases.
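The threshold-only step described in the abstract can be illustrated with a minimal sketch. It is not the RE-ADTS implementation. The deep autoencoder is replaced by a much simpler stand-in (the element-wise median of all windows), the window width is fixed by hand rather than chosen by an AR model, and the threshold rule (mean plus k standard deviations of the errors) is an assumption, since the abstract does not state how the threshold is derived. All function names are hypothetical.

```python
import math
import statistics


def sliding_windows(series, width, stride):
    """Cut the series into fixed-width subsequences."""
    return [series[i:i + width]
            for i in range(0, len(series) - width + 1, stride)]


def fit_normal_profile(windows):
    """Stand-in for the autoencoder (assumption): the element-wise
    median of all windows serves as the 'normal' reconstruction."""
    return [statistics.median(column) for column in zip(*windows)]


def reconstruction_error(window, reconstruction):
    """Mean squared error between a window and its reconstruction."""
    return sum((a - b) ** 2 for a, b in zip(window, reconstruction)) / len(window)


def detect_anomalies(series, width, k=3.0):
    """Flag windows whose reconstruction error exceeds mean + k * std
    (an assumed threshold rule, not taken from the paper)."""
    windows = sliding_windows(series, width, stride=width)
    profile = fit_normal_profile(windows)
    errors = [reconstruction_error(w, profile) for w in windows]
    threshold = statistics.mean(errors) + k * statistics.pstdev(errors)
    flags = [e > threshold for e in errors]
    return errors, threshold, flags


# Toy data: 20 periods of a sine wave with a spike injected into period 7.
width = 16
series = [math.sin(2 * math.pi * (i % width) / width) for i in range(20 * width)]
series[7 * width + 4] += 5.0

errors, threshold, flags = detect_anomalies(series, width)
anomalous = [i for i, flagged in enumerate(flags) if flagged]
print(anomalous)
```

On this toy series only the window containing the injected spike has a non-zero reconstruction error, so it is the only one flagged. A learned reconstructor would replace `fit_normal_profile`, and the batch mode described above would additionally cluster the compressed representations.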
Pages: 22