Robust and Explainable Autoencoders for Unsupervised Time Series Outlier Detection

被引:31
作者
Kieu, Tung [1 ]
Yang, Bin [1 ]
Guo, Chenjuan [1 ]
Jensen, Christian S. [1 ]
Zhao, Yan [1 ]
Huang, Feiteng [2 ]
Zheng, Kai [3 ]
机构
[1] Aalborg Univ, Aalborg, Denmark
[2] Huawei Cloud Database Innovat Lab, Beijing, Peoples R China
[3] Univ Elect Sci & Technol China, Chengdu, Peoples R China
来源
2022 IEEE 38TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2022) | 2022年
关键词
D O I
10.1109/ICDE53745.2022.00273
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Time series data occurs widely, and outlier detection is a fundamental problem in data mining, which has numerous applications. Existing autoencoder-based approaches deliver state-of-the-art performance on challenging real-world data but are vulnerable to outliers and exhibit low explainability. To address these two limitations, we propose robust and explainable unsupervised autoencoder frameworks that decompose an input time series into a clean time series and an outlier time series using autoencoders. Improved explainability is achieved because clean time series are better explained with easy-to-understand patterns such as trends and periodicities. We provide insight into this by means of a post-hoc explainability analysis and empirical studies. In addition, since outliers are separated from clean time series iteratively, our approach offers improved robustness to outliers, which in turn improves accuracy. We evaluate our approach on five real-world datasets and report improvements over the state-of-the-art approaches in terms of robustness and explainability.
引用
收藏
页码:3038 / 3050
页数:13
相关论文
共 55 条
[1]  
Aggarwal C. C., 2013, OUTLIER ANAL, DOI [DOI 10.1007/978-1-4614-6396-2, 10.1007/978-1-4614-6396-2]
[2]  
[Anonymous], 2001, The elements of statistical learning: data mining, inference and prediction
[3]  
Bin Yang S, 2021, PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, P3286
[4]  
Bin Z, 2019, PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P4433
[5]   Distributed optimization and statistical learning via the alternating direction method of multipliers [J].
Boyd S. ;
Parikh N. ;
Chu E. ;
Peleato B. ;
Eckstein J. .
Foundations and Trends in Machine Learning, 2010, 3 (01) :1-122
[6]   LOF: Identifying density-based local outliers [J].
Breunig, MM ;
Kriegel, HP ;
Ng, RT ;
Sander, J .
SIGMOD RECORD, 2000, 29 (02) :93-104
[7]  
Brown R. G., 1963, DOVER PHOENIX EDITIO
[8]   Unsupervised Time Series Outlier Detection with Diversity-Driven Convolutional Ensembles [J].
Campos, David ;
Kieu, Tung ;
Guo, Chenjuan ;
Huang, Feiteng ;
Zheng, Kai ;
Yang, Bin ;
Jensen, Christian S. .
PROCEEDINGS OF THE VLDB ENDOWMENT, 2021, 15 (03) :611-623
[9]   Decoding by linear programming [J].
Candes, EJ ;
Tao, T .
IEEE TRANSACTIONS ON INFORMATION THEORY, 2005, 51 (12) :4203-4215
[10]   Robust Principal Component Analysis? [J].
Candes, Emmanuel J. ;
Li, Xiaodong ;
Ma, Yi ;
Wright, John .
JOURNAL OF THE ACM, 2011, 58 (03)