Outlier Detection for Multidimensional Time Series using Deep Neural Networks

被引:141
作者
Kieu, Tung [1 ]
Yang, Bin [1 ]
Jensen, Christian S. [1 ]
机构
[1] Aalborg Univ, Dept Comp Sci, Aalborg, Denmark
来源
2018 19TH IEEE INTERNATIONAL CONFERENCE ON MOBILE DATA MANAGEMENT (MDM 2018) | 2018年
关键词
D O I
10.1109/MDM.2018.00029
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Due to the continued digitization of industrial and societal processes, including the deployment of networked sensors, we are witnessing a rapid proliferation of time-ordered observations, known as time series. For example, the behavior of drivers can be captured by GPS or accelerometer as a time series of speeds, directions, and accelerations. We propose a framework for outlier detection in time series that, for example, can be used for identifying dangerous driving behavior and hazardous road locations. Specifically, we first propose a method that generates statistical features to enrich the feature space of raw time series. Next, we utilize an autoencoder to reconstruct the enriched time series. The autoencoder performs dimensionality reduction to capture, using a small feature space, the most representative features of the enriched time series. As a result, the reconstructed time series only capture representative features, whereas outliers often have non-representative features. Therefore, deviations of the enriched time series from the reconstructed time series can be taken as indicators of outliers. We propose and study autoencoders based on convolutional neural networks and long-short term memory neural networks. In addition, we show that embedding of contextual information into the framework has the potential to further improve the accuracy of identifying outliers. We report on empirical studies with multiple time series data sets, which offers insight into the design properties of the proposed framework, indicating that it is effective at detecting outliers.
引用
收藏
页码:125 / 134
页数:10
相关论文
共 48 条
[1]  
Agerholm N., 2015, ICTCT
[2]  
[Anonymous], 1997, Neural Computation
[3]  
[Anonymous], 2009, ICML
[4]   A Flexible System Architecture for Acquisition and Storage of Naturalistic Driving Data [J].
Bender, Asher ;
Ward, James R. ;
Worrall, Stewart ;
Moreyra, Marcelo L. ;
Gerling Konrad, Santiago ;
Masson, Favio ;
Nebot, Eduardo M. .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2016, 17 (06) :1748-1761
[5]   LOF: Identifying density-based local outliers [J].
Breunig, MM ;
Kriegel, HP ;
Ng, RT ;
Sander, J .
SIGMOD RECORD, 2000, 29 (02) :93-104
[6]  
Cabrera JBD, 2001, SIGMOD REC, V30, P25, DOI 10.1145/604264.604269
[7]   Path Cost Distribution Estimation Using Trajectory Data [J].
Dai, Jian ;
Yang, Bin ;
Guo, Chenjuan ;
Jensen, Christian S. ;
Hu, Jilin .
PROCEEDINGS OF THE VLDB ENDOWMENT, 2016, 10 (03) :85-96
[8]  
Dai J, 2015, PROC INT CONF DATA, P543, DOI 10.1109/ICDE.2015.7113313
[9]   Enabling Smart Transportation Systems: A Parallel Spatio-Temporal Database Approach [J].
Ding, Zhiming ;
Yang, Bin ;
Chi, Yuanying ;
Guo, Limin .
IEEE TRANSACTIONS ON COMPUTERS, 2016, 65 (05) :1377-1391
[10]   Network-Matched Trajectory-Based Moving-Object Database: Models and Applications [J].
Ding, Zhiming ;
Yang, Bin ;
Gueting, Ralf Hartmut ;
Li, Yaguang .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2015, 16 (04) :1918-1928