Research of Detection Algorithm for Time Series Abnormal Subsequence

被引:3
作者
Zhang, Chunkai [1 ]
Liu, Haodong [1 ]
Yin, Ao [1 ]
机构
[1] Harbin Inst Technol, Shenzhen Grad Sch, Dept Comp Sci & Technol, Shenzhen, Peoples R China
来源
DATA SCIENCE, PT 1 | 2017年 / 727卷
基金
国家高技术研究发展计划(863计划);
关键词
Time series representation; Abnormal subsequence; K nearest neighbor;
D O I
10.1007/978-981-10-6385-5_2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The recent advancements in sensor technology have made it possible to collect enormous amounts of data in real time. How to find out unusual pattern from time series data plays a very important role in data mining. In this paper, we focus on the abnormal subsequence detection. The original definition of discord subsequences is defective for some kind of time series, in this paper we give a more robust definition which is based on the k nearest neighbors. We also donate a novel method for time series representation, it has better performance than traditional methods (like PAA/SAX) to represent the characteristic of some special time series. To speed up the process of abnormal subsequence detection, we used the clustering method to optimize the outer loop ordering and early abandon subsequence which is impossible to be abnormal. The experiment results validate that the algorithm is correct and has a high efficiency.
引用
收藏
页码:12 / 26
页数:15
相关论文
共 16 条
[1]  
Bentley JL, 1997, PROCEEDINGS OF THE EIGHTH ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, P360
[2]   Efficient time series matching by wavelets [J].
Chan, KP ;
Fu, AWC .
15TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 1999, :126-133
[3]  
Chen ZX, 2003, LECT NOTES COMPUT SC, V2737, P234
[4]  
Duchene F, 2004, MINING HETEROGENEOUS
[5]  
Faloutsos C., 1994, SIGMOD Record, V23, P419, DOI 10.1145/191843.191925
[6]  
Hawkins D.M., 1980, Identification of Outliers. Monographs on Applied Probability and Statistics, DOI [DOI 10.1007/978-94-015-3994-4_1, 10.1007/978-94-015-3994-4, 10.1007/978-94-015-3994-41, DOI 10.1007/978-94-015-3994-4]
[7]  
Izakian H, 2013, 2013 JOINT IFSA WORL
[8]   HOT SAX: Efficiently finding the most unusual time series subsequence [J].
Keogh, E ;
Lin, J ;
Fu, AD .
Fifth IEEE International Conference on Data Mining, Proceedings, 2005, :226-233
[9]   Locally adaptive dimensionality reduction for indexing large time series databases [J].
Keogh, E ;
Chakrabarti, K ;
Mehrotra, S ;
Pazzani, M .
SIGMOD RECORD, 2001, 30 (02) :151-162
[10]   Finding time series discord based on bit representation clustering [J].
Li, Guiling ;
Braysy, Olli ;
Jiang, Liangxiao ;
Wu, Zongda ;
Wang, Yuanzhen .
KNOWLEDGE-BASED SYSTEMS, 2013, 54 :243-254