Fast Time Series Classification Based on Infrequent Shapelets

被引:26
作者
He, Qing [1 ]
Dong, Zhi [1 ,2 ]
Zhuang, Fuzhen [1 ,2 ]
Shang, Tianfeng [1 ,2 ]
Shi, Zhongzhi [1 ]
机构
[1] Chinese Acad Sci, Inst Comp Technol, Key Lab Intelligent Informat Proc, Beijing 100190, Peoples R China
[2] Chinese Acad Sci, Grad Univ, Beijing 100190, Peoples R China
来源
2012 11TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2012), VOL 1 | 2012年
基金
中国国家自然科学基金;
关键词
Time series; Infrequent shapelet; Classification; Decision Tree;
D O I
10.1109/ICMLA.2012.44
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Time series shapelets are small and local time series subsequences which are in some sense maximally representative of a class. E. Keogh uses distance of the shapelet to classify objects. Even though shapelet classification can be interpretable and more accurate than many state-of-the-art classifiers, there is one main limitation of shapelets, i.e. shapelet classification training process is offline, and uses subsequence early abandon and admissible entropy pruning strategies, the time to compute is still significant. In this work, we address the later problem by introducing a novel algorithm that finds time series shapelet in significantly less time than the current methods by extracting infrequent time series shapelet candidates. Subsequences that are distinguishable are usually infrequent compared to other subsequences. The algorithm called ISDT (Infrequent Shapelet Decision Tree) uses infrequent shapelet candidates extracting to find shapelet. Experiments demonstrate the efficiency of ISDT algorithm on several benchmark time series datasets. The result shows that ISDT significantly outperforms the current shapelet algorithm.
引用
收藏
页码:215 / 219
页数:5
相关论文
共 7 条
[1]  
Brutlag J. D., 2000, P 14 USENIX C SYST A
[2]  
Keogh E., 2006, The ucr time series classification/clustering home-page
[3]  
Lee Chao-Hui, 2010, P COMP METH PROGR BI, P44
[4]  
Morik K, 2004, LECT NOTES ARTIF INT, V3202, P325
[5]  
Xing Z., 2011, P SIAM INT C DAT MIN
[6]  
Ye L., 2011, P DAT MIN KNOWL DISC, P149
[7]  
Ye LX, 2009, KDD-09: 15TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, P947