Random pairwise shapelets forest: an effective classifier for time series

被引:8
作者
Yuan, Jidong [1 ]
Shi, Mohan [2 ]
Wang, Zhihai [1 ]
Liu, Haiyang [1 ]
Li, Jinyang [3 ]
机构
[1] Beijing Jiaotong Univ, Sch Comp & Informat Technol, Beijing 100044, Peoples R China
[2] Beijing Jingdong 360 Degree E Commerce Co Ltd, Beijing, Peoples R China
[3] Univ Hong Kong, Dept Comp Sci, Pok Fu Lam, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
Time series classification; Pairwise shapelets; Random forest; Decomposed mean decrease impurity; REPRESENTATION; SIMILARITY; FEATURES;
D O I
10.1007/s10115-021-01630-z
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Shapelet is a discriminative subsequence of time series. An advanced shapelet-based method is to embed shapelet into the accurate and fast random forest. However, there are several limitations. First, random shapelet forest requires a large training cost for split threshold searching. Second, a single shapelet provides limited information for only one branch of the decision tree, resulting in insufficient accuracy. Third, the randomized ensemble decreases comprehensibility. For that, this paper presents Random Pairwise Shapelets Forest (RPSF). RPSF combines a pair of shapelets from different classes to construct random forest. It omits threshold searching to be more efficient, includes more information about each node of the forest to be more effective. Moreover, a discriminability measure, Decomposed Mean Decrease Impurity, is proposed to identify the influential region for each class. Extensive experiments show that RPSF is competitive compared with other methods, while it improves the training speed of shapelet-based forest.
引用
收藏
页码:143 / 174
页数:32
相关论文
共 52 条
[1]  
[Anonymous], 2001, SDM, DOI DOI 10.1137/1.9781611972719.1
[2]   The great time series classification bake off: a review and experimental evaluation of recent algorithmic advances [J].
Bagnall, Anthony ;
Lines, Jason ;
Bostrom, Aaron ;
Large, James ;
Keogh, Eamonn .
DATA MINING AND KNOWLEDGE DISCOVERY, 2017, 31 (03) :606-660
[3]   Time-Series Classification with COTE: The Collective of Transformation-Based Ensembles [J].
Bagnall, Anthony ;
Lines, Jason ;
Hills, Jon ;
Bostrom, Aaron .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2015, 27 (09) :2522-2535
[4]   CID: an efficient complexity-invariant distance for time series [J].
Batista, Gustavo E. A. P. A. ;
Keogh, Eamonn J. ;
Tataw, Oben Moses ;
de Souza, Vinicius M. A. .
DATA MINING AND KNOWLEDGE DISCOVERY, 2014, 28 (03) :634-669
[5]   Time series representation and similarity based on local autopatterns [J].
Baydogan, Mustafa Gokce ;
Runger, George .
DATA MINING AND KNOWLEDGE DISCOVERY, 2016, 30 (02) :476-509
[6]   A Bag-of-Features Framework to Classify Time Series [J].
Baydogan, Mustafa Gokce ;
Runger, George ;
Tuv, Eugene .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (11) :2796-2802
[7]   Binary Shapelet Transform for Multiclass Time Series Classification [J].
Bostrom, Aaron ;
Bagnall, Anthony .
BIG DATA ANALYTICS AND KNOWLEDGE DISCOVERY, 2015, 9263 :257-269
[8]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[9]  
Cetin MustafaS., 2015, Proceedings of the 2015 SIAM International Conference on Data Mining, P307
[10]  
Cui Z., 2016, arXiv