Time-Series Classification with COTE: The Collective of Transformation-Based Ensembles

被引:297
作者
Bagnall, Anthony [1 ]
Lines, Jason [1 ]
Hills, Jon [1 ]
Bostrom, Aaron [1 ]
机构
[1] Univ E Anglia, Sch Comp Sci, Norwich NR4 7TJ, Norfolk, England
基金
英国工程与自然科学研究理事会;
关键词
Time series classification; ensemble; shapelet; FOREST;
D O I
10.1109/TKDE.2015.2416723
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, two ideas have been explored that lead to more accurate algorithms for time-series classification (TSC). First, it has been shown that the simplest way to gain improvement on TSC problems is to transform into an alternative data space where discriminatory features are more easily detected. Second, it was demonstrated that with a single data representation, improved accuracy can be achieved through simple ensemble schemes. We combine these two principles to test the hypothesis that forming a collective of ensembles of classifiers on different data transformations improves the accuracy of time-series classification. The collective contains classifiers constructed in the time, frequency, change, and shapelet transformation domains. For the time domain, we use a set of elastic distance measures. For the other domains, we use a range of standard classifiers. Through extensive experimentation on 72 datasets, including all of the 46 UCR datasets, we demonstrate that the simple collective formed by including all classifiers in one ensemble is significantly more accurate than any of its components and any other previously published TSC algorithm. We investigate alternative hierarchical collective structures and demonstrate the utility of the approach on a new problem involving classifying Caenorhabditis elegans mutant types.
引用
收藏
页码:2522 / 2535
页数:14
相关论文
共 40 条
[1]  
[Anonymous], DATA MINING KNOWL DI
[2]  
[Anonymous], PROGRAMS MACHINE LEA
[3]  
[Anonymous], CMPC1401 U E ANGL DE
[4]  
[Anonymous], 2012, P 2012 SIAM INT C DA
[5]  
Bagnall A., 2015, TIME SERIES CLASSIFI
[6]  
Bagnall A.J., 2004, KDD, P49
[7]   A Run Length Transformation for Discriminating Between Auto Regressive Time Series [J].
Bagnall, Anthony ;
Janacek, Gareth .
JOURNAL OF CLASSIFICATION, 2014, 31 (02) :154-178
[8]   A Bag-of-Features Framework to Classify Time Series [J].
Baydogan, Mustafa Gokce ;
Runger, George ;
Tuv, Eugene .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (11) :2796-2802
[9]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[10]   A dictionary of behavioral motifs reveals clusters of genes affecting Caenorhabditis elegans locomotion [J].
Brown, Andre E. X. ;
Yemini, Eviatar I. ;
Grundy, Laura J. ;
Jucikas, Tadas ;
Schafer, William R. .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2013, 110 (02) :791-796