The great multivariate time series classification bake off: a review and experimental evaluation of recent algorithmic advances

Cited by: 290
Authors
Ruiz, Alejandro Pasos [1]
Flynn, Michael [1]
Large, James [1]
Middlehurst, Matthew [1]
Bagnall, Anthony [1]
Affiliations
[1] Univ East Anglia, Sch Comp Sci, Norwich, Norfolk, England
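Funding
UK Engineering and Physical Sciences Research Council (EPSRC); UK Biotechnology and Biological Sciences Research Council (BBSRC);
Keywords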
Time series classification; Evaluating classifiers; Multivariate time series; UEA archive; REPRESENTATION;
DOI
10.1007/s10618-020-00727-3
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Time Series Classification (TSC) involves building predictive models for a discrete target variable from ordered, real-valued attributes. Over recent years, a new set of TSC algorithms has been developed that significantly improves on the previous state of the art. The main focus has been on univariate TSC, i.e. the problem where each case has a single series and a class label. In reality, it is more common to encounter multivariate TSC (MTSC) problems where the time series for a single case has multiple dimensions. Despite this, much less consideration has been given to MTSC than to the univariate case. The UCR archive has provided a valuable resource for univariate TSC, and the lack of a standard set of test problems may explain why there has been less focus on MTSC. The UEA archive of 30 MTSC problems released in 2018 has made comparison of algorithms easier. We review recently proposed bespoke MTSC algorithms based on deep learning, shapelets and bag-of-words approaches. If an algorithm cannot naturally handle multivariate data, the simplest approach to adapt a univariate classifier to MTSC is to ensemble it over the multivariate dimensions. We compare the bespoke algorithms to these dimension-independent approaches on the 26 of the 30 MTSC archive problems where the data are all of equal length. We demonstrate that four classifiers are significantly more accurate than the benchmark dynamic time warping algorithm and that one of these recently proposed classifiers, ROCKET, achieves significant improvement on the archive datasets in at least an order of magnitude less time than the other three.
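The dimension-independent adaptation mentioned in the abstract can be illustrated with a minimal sketch. It assumes equal-length series stored as an array of shape (cases, dimensions, length), uses a 1-nearest-neighbour classifier with full-window dynamic time warping as the univariate base learner, and combines the per-dimension members by summing their class probabilities. The class names `OneNNDTW` and `DimensionEnsemble`, the helper `dtw_distance`, and the combination rule are all illustrative assumptions; the abstract does not specify how the paper builds or combines its ensembles.

```python
import numpy as np


def dtw_distance(a, b):
    """Full-window dynamic time warping distance between two 1-D series."""
    n, m = len(a), len(b)
    cost = np.full((n + 1, m + 1), np.inf)
    cost[0, 0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = (a[i - 1] - b[j - 1]) ** 2
            # Extend the cheapest of the three admissible warping moves.
            cost[i, j] = d + min(cost[i - 1, j], cost[i, j - 1], cost[i - 1, j - 1])
    return float(np.sqrt(cost[n, m]))


class OneNNDTW:
    """Univariate 1-NN classifier with DTW (hypothetical base learner)."""

    def fit(self, X, y):
        self.X_ = np.asarray(X, dtype=float)  # shape: (cases, length)
        self.y_ = np.asarray(y)
        self.classes_ = np.unique(self.y_)
        return self

    def predict_proba(self, X):
        X = np.asarray(X, dtype=float)
        proba = np.zeros((len(X), len(self.classes_)))
        for i, series in enumerate(X):
            dists = [dtw_distance(series, train) for train in self.X_]
            label = self.y_[int(np.argmin(dists))]
            # One-hot "probability" on the nearest neighbour's class.
            proba[i, np.searchsorted(self.classes_, label)] = 1.0
        return proba


class DimensionEnsemble:
    """One univariate classifier per dimension; class probabilities are summed."""

    def __init__(self, make_base):
        self.make_base = make_base  # zero-argument factory for a base learner

    def fit(self, X, y):
        X = np.asarray(X, dtype=float)  # shape: (cases, dimensions, length)
        self.classes_ = np.unique(y)
        self.members_ = [
            self.make_base().fit(X[:, d, :], y) for d in range(X.shape[1])
        ]
        return self

    def predict(self, X):
        X = np.asarray(X, dtype=float)
        proba = sum(m.predict_proba(X[:, d, :]) for d, m in enumerate(self.members_))
        return self.classes_[np.argmax(proba, axis=1)]


# Toy usage: two cases, two dimensions, series of length four.
X_train = np.array([[[0, 1, 2, 3], [0, 0, 0, 0]],
                    [[3, 2, 1, 0], [1, 1, 1, 1]]])
y_train = np.array(["up", "down"])
clf = DimensionEnsemble(OneNNDTW).fit(X_train, y_train)
print(clf.predict(np.array([[[0, 1, 2, 2], [0, 0, 0, 1]]])))
```

Because each member sees only one dimension, this design ignores any interaction between dimensions; that limitation is what separates it from the bespoke multivariate algorithms the abstract compares it against.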
Pages: 401-449
Page count: 49