The great multivariate time series classification bake off: a review and experimental evaluation of recent algorithmic advances

被引:290
作者
Ruiz, Alejandro Pasos [1 ]
Flynn, Michael [1 ]
Large, James [1 ]
Middlehurst, Matthew [1 ]
Bagnall, Anthony [1 ]
机构
[1] Univ East Anglia, Sch Comp Sci, Norwich, Norfolk, England
基金
英国工程与自然科学研究理事会; 英国生物技术与生命科学研究理事会;
关键词
Time series classification; Evaluating classifiers; Multivariate time series; UEA archive; REPRESENTATION;
D O I
10.1007/s10618-020-00727-3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Time Series Classification (TSC) involves building predictive models for a discrete target variable from ordered, real valued, attributes. Over recent years, a new set of TSC algorithms have been developed which have made significant improvement over the previous state of the art. The main focus has been on univariate TSC, i.e. the problem where each case has a single series and a class label. In reality, it is more common to encounter multivariate TSC (MTSC) problems where the time series for a single case has multiple dimensions. Despite this, much less consideration has been given to MTSC than the univariate case. The UCR archive has provided a valuable resource for univariate TSC, and the lack of a standard set of test problems may explain why there has been less focus on MTSC. The UEA archive of 30 MTSC problems released in 2018 has made comparison of algorithms easier. We review recently proposed bespoke MTSC algorithms based on deep learning, shapelets and bag of words approaches. If an algorithm cannot naturally handle multivariate data, the simplest approach to adapt a univariate classifier to MTSC is to ensemble it over the multivariate dimensions. We compare the bespoke algorithms to these dimension independent approaches on the 26 of the 30 MTSC archive problems where the data are all of equal length. We demonstrate that four classifiers are significantly more accurate than the benchmark dynamic time warping algorithm and that one of these recently proposed classifiers, ROCKET, achieves significant improvement on the archive datasets in at least an order of magnitude less time than the other three.
引用
收藏
页码:401 / 449
页数:49
相关论文
共 53 条
[31]   Experiencing SAX: a novel symbolic representation of time series [J].
Lin, Jessica ;
Keogh, Eamonn ;
Wei, Li ;
Lonardi, Stefano .
DATA MINING AND KNOWLEDGE DISCOVERY, 2007, 15 (02) :107-144
[32]   Time Series Classification with HIVE-COTE: The Hierarchical Vote Collective of Transformation-Based Ensembles [J].
Lines, Jason ;
Taylor, Sarah ;
Bagnall, Anthony .
ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2018, 12 (05)
[33]   uWave: Accelerometer-based personalized gesture recognition and its applications [J].
Liu, Jiayang ;
Zhong, Lin ;
Wickramasuriya, Jehan ;
Vasudevan, Venu .
PERVASIVE AND MOBILE COMPUTING, 2009, 5 (06) :657-675
[34]  
Loning M., 2019, Sktime: A unified interface for machine learning with time series
[35]   catch22: CAnonical Time-series CHaracteristics Selected through highly comparative time-series analysis [J].
Lubba, Carl H. ;
Sethi, Sarab S. ;
Knaute, Philip ;
Schultz, Simon R. ;
Fulcher, Ben D. ;
Jones, Nick S. .
DATA MINING AND KNOWLEDGE DISCOVERY, 2019, 33 (06) :1821-1852
[36]  
Middlehurst M, 2020, IEEE INT CONF BIG DA, P188, DOI [10.1109/BigData50022.2020.9378424, 10.1109/BigData20022.9378424]
[37]   Scalable Dictionary Classifiers for Time Series Classification [J].
Middlehurst, Matthew ;
Vickers, William ;
Bagnall, Anthony .
INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2019, PT I, 2019, 11871 :11-19
[38]   Face Stability Analysis for a Shield-Driven Tunnel in Anisotropic and Nonhomogeneous Soils by the Kinematical Approach [J].
Pan, Qiujing ;
Dias, Daniel .
INTERNATIONAL JOURNAL OF GEOMECHANICS, 2016, 16 (03)
[39]  
Pasos-Ruiz A, 2020, ARXIV200713156
[40]  
Ratanamahatana CA, 2005, SIAM PROC S, P506