Back to Basics: A Sanity Check on Modern Time Series Classification Algorithms

被引:4
作者
Dhariyal, Bhaskar [1 ]
Le Nguyen, Thach [1 ]
Ifrim, Georgiana [1 ]
机构
[1] Univ Coll Dublin, Sch Comp Sci, Dublin, Ireland
来源
ADVANCED ANALYTICS AND LEARNING ON TEMPORAL DATA, AALTD 2023 | 2023年 / 14343卷
关键词
Time series; Classification; Evaluation; Baselines;
D O I
10.1007/978-3-031-49896-1_14
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The state-of-the-art in time series classification has come a long way, from the 1NN-DTW algorithm to the ROCKET family of classifiers. However, in the current fast-paced development of new classifiers, taking a step back and performing simple baseline checks is essential. These checks are often overlooked, as researchers are focused on establishing new state-of-the-art results, developing scalable algorithms, and making models explainable. Nevertheless, there are many datasets that look like time series at first glance, but classic algorithms such as tabular methods with no time ordering may perform better on such problems. For example, for spectroscopy datasets, tabular methods tend to significantly outperform recent time series methods. In this study, we compare the performance of tabular models using classic machine learning approaches (e.g., Ridge, LDA, RandomForest) with the ROCKET family of classifiers (e.g., Rocket, MiniRocket, MultiRocket). Tabular models are simple and very efficient, while the ROCKET family of classifiers are more complex and have state-of-the-art accuracy and efficiency among recent time series classifiers. We find that tabular models outperform the ROCKET family of classifiers on approximately 19% of univariate and 28% of multivariate datasets in the UCR/UEA benchmark and achieve accuracy within 10% points on about 50% of datasets. Our results suggest that it is important to consider simple tabular models as baselines when developing time series classifiers. These models are very fast, can be as effective as more complex methods and may be easier to understand and deploy.
引用
收藏
页码:205 / 229
页数:25
相关论文
共 22 条
[1]  
Bagnall A., 2012, P SIAM INT C DATA MI, P307
[2]  
Bagnall A, 2016, Arxiv, DOI arXiv:1602.01711
[3]   Hydra: competing convolutional kernels for fast and accurate time series classification [J].
Dempster, Angus ;
Schmidt, Daniel F. ;
Webb, Geoffrey I. .
DATA MINING AND KNOWLEDGE DISCOVERY, 2023, 37 (05) :1779-1805
[4]   MINIROCKET A Very Fast (Almost) Deterministic Transform for Time Series Classification [J].
Dempster, Angus ;
Schmidt, Daniel F. ;
Webb, Geoffrey, I .
KDD '21: PROCEEDINGS OF THE 27TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2021, :248-257
[5]   ROCKET: exceptionally fast and accurate time series classification using random convolutional kernels [J].
Dempster, Angus ;
Petitjean, Francois ;
Webb, Geoffrey, I .
DATA MINING AND KNOWLEDGE DISCOVERY, 2020, 34 (05) :1454-1495
[6]  
Demsar J, 2006, J MACH LEARN RES, V7, P1
[7]   Scalable classifier-agnostic channel selection for multivariate time series classification [J].
Dhariyal, Bhaskar ;
Le Nguyen, Thach ;
Ifrim, Georgiana .
DATA MINING AND KNOWLEDGE DISCOVERY, 2023, 37 (02) :1010-1054
[8]   InceptionTime: Finding AlexNet for time series classification [J].
Fawaz, Hassan Ismail ;
Lucas, Benjamin ;
Forestier, Germain ;
Pelletier, Charlotte ;
Schmidt, Daniel F. ;
Weber, Jonathan ;
Webb, Geoffrey, I ;
Idoumghar, Lhassane ;
Muller, Pierre-Alain ;
Petitjean, Francois .
DATA MINING AND KNOWLEDGE DISCOVERY, 2020, 34 (06) :1936-1962
[9]   Classification of cow diet based on milk Mid Infrared Spectra: A data analysis competition at the ?International Workshop on Spectroscopy and Chemometrics [J].
Frizzarin, Maria ;
Visentin, Giulio ;
Ferragina, Alessandro ;
Hayes, Elena ;
Bevilacqua, Antonio ;
Dhariyal, Bhaskar ;
Domijan, Katarina ;
Khan, Hussain ;
Ifrim, Georgiana ;
Le Nguyen, Thach ;
Meagher, Joe ;
Menchetti, Laura ;
Singh, Ashish ;
Whoriskey, Suzy ;
Williamson, Robert ;
Zappaterra, Martina ;
Casa, Alessandro .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2023, 234
[10]   Mid infrared spectroscopy and milk quality traits: A data analysis competition at the "International Workshop on Spectroscopy and Chemometrics 2021" [J].
Frizzarin, Maria ;
Bevilacqua, Antonio ;
Dhariyal, Bhaskar ;
Domijan, Katarina ;
Ferraccioli, Federico ;
Hayes, Elena ;
Ifrim, Georgiana ;
Konkolewska, Agnieszka ;
Le Nguyen, Thach ;
Mbaka, Uche ;
Ranzato, Giovanna ;
Singh, Ashish ;
Stefanucci, Marco ;
Casa, Alessandro .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2021, 219