Comparison of Measures for Characterizing the Difficulty of Time Series Classification

被引:0
作者
Charane, Adam [1 ]
Ceccarello, Matteo [2 ]
Gamper, Johann [1 ]
机构
[1] Free Univ Bozen Bolzano, Bolzano, Italy
[2] Univ Padua, Padua, Italy
来源
BIG DATA ANALYTICS AND KNOWLEDGE DISCOVERY, DAWAK 2024 | 2024年 / 14912卷
关键词
Complexity measures; Time series classification;
D O I
10.1007/978-3-031-68323-7_19
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The performance of machine learning algorithms is influenced both by their characteristics and parameterization as well as by the properties of the data they are trained and evaluated on. The latter aspect is often neglected. In this paper, we focus our attention on properties of the data that affect the accuracy of time series classification. We experimentally study how the difficulty of classifying time series is related to well-known model-agnostic data complexity measures. Our experiments show that (a) many of these measures are highly correlated with classification scores such as accuracy and F1 and (b) different families of complexity measures capture different properties of the data.
引用
收藏
页码:239 / 244
页数:6
相关论文
共 9 条
[1]   The great time series classification bake off: a review and experimental evaluation of recent algorithmic advances [J].
Bagnall, Anthony ;
Lines, Jason ;
Bostrom, Aaron ;
Large, James ;
Keogh, Eamonn .
DATA MINING AND KNOWLEDGE DISCOVERY, 2017, 31 (03) :606-660
[2]   Time Series FeatuRe Extraction on basis of Scalable Hypothesis tests (tsfresh - A Python']Python package) [J].
Christ, Maximilian ;
Braun, Nils ;
Neuffer, Julius ;
Kempa-Liehr, Andreas W. .
NEUROCOMPUTING, 2018, 307 :72-77
[3]  
Dau H.A., 2018, Hexagon-ML: the UCR time series classification archive
[4]   GENERAL COEFFICIENT OF SIMILARITY AND SOME OF ITS PROPERTIES [J].
GOWER, JC .
BIOMETRICS, 1971, 27 (04) :857-&
[5]  
Ho TK, 2002, IEEE T PATTERN ANAL, V24, P289, DOI 10.1109/34.990132
[6]   How Complex Is Your Classification Problem?: A Survey on Measuring Classification Complexity [J].
Lorena, Ana C. ;
Garcia, Luis P. F. ;
Lehmann, Jens ;
Souto, Marcilio C. P. ;
Ho, Tin Kam .
ACM COMPUTING SURVEYS, 2019, 52 (05)
[7]   catch22: CAnonical Time-series CHaracteristics Selected through highly comparative time-series analysis [J].
Lubba, Carl H. ;
Sethi, Sarab S. ;
Knaute, Philip ;
Schultz, Simon R. ;
Fulcher, Ben D. ;
Jones, Nick S. .
DATA MINING AND KNOWLEDGE DISCOVERY, 2019, 33 (06) :1821-1852
[8]   PRISM - A novel framework for pattern recognition [J].
Singh, S .
PATTERN ANALYSIS AND APPLICATIONS, 2003, 6 (02) :134-149
[9]  
Sohn SY, 1999, IEEE T PATTERN ANAL, V21, P1137, DOI 10.1109/34.809107