Multivariate times series classification through an interpretable representation

被引:17
作者
Baldan, Francisco J. [1 ]
Benitez, Jose M. [1 ]
机构
[1] Univ Granada, Dept Comp Sci & Artificial Intelligence, DaSCI, DICITS,iMUDS, Granada 18071, Spain
关键词
Multivariate; Time series features; Complexity measures; Time series interpretation; Classification;
D O I
10.1016/j.ins.2021.05.024
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Multivariate time series classification is a machine learning task with increasing importance due to the proliferation of information sources in different domains (economy, health, energy, crops, etc.). Univariate methods lack the ability to capture the relationships between the different variables that compose a multivariate time series and therefore cannot be directly extrapolated to multivariate environments. Despite the good performance and competitive results of the multivariate proposals published to date, they are hard to interpret due to their high complexity. In this paper, we propose a multivariate time series classification method based on an alternative representation of the time series, composed of a set of 41 descriptive time series features, in order to improve the interpretability of time series and results obtained. Our proposal uses traditional classifiers over the extracted features to look for relationships between the different variables that form a multivariate time series. We have selected four state-of-the-art algorithms as base classifiers to evaluate our method. We have tested our proposal on the complete University of East Anglia repository, obtaining highly interpretable results capable of explaining the relationships between the features that compose the time series and achieving performance results statistically indistinguishable from the best algorithms of the state-of-the-art. (c) 2021 The Author(s). Published by Elsevier Inc. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
引用
收藏
页码:596 / 614
页数:19
相关论文
共 43 条
[1]  
Abdiansah A., 2015, Int. J. Comput. Appl, V128, P28, DOI [DOI 10.5120/IJCA2015906480, 10.5120/ijca2015906480]
[2]   Robust classification of multivariate time series by imprecise hidden Markov models [J].
Antonucci, Alessandro ;
De Rosa, Rocco ;
Giusti, Alessandro ;
Cuzzolin, Fabio .
INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2015, 56 :249-263
[3]  
Bagnall A., 2018, IS ROTATION FOREST B
[4]  
Baldan F.J., ARXIV PREPRINT ARXIV
[5]   Distributed FastShapelet Transform: a Big Data time series classification algorithm [J].
Baldan, Francisco J. ;
Benitez, Jose M. .
INFORMATION SCIENCES, 2019, 496 :451-463
[6]  
Baydogan M, 2017, MULTIVARIATE TIME SE
[7]   Time series representation and similarity based on local autopatterns [J].
Baydogan, Mustafa Gokce ;
Runger, George .
DATA MINING AND KNOWLEDGE DISCOVERY, 2016, 30 (02) :476-509
[8]   Learning a symbolic representation for multivariate time series classification [J].
Baydogan, Mustafa Gokce ;
Runger, George .
DATA MINING AND KNOWLEDGE DISCOVERY, 2015, 29 (02) :400-422
[9]  
Bostrom A., ARXIV PREPRINT ARXIV
[10]   The origins of the Gini index: extracts from VariabilitA e MutabilitA (1912) by Corrado Gini [J].
Ceriani, Lidia ;
Verme, Paolo .
JOURNAL OF ECONOMIC INEQUALITY, 2012, 10 (03) :421-443