Learning multivariate shapelets with multi-layer neural networks for interpretable time-series classification

被引:11
作者
Medico, Roberto [1 ]
Ruyssinck, Joeri [1 ]
Deschrijver, Dirk [1 ]
Dhaene, Tom [1 ]
机构
[1] Univ Ghent, IMEC, Dept Informat Technol, IDLab, iGent Tower,Technol Pk Zwijnaarde 126, B-9052 Ghent, Belgium
关键词
Shapelets; Time-series classification; Machine learning; Neural networks;
D O I
10.1007/s11634-021-00437-8
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Shapelets are discriminative subsequences extracted from time-series data. Classifiers using shapelets have proven to achieve performances competitive to state-of-the-art methods, while enhancing the model's interpretability. While a lot of research has been done for univariate time-series shapelets, extensions for the multivariate setting have not yet received much attention. To extend shapelets-based classification to a multidimensional setting, we developed a novel architecture for shapelets learning, by embedding them as trainable weights in a multi-layer Neural Network. We also investigated the introduction of a novel learning strategy for the shapelets, comprising of two additional terms in the optimization goal, to retrieve a reduced set of uncorrelated shapelets. This paper describes the proposed architecture and presents results on ten publicly available benchmark datasets, as well as a comparison with existing state-of-the-art methods. Moreover, the proposed optimization objective leads the model to automatically select smaller sets of uncorrelated shapelets, thus requiring no additional manual optimization on typically important hyper-parameters such as number and length of shapelets. The results show how the proposed approach achieves competitive performance across the datasets, and always leads to a significant reduction in the number of shapelets used. This can make it faster for a domain expert to match shapelets to real patterns, thus enhancing the interpretability of the model. Finally, since the shapelets learnt during training can be extracted from the model they can serve as meaningful insights on the classifier's decisions and the interactions between different dimensions.
引用
收藏
页码:911 / 936
页数:26
相关论文
共 28 条
[1]  
Bagnall A, 2017, ARXIV PREPRINT ARXIV
[2]  
Bagnall A., 2018, IS ROTATION FOREST B
[3]   The great time series classification bake off: a review and experimental evaluation of recent algorithmic advances [J].
Bagnall, Anthony ;
Lines, Jason ;
Bostrom, Aaron ;
Large, James ;
Keogh, Eamonn .
DATA MINING AND KNOWLEDGE DISCOVERY, 2017, 31 (03) :606-660
[4]  
Cetin MustafaS., 2015, Proceedings of the 2015 SIAM International Conference on Data Mining, P307
[5]   A time series forest for classification and feature extraction [J].
Deng, Houtao ;
Runger, George ;
Tuv, Eugene ;
Vladimir, Martyanov .
INFORMATION SCIENCES, 2013, 239 :142-153
[6]  
Gao X., 2018, DYNAMIC CHANNEL PRUN
[7]  
Glorot X., 2011, P 14 INT C ARTIFICIA, P315
[8]  
Glorot X., 2010, JMLR WORKSHOP C P, P249
[9]   Learning Time-Series Shapelets [J].
Grabocka, Josif ;
Schilling, Nicolas ;
Wistuba, Martin ;
Schmidt-Thieme, Lars .
PROCEEDINGS OF THE 20TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING (KDD'14), 2014, :392-401
[10]   Classification of time series by shapelet transformation [J].
Hills, Jon ;
Lines, Jason ;
Baranauskas, Edgaras ;
Mapp, James ;
Bagnall, Anthony .
DATA MINING AND KNOWLEDGE DISCOVERY, 2014, 28 (04) :851-881