Scalable time series classification

被引:89
|
作者
Schaefer, Patrick [1 ]
机构
[1] Zuse Inst Berlin, Takustr 7, D-14195 Berlin, Germany
关键词
Time series; Classification; Data mining; Symbolic representation; REPRESENTATION; MODEL;
D O I
10.1007/s10618-015-0441-y
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Time series classification tries to mimic the human understanding of similarity. When it comes to long or larger time series datasets, state-of-the-art classifiers reach their limits because of unreasonably high training or testing times. One representative example is the 1-nearest-neighbor dynamic time warping classifier (1-NN DTW) that is commonly used as the benchmark to compare to. It has several shortcomings: it has a quadratic time complexity in the time series length and its accuracy degenerates in the presence of noise. To reduce the computational complexity, early abandoning techniques, cascading lower bounds, or recently, a nearest centroid classifier have been introduced. Still, classification times on datasets of a few thousand time series are in the order of hours. We present our Bag-Of-SFA-Symbols in Vector Space classifier that is accurate, fast and robust to noise. We show that it is significantly more accurate than 1-NN DTW while being multiple orders of magnitude faster. Its low computational complexity combined with its good classification accuracy makes it relevant for use cases like long or large amounts of time series or real-time analytics.
引用
收藏
页码:1273 / 1298
页数:26
相关论文
共 50 条
  • [1] Scalable time series classification
    Patrick Schäfer
    Data Mining and Knowledge Discovery, 2016, 30 : 1273 - 1298
  • [2] Scalable Dictionary Classifiers for Time Series Classification
    Middlehurst, Matthew
    Vickers, William
    Bagnall, Anthony
    INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2019, PT I, 2019, 11871 : 11 - 19
  • [3] Scalable Classification of Univariate and Multivariate Time Series
    Karimi-Bidhendi, Saeed
    Munshi, Faramarz
    Munshi, Ashfaq
    2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 1598 - 1605
  • [4] A Scalable Segmented Dynamic Time Warping for Time Series Classification
    Ma, Ruizhe
    Ahmadzadeh, Azim
    Boubrahimi, Soukaina Filali
    Angryk, Rafal A.
    ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, ICAISC 2019, PT II, 2019, 11509 : 407 - 419
  • [5] Regularized shapelet learning for scalable time series classification
    Zhao, Huiyun
    Pan, Zhisong
    Tao, Wei
    COMPUTER NETWORKS, 2020, 173 (173)
  • [6] Scalable and accurate subsequence transform for time series classification
    Mbouopda, Michael Franklin
    Nguifo, Engelbert Mephu
    PATTERN RECOGNITION, 2024, 147
  • [7] Scalable Time Series Classification in streaming and batch environments on Apache Spark
    Glenis, Apostolos
    2020 11TH INTERNATIONAL CONFERENCE ON INFORMATION, INTELLIGENCE, SYSTEMS AND APPLICATIONS (IISA 2020), 2020, : 104 - 111
  • [8] Scalable Classification of Repetitive Time Series Through Frequencies of Local Polynomials
    Grabocka, Josif
    Wistuba, Martin
    Schmidt-Thieme, Lars
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2015, 27 (06) : 1683 - 1695
  • [9] TS-CHIEF: a scalable and accurate forest algorithm for time series classification
    Ahmed Shifaz
    Charlotte Pelletier
    François Petitjean
    Geoffrey I. Webb
    Data Mining and Knowledge Discovery, 2020, 34 : 742 - 775
  • [10] Scalable classifier-agnostic channel selection for multivariate time series classification
    Bhaskar Dhariyal
    Thach Le Nguyen
    Georgiana Ifrim
    Data Mining and Knowledge Discovery, 2023, 37 : 1010 - 1054