Variable-Length Multivariate Time Series Classification Using ROCKET: A Case Study of Incident Detection

被引:8
|
作者
Bier, Agnieszka [1 ,2 ]
Jastrzebska, Agnieszka [1 ,3 ]
Olszewski, Pawel [1 ]
机构
[1] DTiQ Poland Sp Zoo, PL-44100 Gliwice, Poland
[2] Silesian Tech Univ, Fac Appl Math, PL-44100 Gliwice, Poland
[3] Warsaw Univ Technol, Fac Math & Informat Sci, PL-00662 Warsaw, Poland
来源
IEEE ACCESS | 2022年 / 10卷
关键词
Time series analysis; Rockets; Classification algorithms; Feature extraction; Classification; incident detection; multivariate time series; ROCKET; varying-length time series; FRAUD DETECTION;
D O I
10.1109/ACCESS.2022.3203523
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Multivariate time series classification is a machine learning problem that can be applied to automate a wide range of real-world data analysis tasks. RandOm Convolutional KErnel Transform (ROCKET) proved to be an outstanding algorithm capable to classify time series accurately and quickly. The textbook variant of the multivariate time series classification problem assumes that time series to be classified are all of the same length, while in real-world applications this assumption not necessarily holds. The literature of this domain does not pay enough attention to data processing pipelines for variable-length time series. Thus, in this paper, we present a thorough analysis of three preprocessing pipelines that handle variable-length time series that need to be classified with a method that requires the data to be of equal length. These three methods are truncation, padding, and forecasting of missing value. Experiments conducted on benchmark datasets, showed that the recommended procedure involves padding. Forecasting ensures similar classification accuracy, but comes at a much higher computational cost. Truncation is not a viable option. Furthermore, in the paper, we present a novel domain of application of multivariate time series classification algorithms, that is incident detection in cash transactions. This area poses substantive challenges for automated model training procedures since the data is not only variable-length, but also heavily imbalanced. In the study, we list various incident types and present trained classifiers capable to aid human auditors in their daily work.
引用
收藏
页码:95701 / 95715
页数:15
相关论文
共 50 条
  • [1] Exact variable-length anomaly detection algorithm for univariate and multivariate time series
    Xing Wang
    Jessica Lin
    Nital Patel
    Martin Braun
    Data Mining and Knowledge Discovery, 2018, 32 : 1806 - 1844
  • [2] Exact variable-length anomaly detection algorithm for univariate and multivariate time series
    Wang, Xing
    Lin, Jessica
    Patel, Nital
    Braun, Martin
    DATA MINING AND KNOWLEDGE DISCOVERY, 2018, 32 (06) : 1806 - 1844
  • [3] Convolutional Neural Networks for Time-dependent Classification of Variable-length Time Series
    Sawada, Azusa
    Miyagawa, Taiki
    Ebihara, Akinori
    Yachida, Shoji
    Hosoi, Toshinori
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [4] Variable-Length Subsequence Clustering in Time Series
    Duan, Jiangyong
    Guo, Lili
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2022, 34 (02) : 983 - 995
  • [5] Beyond Information Distortion: Imaging Variable-Length Time Series Data for Classification
    Lee, Hyeonsu
    Shin, Dongmin
    SENSORS, 2025, 25 (03)
  • [6] Modelling of chaotic time series using a variable-length windowing approach
    Tekbas, ÖH
    CHAOS SOLITONS & FRACTALS, 2006, 29 (02) : 277 - 281
  • [7] A Variable-Length Motifs Discovery Method in Time Series using Hybrid Approach
    Zan, Chaw Thet
    Yamana, Hayato
    19TH INTERNATIONAL CONFERENCE ON INFORMATION INTEGRATION AND WEB-BASED APPLICATIONS & SERVICES (IIWAS2017), 2017, : 49 - 57
  • [8] Fault Detection on Variable Length Multivariate Time Series from Semiconductor Manufacturing
    Tchatchoua, Philip
    Graton, Guillaume
    Ouladsine, Mustapha
    Christaud, Jean-Francois
    2023 IEEE SENSORS, 2023,
  • [9] Exploring variable-length time series motifs in one hundred million length scale
    Yifeng Gao
    Jessica Lin
    Data Mining and Knowledge Discovery, 2018, 32 : 1200 - 1228
  • [10] Exploring variable-length time series motifs in one hundred million length scale
    Gao, Yifeng
    Lin, Jessica
    DATA MINING AND KNOWLEDGE DISCOVERY, 2018, 32 (05) : 1200 - 1228