Distribution Agnostic Symbolic Representations for Time Series Dimensionality Reduction and Online Anomaly Detection

被引:5
|
作者
Bountrogiannis, Konstantinos [1 ,2 ]
Tzagkarakis, George [2 ]
Tsakalides, Panagiotis [2 ]
机构
[1] Univ Crete, Comp Sci Dept, Iraklion 70013, Greece
[2] Fdn Res & Technol Hellas, Inst Comp Sci, GR-70013 Iraklion, Greece
关键词
Time series analysis; Data mining; Anomaly detection; Aggregates; Task analysis; Quantization (signal); Market research; dynamic clustering; kernel methods; streaming data; symbolic representations; time series analysis; AGGREGATE APPROXIMATION; MEAN SHIFT;
D O I
10.1109/TKDE.2022.3174630
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Due to the importance of the lower bounding distances and the attractiveness of symbolic representations, the family of symbolic aggregate approximations (SAX) has been used extensively for encoding time series data. However, typical SAX-based methods rely on two restrictive assumptions; the Gaussian distribution and equiprobable symbols. This paper proposes two novel data-driven SAX-based symbolic representations, distinguished by their discretization steps. The first representation, oriented for general data compaction and indexing scenarios, is based on the combination of kernel density estimation and Lloyd-Max quantization to minimize the information loss and mean squared error in the discretization step. The second method, oriented for high-level mining tasks, employs the Mean-Shift clustering method and is shown to enhance anomaly detection in the lower-dimensional space. Besides, we verify on a theoretical basis a previously observed phenomenon of the intrinsic process that results in a lower than the expected variance of the intermediate piecewise aggregate approximation. This phenomenon causes an additional information loss but can be avoided with a simple modification. The proposed representations possess all the attractive properties of the conventional SAX method. Furthermore, experimental evaluation on real-world datasets demonstrates their superiority compared to the traditional SAX and an alternative data-driven SAX variant.
引用
收藏
页码:5752 / 5766
页数:15
相关论文
共 50 条
  • [21] A Review on Time Series Dimensionality Reduction
    Badhiye, Sagar S.
    Chatur, P. N.
    HELIX, 2018, 8 (05): : 3957 - 3960
  • [22] Analysing Time Series using Symbolic Representations
    Monetti, R.
    Bunk, W.
    Jamitzky, F.
    TOPICS ON CHAOTIC SYSTEMS, 2009, : 242 - 250
  • [23] Anomaly Detection for Telemetry Time Series Using a Denoising Diffusion Probabilistic Model
    Sui, Jialin
    Yu, Jinsong
    Song, Yue
    Zhang, Jian
    IEEE SENSORS JOURNAL, 2024, 24 (10) : 16429 - 16439
  • [24] A Novel Time Series Representation Approach for Dimensionality Reduction
    Bawaneh, Mohammad
    Simon, Vilmos
    INFOCOMMUNICATIONS JOURNAL, 2022, 14 (02): : 44 - 55
  • [25] A Flexible Framework for Anomaly Detection via Dimensionality Reduction
    Sadr, Alireza Vafaei
    Bassett, Bruce A.
    Kunz, M.
    2019 6TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING & MACHINE INTELLIGENCE (ISCMI 2019), 2019, : 106 - 110
  • [26] A flexible framework for anomaly Detection via dimensionality reduction
    Sadr, Alireza Vafaei
    Bassett, Bruce A.
    Kunz, M.
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (02) : 1157 - 1167
  • [27] A flexible framework for anomaly Detection via dimensionality reduction
    Alireza Vafaei Sadr
    Bruce A. Bassett
    M. Kunz
    Neural Computing and Applications, 2023, 35 : 1157 - 1167
  • [28] Learning Sparse Latent Graph Representations for Anomaly Detection in Multivariate Time Series
    Han, Siho
    Woo, Simon S.
    PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, : 2977 - 2986
  • [29] Topological data analysis for unsupervised anomaly detection in time series
    Bois, Alexandre
    Tervil, Brian
    Oudre, Laurent
    32ND EUROPEAN SIGNAL PROCESSING CONFERENCE, EUSIPCO 2024, 2024, : 1197 - 1201
  • [30] TSAGen: Synthetic Time Series Generation for KPI Anomaly Detection
    Wang, Chengyu
    Wu, Kui
    Zhou, Tongqing
    Yu, Guang
    Cai, Zhiping
    IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2022, 19 (01): : 130 - 145