Distribution Agnostic Symbolic Representations for Time Series Dimensionality Reduction and Online Anomaly Detection

被引:5
作者
Bountrogiannis, Konstantinos [1 ,2 ]
Tzagkarakis, George [2 ]
Tsakalides, Panagiotis [2 ]
机构
[1] Univ Crete, Comp Sci Dept, Iraklion 70013, Greece
[2] Fdn Res & Technol Hellas, Inst Comp Sci, GR-70013 Iraklion, Greece
关键词
Time series analysis; Data mining; Anomaly detection; Aggregates; Task analysis; Quantization (signal); Market research; dynamic clustering; kernel methods; streaming data; symbolic representations; time series analysis; AGGREGATE APPROXIMATION; MEAN SHIFT;
D O I
10.1109/TKDE.2022.3174630
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Due to the importance of the lower bounding distances and the attractiveness of symbolic representations, the family of symbolic aggregate approximations (SAX) has been used extensively for encoding time series data. However, typical SAX-based methods rely on two restrictive assumptions; the Gaussian distribution and equiprobable symbols. This paper proposes two novel data-driven SAX-based symbolic representations, distinguished by their discretization steps. The first representation, oriented for general data compaction and indexing scenarios, is based on the combination of kernel density estimation and Lloyd-Max quantization to minimize the information loss and mean squared error in the discretization step. The second method, oriented for high-level mining tasks, employs the Mean-Shift clustering method and is shown to enhance anomaly detection in the lower-dimensional space. Besides, we verify on a theoretical basis a previously observed phenomenon of the intrinsic process that results in a lower than the expected variance of the intermediate piecewise aggregate approximation. This phenomenon causes an additional information loss but can be avoided with a simple modification. The proposed representations possess all the attractive properties of the conventional SAX method. Furthermore, experimental evaluation on real-world datasets demonstrates their superiority compared to the traditional SAX and an alternative data-driven SAX variant.
引用
收藏
页码:5752 / 5766
页数:15
相关论文
共 50 条
  • [31] QDetect: Time Series Querying Based Road Anomaly Detection
    Zheng, Zengwei
    Zhou, Mingxuan
    Chen, Yuanyi
    Huo, Meimei
    Sun, Lin
    IEEE ACCESS, 2020, 8 : 98974 - 98985
  • [32] Anomaly Detection in Event-Triggered Traffic Time Series via Similarity Learning
    Dou, Shaoyu
    Yang, Kai
    Jiao, Yang
    Qiu, Chengbo
    Ren, Kui
    IEEE TRANSACTIONS ON DEPENDABLE AND SECURE COMPUTING, 2025, 22 (02) : 888 - 902
  • [33] Outsourced privacy-preserving anomaly detection in time series of multi-party
    Zhang, Chunkai
    Zuo, Wei
    Yang, Peng
    Li, Ye
    Wang, Xuan
    CHINA COMMUNICATIONS, 2022, 19 (02) : 201 - 213
  • [34] Local Anomaly Detection for Multivariate Time Series by Temporal Dependency Based on Poisson Model
    Benkabou, Seif-Eddine
    Benabdeslem, Khalid
    Kraus, Vivien
    Bourhis, Kilian
    Canitia, Bruno
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (11) : 6701 - 6711
  • [35] Symbolic Time Series Analysis for Anomaly Detection in Measure-Invariant Ergodic Systems
    Ghalyan, Najah F.
    Ray, Asok
    JOURNAL OF DYNAMIC SYSTEMS MEASUREMENT AND CONTROL-TRANSACTIONS OF THE ASME, 2020, 142 (06):
  • [36] A Survey on Graph Neural Networks for Time Series: Forecasting, Classification, Imputation, and Anomaly Detection
    Jin, Ming
    Koh, Huan Yee
    Wen, Qingsong
    Zambon, Daniele
    Alippi, Cesare
    Webb, Geoffrey I.
    King, Irwin
    Pan, Shirui
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (12) : 10466 - 10485
  • [37] A SURVEY OF RESEARCH ON ANOMALY DETECTION FOR TIME SERIES
    Wu, Hu-Sheng
    2016 13TH INTERNATIONAL COMPUTER CONFERENCE ON WAVELET ACTIVE MEDIA TECHNOLOGY AND INFORMATION PROCESSING (ICCWAMTIP), 2016, : 426 - 431
  • [38] A Survey on Dimensionality Reduction Techniques for Time-Series Data
    Ashraf, Mohsena
    Anowar, Farzana
    Setu, Jahanggir H.
    Chowdhury, Atiqul I.
    Ahmed, Eshtiak
    Islam, Ashraful
    Al-Mamun, Abdullah
    IEEE ACCESS, 2023, 11 : 42909 - 42923
  • [39] Improving Network Security through Traffic Log Anomaly Detection Using Time Series Analysis
    Rodriguez, Aitor Corchero
    de los Mozos, Mario Reyes
    COMPUTATIONAL INTELLIGENCE IN SECURITY FOR INFORMATION SYSTEMS 2010, 2010, 85 : 125 - 133
  • [40] Multichannel Anomaly Detection for Spacecraft Time Series Using MAP Estimation
    Li, Tianyu
    Baireddy, Sriram
    Comer, Mary
    Delp, Edward
    Desai, Sundip R.
    Foster, Richard H.
    Chan, Moses W.
    IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2024, 60 (05) : 5842 - 5855