Adaptive segmentation-based symbolic representations of time series for better modeling and lower bounding distance measures

Authors
Hugueney, Bernard [1]
Affiliation
[1] Univ Paris 09, LAMSADE, F-75775 Paris 16, France
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Time series data-mining algorithms usually scale poorly with dimensionality. Symbolic representations have proven very effective at reducing the dimensionality of time series, even when they rely on simple aggregations over episodes of equal length and a fixed set of symbols. However, adaptive symbolic representations would represent the dataset more accurately without compromising the dimensionality reduction. We therefore propose a new generic framework for computing adaptive Segmentation Based Symbolic Representations (SBSR) of time series. SBSR can be applied to any model, but we focus on piecewise constant models (SBSRLO), which are the most commonly used. SBSR are built by computing both the episode boundaries and the symbolic alphabet so as to minimize the information loss of the resulting symbolic representation. We also propose a new distance measure for SBSRLO that tightly lower bounds the Euclidean distance.
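The lower-bounding idea for piecewise constant models can be illustrated with a minimal sketch. This is not the SBSR algorithm or the distance measure proposed in the paper, neither of which is specified in the abstract. It assumes two series summarized by per-episode means over the same, already chosen episode boundaries, and it omits the symbolic alphabet step entirely. The function names and the `boundaries` convention (exclusive end indices) are hypothetical. The length-weighted distance between episode means never exceeds the Euclidean distance, because within each episode the squared mean difference times the episode length is at most the sum of squared pointwise differences.

```python
import math


def segment_means(series, boundaries):
    """Return the mean of each episode.

    `boundaries` holds exclusive end indices of the episodes
    (an illustrative convention); the last one equals len(series).
    """
    means, start = [], 0
    for end in boundaries:
        chunk = series[start:end]
        means.append(sum(chunk) / len(chunk))
        start = end
    return means


def lower_bound_distance(means_a, means_b, boundaries):
    """Length-weighted distance between two piecewise constant models
    that share the same episode boundaries."""
    total, start = 0.0, 0
    for mean_a, mean_b, end in zip(means_a, means_b, boundaries):
        total += (end - start) * (mean_a - mean_b) ** 2
        start = end
    return math.sqrt(total)


def euclidean(a, b):
    """Plain Euclidean distance between two equal-length series."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


# Toy data: three episodes of unequal length (3, 3 and 2 points).
a = [1.0, 2.0, 3.0, 10.0, 11.0, 12.0, 5.0, 5.0]
b = [2.0, 2.0, 2.0, 8.0, 9.0, 10.0, 4.0, 6.0]
boundaries = [3, 6, 8]

lb = lower_bound_distance(
    segment_means(a, boundaries), segment_means(b, boundaries), boundaries
)
full = euclidean(a, b)
print(lb, full)  # lb is never larger than full
```

The bound holds for any shared segmentation, so unequal, data-dependent episode lengths do not break it. Replacing the means with symbols from a finite alphabet generally requires extra care to keep the bound valid, which this sketch does not attempt.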
Pages: 545 - 552
Number of pages: 8