Symbolic Time Series Representation for Stream Data Processing

被引:6
作者
Sevcech, Jakub [1 ]
Bielikova, Maria [1 ]
机构
[1] Slovak Univ Technol Bratislava, Fac Informat & Informat Technol, Ilkovicova 2, Bratislava 84216, Slovakia
来源
2015 IEEE TRUSTCOM/BIGDATASE/ISPA, VOL 2 | 2015年
关键词
time series representation; symbolic representation; stream processing; lower bound;
D O I
10.1109/Trustcom.2015.586
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Over the past years, many representations for time series were proposed with the main purpose of dimensionality reduction and as a support for various algorithms in the domain of time series data processing. However, most of the transformation algorithms are not directly applicable on streams of data but only on static collections of the data as they are iterative in their nature. In this work we propose a symbolic representation of time series along with the method for transformation of the data into proposed representation. As one of the basic requirements for applicable representation is the distance measure which would accurately reflect the true shape of the data, we propose a distance measure operating on the proposed representation and lower bounding the Euclidean distance on the original data. We evaluate properties of the proposed representation and the distance measure on a number of different datasets.
引用
收藏
页码:217 / 222
页数:6
相关论文
共 10 条
  • [1] [Anonymous], P 6 AUSTR C DAT MIN
  • [2] [Anonymous], KDD P
  • [3] Baydogan M.G., 2014, Data Mining and Knowledge Discovery, P1
  • [4] Fu TC, 2005, LECT NOTES ARTIF INT, V3613, P1171
  • [5] Giannella C., 2003, NEXT GENERATION DATA
  • [6] Clustering of time-series subsequences is meaningless: implications for previous and future research
    Keogh, E
    Lin, J
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2005, 8 (02) : 154 - 177
  • [7] Dimensionality Reduction for Fast Similarity Search in Large Time Series Databases
    Eamonn Keogh
    Kaushik Chakrabarti
    Michael Pazzani
    Sharad Mehrotra
    [J]. Knowledge and Information Systems, 2001, 3 (3) : 263 - 286
  • [8] Keogh E., UCR TIME SERIES CLAS
  • [9] Experiencing SAX: a novel symbolic representation of time series
    Lin, Jessica
    Keogh, Eamonn
    Wei, Li
    Lonardi, Stefano
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY, 2007, 15 (02) : 107 - 144
  • [10] Experimental comparison of representation methods and distance measures for time series data
    Wang, Xiaoyue
    Mueen, Abdullah
    Ding, Hui
    Trajcevski, Goce
    Scheuermann, Peter
    Keogh, Eamonn
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY, 2013, 26 (02) : 275 - 309