Time Warping Symbolic Aggregation Approximation with Bag-of-Patterns Representation for Time Series Classification

被引:4
作者
Wang, Zhiguang [1 ]
Oates, Tim [1 ]
机构
[1] Univ Maryland Baltimore Cty, Dept Comp Sci & Elect Engn, Baltimore, MD 21228 USA
来源
2014 13TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA) | 2014年
关键词
D O I
10.1109/ICMLA.2014.49
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Standard Symbolic Aggregation approXimation (SAX) is at the core of many effective time series data mining algorithms. Its combination with Bag-of-Patterns (BoP) has become the standard approach with state-of-the-art performance on standard datasets. However, standard SAX with the BoP representation might neglect internal temporal correlation embedded in the raw data. In this paper, we proposed time warping SAX, which extends the standard SAX with time delay embedding vector approaches to account for temporal correlations. We test time warping SAX with the BoP representation on 12 benchmark datasets from the UCR Time Series Classification/Clustering Collection. On 9 datasets, time warping SAX overtakes the state-of-the-art performance of the standard SAX. To validate our methods in real world applications, a new dataset of vital signs data collected from patients who may require blood transfusion in the next 6 hours was tested. All the results demonstrate that, by considering the temporal internal correlation, time warping SAX combined with BoP improves classification performance.
引用
收藏
页码:270 / 275
页数:6
相关论文
共 16 条
[1]  
[Anonymous], 2003, DMKD, DOI DOI 10.1145/882082.882086
[2]  
[Anonymous], DAT MIN 5 IEEE INT C
[3]   Permutation entropy: A natural complexity measure for time series [J].
Bandt, C ;
Pompe, B .
PHYSICAL REVIEW LETTERS, 2002, 88 (17) :4
[4]  
Box G.E.P., 2013, TIME SERIES ANAL FOR, V4nd, DOI DOI 10.1002/9781118619193
[5]  
Camerra A., 2010, Proceedings 2010 10th IEEE International Conference on Data Mining (ICDM 2010), P58, DOI 10.1109/ICDM.2010.124
[6]  
Clark S. P., 1990, LINCOLN LAB J, V3
[7]   A review of symbolic analysis of experimental data [J].
Daw, CS ;
Finney, CEA ;
Tracy, ER .
REVIEW OF SCIENTIFIC INSTRUMENTS, 2003, 74 (02) :915-930
[8]   Dimensionality Reduction for Fast Similarity Search in Large Time Series Databases [J].
Eamonn Keogh ;
Kaushik Chakrabarti ;
Michael Pazzani ;
Sharad Mehrotra .
Knowledge and Information Systems, 2001, 3 (3) :263-286
[9]  
Kumar N, 2005, SIAM PROC S, P531
[10]  
Lin CD, 2008, 2008 IEEE INTERNATIONAL CONFERENCE ON MULTISENSOR FUSION AND INTEGRATION FOR INTELLIGENT SYSTEMS, VOLS 1 AND 2, P1