Speeding Up Similarity Search on a Large Time Series Dataset under Time Warping Distance

被引:0
作者
Ruengronghirunya, Pongsakorn [1 ]
Niennattrakul, Vit [1 ]
Ratanamahatana, Chotirat Ann [1 ]
机构
[1] Chulalongkorn Univ, Dept Comp Engn, Bangkok 10330, Thailand
来源
ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS | 2009年 / 5476卷
关键词
Similarity Search; Dynamic Time Warping; Time Series Data; Indexing; Lower Bounding Function;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Time series data are a ubiquitous data type appearing in many domains such as statistics, finance, multimedia, etc. Similarity search and measurement on time series data are typically different from on other data types since time series data have the associations among adjacent dimensions. Accordingly, the classic Euclidean distance metric is not an accurate similarity measure for lime series. Therefore, Dynamic Time Warping (DTW) has become a better choice for similarity measurement on time series in various applications regardless of its high computational cost. To speed tip the calculation, many research works attempt to speed up DTW calculation using indexing method, which always has a tradeoff between indexing efficiency and I/O cost. In this paper, we propose a novel method to balance this tradeoff under indexed sequential access using Sequentially Indexed Structure (SIS), an approach to time series indexing with low computational cost and small overheads on I/O. Finally, we conduct experiments to demonstrate our superiority in speed performance over the best existing method.
引用
收藏
页码:981 / 988
页数:8
相关论文
共 11 条
[1]  
Agrawal R., 1993, Proceedings of the International Conference on Foundations of Data Organization and Algorithms, Chicago, IL, P69
[2]  
[Anonymous], P INT C MAN DAT
[3]  
BECKMANN N, 1990, SIGMOD REC, V19, P322, DOI 10.1145/93605.98741
[4]  
BERCHTOLD S, 1998, P 1998 ACM SIGMOD IN
[5]  
EUACHONGPRASIT W, 2008, P PAC AS C KNOWL DIS
[6]   Exact indexing of dynamic time warping [J].
Keogh, E ;
Ratanamahatana, CA .
KNOWLEDGE AND INFORMATION SYSTEMS, 2005, 7 (03) :358-386
[7]  
Keogh E., UCR TIME SERIES CLAS
[8]  
MacQueen J., 1967, 5 BERK S MATH STAT P, P281
[9]  
Ratanamahatana CA, 2005, SIAM PROC S, P506
[10]  
Sakurai Y., 2005, PRINCIPLES DATABASE, P326