Fast normalization-transformed subsequence matching in time-series databases

被引:6
作者
Moon, Yang-Sae [1 ]
Kim, Jinho [1 ]
机构
[1] Kangwon Natl Univ, Dept Comp Sci, Kangwon Do, South Korea
关键词
data mining; time-series databases; subsequence matching; normalization transform;
D O I
10.1093/ietisy/e90-d.12.2007
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Normalization transform is known to be very useful for finding the overall trend of time-series data since it enables finding sequences with similar fluctuation patterns. Previous subsequence matching methods with normalization transform, however, would incur index overhead both in storage space and in update maintenance since they should build multiple indexes for supporting query sequences of arbitrary length. To solve this problem, we adopt a single-index approach in the normalization-transformed subsequence matching that supports query sequences of arbitrary length. For the single-index approach, we first provide the notion of inclusion-normalization transform by generalizing the original definition of normalization transform. To normalize a window, the inclusion-normalization transform uses the mean and the standard deviation of a subsequence that includes the window while the original transform uses those of the window itself. Next, we formally prove the correctness of the proposed normalization-transformed subsequence matching method that uses the inclusion-normalization transform. We then propose subsequence matching and index-building algorithms to implement the proposed method. Experimental results for real stock data show that our method improves performance by up to 2.5 similar to 2.8 times compared with the previous method.
引用
收藏
页码:2007 / 2018
页数:12
相关论文
共 21 条
[1]  
Agrawal R., 1995, VLDB '95. Proceedings of the 21st International Conference on Very Large Data Bases, P490
[2]  
Agrawal R., 1993, P 4 INT C FDN DAT OR, V730, P69
[3]  
[Anonymous], P ACM SIG MOD INT C
[4]  
BECKMANN N, 1990, SIGMOD REC, V19, P322, DOI 10.1145/93605.98741
[5]   The effect of glucagon-induced gastric relaxation on TLOSR frequency [J].
Chang, HY ;
Pandolfino, JE ;
Shi, G ;
Boeckxstaens, GE ;
Joehl, RJ ;
Kahrilas, PJ .
NEUROGASTROENTEROLOGY AND MOTILITY, 2003, 15 (01) :3-8
[6]  
Chu K. K. W., 1999, Proceedings of the Eighteenth ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems, P237, DOI 10.1145/303976.304000
[7]  
Keogh E., 2006, P 32 INT C VER LARG, P1268
[8]   Efficient processing of similarity search under time warping in sequence databases: an index-based approach [J].
Kim, SW ;
Park, S ;
Chu, WW .
INFORMATION SYSTEMS, 2004, 29 (05) :405-420
[9]  
LIM SH, 2006, P INT C DAT SYST ADV, P65
[10]   A subsequence matching algorithm that supports normalization transform in time-series databases [J].
Loh, WK ;
Kim, SW ;
Whang, KY .
DATA MINING AND KNOWLEDGE DISCOVERY, 2004, 9 (01) :5-28