A Novel Parallel Scheme for Fast Similarity Search in Large Time Series

被引:9
作者
Yin Hong [1 ,3 ]
Yang Shuqiang [1 ]
Ma Shaodong [2 ]
Liu Fei [1 ]
Chen Zhikun [1 ]
机构
[1] Natl Univ Def Technol, Coll Comp, Changsha 410073, Hunan, Peoples R China
[2] Univ Hull, Sch Engn, Kingston Upon Hull HU6 7RX, N Humberside, England
[3] Xiangyang Sch NCOs, Xiangyang 441118, Peoples R China
基金
中国国家自然科学基金;
关键词
similarity; DTW; warping path; time series; MapReduce; parallelization; cluster;
D O I
10.1109/CC.2015.7084408
中图分类号
TN [电子技术、通信技术];
学科分类号
0809 ;
摘要
The similarity search is one of the fundamental components in time series data mining, e.g. clustering, classification, association rules mining. Many methods have been proposed to measure the similarity between time series, including Euclidean distance, Manhattan distance, and dynamic time warping (DTW). In contrast, DTW has been suggested to allow more robust similarity measure and be able to find the optimal alignment in time series. However, due to its quadratic time and space complexity, DTW is not suitable for large time series datasets. Many improving algorithms have been proposed for DTW search in large databases, such as approximate search or exact indexed search. Unlike the previous modified algorithm, this paper presents a novel parallel scheme for fast similarity search based on DTW, which is called MRDTW (MapRedcue-based DTW). The experimental results show that our approach not only retained the original accuracy as DTW, but also greatly improved the efficiency of similarity measure in large time series.
引用
收藏
页码:129 / 140
页数:12
相关论文
共 25 条
  • [11] Debregeas A., 1998, Proceedings Fourth International Conference on Knowledge Discovery and Data Mining, P179
  • [12] Fink E, 2003, IEEE SYS MAN CYBERN, P2332
  • [13] Identification of ancient coins based on fusion of shape and local features
    Huber-Moerk, Reinhold
    Zambanini, Sebastian
    Zaharieva, Maia
    Kampel, Martin
    [J]. MACHINE VISION AND APPLICATIONS, 2011, 22 (06) : 983 - 994
  • [14] Kadous MW, 1999, MACHINE LEARNING, PROCEEDINGS, P454
  • [15] Exact indexing of dynamic time warping
    Keogh, E
    Ratanamahatana, CA
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2005, 7 (03) : 358 - 386
  • [16] Locally adaptive dimensionality reduction for indexing large time series databases
    Keogh, E
    Chakrabarti, K
    Mehrotra, S
    Pazzani, M
    [J]. SIGMOD RECORD, 2001, 30 (02) : 151 - 162
  • [17] Keogh E. J., 1998, Proceedings Fourth International Conference on Knowledge Discovery and Data Mining, P239
  • [18] Supporting exact indexing of arbitrarily rotated shapes and periodic time series under Euclidean and warping distance measures
    Keogh, Eamonn
    Wei, Li
    Xi, Xiaopeng
    Vlachos, Michail
    Lee, Sang-Hee
    Protopapas, Pavlos
    [J]. VLDB JOURNAL, 2009, 18 (03) : 611 - 630
  • [19] Keogh EJ, 1999, LECT NOTES ARTIF INT, V1704, P1
  • [20] Moody G. B., 1983, Computers in Cardiology 10th Annual Meeting (IEEE Cat. No. 83CH1927-3), P227