Nearest Neighbor Subsequence Search in Time Series Data

Cited by: 0
Authors
Ahsan, Ramoza [1]
Bashir, Muzammil [1]
Neamtu, Rodica [1]
Rundensteiner, Elke A. [1]
Sarkozy, Gabor [1]
Affiliations
[1] Worcester Polytech Inst, Worcester, MA 01609 USA
Source
2019 IEEE International Conference on Big Data (Big Data) | 2019
Keywords
Time Series Data; Subsequence Mining; Nearest Neighbor Search
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Continuous growth in sensor data and other temporal sequence data necessitates efficient retrieval and similarity search support on these big time series datasets. However, finding exact similarity results, especially at the granularity of subsequences, is known to be prohibitively costly for large data sets. In this paper, we thus propose an efficient framework for solving this exact subsequence similarity match problem, called TINN (Time series Nearest Neighbor search). Exploiting the range interval diversity properties of time series datasets, TINN captures similarity at two levels of abstraction, namely, relationships among subsequences within each long time series and relationships across distinct time series in the data set. These relationships are compactly organized in an augmented relationship graph model, with the former relationships encoded in similarity vectors at TINN nodes and the latter captured by augmented edge types in the TINN Graph. The query processing strategy deploys novel pruning techniques on the TINN Graph, including node skipping and vertical and horizontal pruning, to significantly reduce the number of time series as well as subsequences to be explored. Comprehensive experiments on synthetic and real-world time series data demonstrate that our TINN model consistently outperforms state-of-the-art approaches while still guaranteeing to retrieve exact matches.
Pages: 2057-2066
Number of pages: 10