Cluster-Based Similarity Search in Time Series

被引:4
|
作者
Karamitopoulos, Leonidas [1 ]
Evangelidis, Georgios [1 ]
机构
[1] Univ Macedonia, Dept Appl Informat, Thessaloniki, Greece
来源
PROCEEDINGS OF THE 2009 FOURTH BALKAN CONFERENCE IN INFORMATICS | 2009年
关键词
similarity search; clustering; time series; data mining; K-NN QUERIES; INDEX;
D O I
10.1109/BCI.2009.22
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In this paper, we present a new method that accelerates similarity search implemented via one-nearest neighbor on time series data. The main idea is to identify the most similar time series to a given query without necessarily searching over the whole database. Our method is based on partitioning the search space by applying the K-means algorithm on the data. Then, similarity search is performed hierarchically starting from the cluster that lies most closely to the query. This procedure aims at reaching the most similar series without searching all clusters. In this work, we propose to reduce the intrinsically high dimensionality of time series prior to clustering by applying a well known dimensionality reduction technique, namely, the Piecewise Aggregate Approximation, for its simplicity and efficiency. Experiments are conducted on twelve real-world and synthetic datasets covering a wide range of applications.
引用
收藏
页码:113 / 118
页数:6
相关论文
共 50 条
  • [1] Cluster-based genetic segmentation of time series with DWT
    Tseng, Vincent S.
    Chen, Chun-Hao
    Huang, Pai-Chieh
    Hong, Tzung-Pei
    PATTERN RECOGNITION LETTERS, 2009, 30 (13) : 1190 - 1197
  • [2] A CLUSTER-BASED APPRAOCH TO CONTENT BASED TIME SERIES RETRIEVAL (CBTSR)
    Bovolo, Francesca
    Demir, Beguem
    Bruzzone, Lorenzo
    2015 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2015, : 2793 - 2796
  • [3] Set-based Similarity Search for Time Series
    Peng, Jinglin
    Wang, Hongzhi
    Li, Jianzhong
    Gao, Hong
    SIGMOD'16: PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2016, : 2039 - 2052
  • [4] Trend and Value based Time Series Representation for Similarity Search
    Kane, Aminata
    2017 IEEE THIRD INTERNATIONAL CONFERENCE ON MULTIMEDIA BIG DATA (BIGMM 2017), 2017, : 252 - 259
  • [5] Time Series Similarity Search based on Middle Points and Clipping
    Nguyen Thanh Son
    Duong Tuan Anh
    2011 3RD CONFERENCE ON DATA MINING AND OPTIMIZATION (DMO), 2011, : 13 - 19
  • [6] Similarity Search Based on Random Projection for High Frequency Time Series
    Wu, Wei
    Hu, Jingtao
    2008 IEEE CONFERENCE ON CYBERNETICS AND INTELLIGENT SYSTEMS, VOLS 1 AND 2, 2008, : 287 - +
  • [7] AN APPROACH FOR TIME SERIES SIMILARITY SEARCH BASED ON LUCENE
    Chang, Min
    Lou, Yuansheng
    Qiu, Lei
    PROCEEDINGS OF 2016 4TH IEEE INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND INTELLIGENCE SYSTEMS (IEEE CCIS 2016), 2016, : 210 - 214
  • [8] Similarity search and pattern discovery in hydrological time series data mining
    Ouyang, Rulin
    Ren, Liliang
    Cheng, Weiming
    Zhou, Chenghu
    HYDROLOGICAL PROCESSES, 2010, 24 (09) : 1198 - 1210
  • [9] Isomorphism Distance in Multidimensional Time Series and Similarity Search
    Guo Wensheng
    Ji Lianen
    APPLIED MATHEMATICS & INFORMATION SCIENCES, 2013, 7 : 209 - 217
  • [10] Underlying techniques of efficient similarity search on time series
    Feng, Yu-Cai
    Jiang, Tao
    Li, Guo-Hui
    Zhu, Hong
    Jisuanji Xuebao/Chinese Journal of Computers, 2009, 32 (11): : 2107 - 2122