An efficient similarity searching algorithm based on clustering for time series

被引:0
|
作者
Feng, Yucai [1 ]
Jiang, Tao [1 ]
Zhou, Yingbiao [1 ]
Li, Junkui [1 ]
机构
[1] Huazhong Univ Sci & Technol, Coll Comp Sci & Technol, Wuhan 430074, Peoples R China
来源
ADVANCES IN DATA MINING, PROCEEDINGS: MEDICAL APPLICATIONS, E-COMMERCE, MARKETING, AND THEORETICAL ASPECTS | 2008年 / 5077卷
关键词
time series; clustering; similarity search; indexing;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Indexing large time series databases is crucial for efficient searching of time series queries. In the paper, we propose a novel indexing scheme RQI (Range Query based on Index) which includes three filtering methods: first-k filtering, indexing lower bounding and upper bounding as well as triangle inequality pruning. The basic idea is calculating wavelet coefficient whose first k coefficients are used to form a MBR. (minimal bounding rectangle) based on haar wavelet transform for each time series and then using point filtering method; At the same time, lower bounding and upper bounding feature of each time series is calculated, in advance, and stored into index structure. At last, triangle inequality pruning method is used by calculating the distance between time series beforehand. Then we introduce a novel lower bounding distance function SLBS (Symmetrical Lower Bounding based on Segment) and a novel clustering algorithm CSA (Clustering based on Segment Approximation) in order to further improve the search efficiency of point filtering method by keeping a good clustering trait of index structure. Extensive experiments over both synthetic and real datasets show that, our technologies provide perfect pruning power and could obtain an order of magnitude performance improvement for time series queries over traditional naive evaluation techniques.
引用
收藏
页码:360 / 373
页数:14
相关论文
共 50 条
  • [31] AN APPROACH FOR TIME SERIES SIMILARITY SEARCH BASED ON LUCENE
    Chang, Min
    Lou, Yuansheng
    Qiu, Lei
    PROCEEDINGS OF 2016 4TH IEEE INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND INTELLIGENCE SYSTEMS (IEEE CCIS 2016), 2016, : 210 - 214
  • [32] Similarity search algorithm for multivariate time series based on empirical mode decomposition
    Wang, Yan
    Han, Meng
    Ma, Qianqian
    Journal of Computational Information Systems, 2014, 10 (08): : 3247 - 3254
  • [33] A Shape Based Similarity Measure for Time Series Classification with Weighted Dynamic Time Warping Algorithm
    Ye, Yanqing
    Niu, Caiyun
    Jiang, Jiang
    Ge, Bingfeng
    Yang, Kewei
    2017 4TH INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND CONTROL ENGINEERING (ICISCE), 2017, : 104 - 109
  • [34] An Efficient Similarity Search For Financial Multivariate Time Series
    Zhou, Dazhuo
    Li, Minqiang
    Yan, Hongcan
    2008 4TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, NETWORKING AND MOBILE COMPUTING, VOLS 1-31, 2008, : 11161 - 11164
  • [35] An evolutionary K-means algorithm for clustering time series data
    Zhang, H
    Ho, TB
    Lin, MS
    PROCEEDINGS OF THE 2004 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2004, : 1282 - 1287
  • [36] The Clustering Analysis of the Load Model Based on the Time Series
    Xu, Yan-hui
    Song, Ge
    Zhang, Lan-yu
    Lin, Yong
    Fan, Yang
    INTERNATIONAL CONFERENCE ON ELECTRICAL AND ELECTRONIC ENGINEERING (EEE 2014), 2014, : 315 - 319
  • [37] Efficient Time-Series Clustering through Sparse Gaussian Modeling
    Fotakis, Dimitris
    Patsilinakos, Panagiotis
    Psaroudaki, Eleni
    Xefteris, Michalis
    ALGORITHMS, 2024, 17 (02)
  • [38] A novel clustering algorithm for time-series data based on precise correlation coefficient matching in the IoT
    Li, Haibo
    Tong, Juncheng
    MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2019, 16 (06) : 6654 - 6671
  • [39] The unordered time series fuzzy clustering algorithm based on the adaptive incremental learning
    Xu, Huanchun
    Hou, Rui
    Fan, Jinfeng
    Zhou, Liang
    Yue, Hongxuan
    Wang, Liusheng
    Liu, Jiayue
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2020, 38 (04) : 3783 - 3791
  • [40] Analysis of fMRI Time Series: Neutrosophic-Entropy Based Clustering Algorithm
    Singh, Pritpal
    Watorek, Marcin
    Ceglarek, Anna
    Fafrowicz, Magdalena
    Oswiecimka, Pawel
    JOURNAL OF ADVANCES IN INFORMATION TECHNOLOGY, 2022, 13 (03) : 224 - 229