An efficient similarity searching algorithm based on clustering for time series

被引:0
|
作者
Feng, Yucai [1 ]
Jiang, Tao [1 ]
Zhou, Yingbiao [1 ]
Li, Junkui [1 ]
机构
[1] Huazhong Univ Sci & Technol, Coll Comp Sci & Technol, Wuhan 430074, Peoples R China
来源
ADVANCES IN DATA MINING, PROCEEDINGS: MEDICAL APPLICATIONS, E-COMMERCE, MARKETING, AND THEORETICAL ASPECTS | 2008年 / 5077卷
关键词
time series; clustering; similarity search; indexing;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Indexing large time series databases is crucial for efficient searching of time series queries. In the paper, we propose a novel indexing scheme RQI (Range Query based on Index) which includes three filtering methods: first-k filtering, indexing lower bounding and upper bounding as well as triangle inequality pruning. The basic idea is calculating wavelet coefficient whose first k coefficients are used to form a MBR. (minimal bounding rectangle) based on haar wavelet transform for each time series and then using point filtering method; At the same time, lower bounding and upper bounding feature of each time series is calculated, in advance, and stored into index structure. At last, triangle inequality pruning method is used by calculating the distance between time series beforehand. Then we introduce a novel lower bounding distance function SLBS (Symmetrical Lower Bounding based on Segment) and a novel clustering algorithm CSA (Clustering based on Segment Approximation) in order to further improve the search efficiency of point filtering method by keeping a good clustering trait of index structure. Extensive experiments over both synthetic and real datasets show that, our technologies provide perfect pruning power and could obtain an order of magnitude performance improvement for time series queries over traditional naive evaluation techniques.
引用
收藏
页码:360 / 373
页数:14
相关论文
共 50 条
  • [1] Clustering Algorithm Based on Time Series Similarity to Web Data Clustering
    Yang Yan
    Yao Hua-Xiong
    Li Rong
    PROCEEDINGS OF THE 2015 4TH NATIONAL CONFERENCE ON ELECTRICAL, ELECTRONICS AND COMPUTER ENGINEERING ( NCEECE 2015), 2016, 47 : 1373 - 1377
  • [2] An Efficient Clustering Algorithm for Multivariate Time Series
    Zhou, Da-Zhuo
    Zhang, Bo
    EBM 2010: INTERNATIONAL CONFERENCE ON ENGINEERING AND BUSINESS MANAGEMENT, VOLS 1-8, 2010, : 5190 - 5193
  • [3] Clustering Algorithm for Time Series Based on Peak Interval
    Shan Jicheng
    Liu Weike
    2016 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE & COMPUTATIONAL INTELLIGENCE (CSCI), 2016, : 410 - 414
  • [4] A clustering algorithm for time series data
    Yin, Jian
    Zhou, Duanning
    Xie, Qiong-Qiong
    SEVENTH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED COMPUTING, APPLICATIONS AND TECHNOLOGIES, PROCEEDINGS, 2006, : 119 - +
  • [5] Similarity Measure Selection for Clustering Time Series Databases
    Mori, Usue
    Mendiburu, Alexander
    Lozano, Jose A.
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2016, 28 (01) : 181 - 195
  • [6] A Novel Similarity Measurement and Clustering Framework for Time Series Based on Convolution Neural Networks
    Ding, Xin
    Hao, Kuangrong
    Cai, Xin
    Tang, Xue-Song
    Chen, Lei
    Zhang, Haichao
    IEEE ACCESS, 2020, 8 : 173158 - 173168
  • [7] Cluster-Based Similarity Search in Time Series
    Karamitopoulos, Leonidas
    Evangelidis, Georgios
    PROCEEDINGS OF THE 2009 FOURTH BALKAN CONFERENCE IN INFORMATICS, 2009, : 113 - 118
  • [8] Clustering Algorithm for Time Series with Similar Shapes
    Ahn, Jungyu
    Lee, Ju-Hong
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2018, 12 (07): : 3112 - 3127
  • [9] Multi-measure Similarity Searching for Time Series
    Wang, Jimin
    Zhu, Yuelong
    Wan, Dingsheng
    Zhang, Pengcheng
    Feng, Jun
    JOURNAL OF COMPUTERS, 2014, 9 (10) : 2266 - 2273
  • [10] Underlying techniques of efficient similarity search on time series
    Feng, Yu-Cai
    Jiang, Tao
    Li, Guo-Hui
    Zhu, Hong
    Jisuanji Xuebao/Chinese Journal of Computers, 2009, 32 (11): : 2107 - 2122