An efficient similarity searching algorithm based on clustering for time series

Authors
Feng, Yucai [1]
Jiang, Tao [1]
Zhou, Yingbiao [1]
Li, Junkui [1]
Affiliations
[1] Huazhong Univ Sci & Technol, Coll Comp Sci & Technol, Wuhan 430074, Peoples R China
Source
ADVANCES IN DATA MINING, PROCEEDINGS: MEDICAL APPLICATIONS, E-COMMERCE, MARKETING, AND THEORETICAL ASPECTS | 2008 / Vol. 5077
Keywords
time series; clustering; similarity search; indexing;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Indexing large time series databases is crucial for answering time series queries efficiently. In this paper, we propose a novel indexing scheme, RQI (Range Query based on Index), which combines three filtering methods: first-k filtering, index-based lower and upper bounding, and triangle inequality pruning. The basic idea is to compute the Haar wavelet transform of each time series, use its first k coefficients to form an MBR (minimal bounding rectangle), and then apply point filtering. At the same time, lower bounding and upper bounding features of each time series are calculated in advance and stored in the index structure. Finally, triangle inequality pruning is applied using distances between time series that are calculated beforehand. We then introduce a novel lower bounding distance function, SLBS (Symmetrical Lower Bounding based on Segment), and a novel clustering algorithm, CSA (Clustering based on Segment Approximation), to further improve the search efficiency of point filtering by preserving good clustering properties in the index structure. Extensive experiments over both synthetic and real datasets show that our techniques provide perfect pruning power and can achieve an order-of-magnitude performance improvement for time series queries over traditional naive evaluation techniques.
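
The two generic ideas named in the abstract, first-k filtering on Haar wavelet coefficients and triangle inequality pruning, can be illustrated with a minimal sketch. This is not the authors' RQI implementation: the MBR-based index, the stored upper bounds, SLBS and CSA are not reproduced, and every name below (`haar_transform`, `first_k_lower_bound`, `range_query`, `reference_index`) is a hypothetical helper chosen for illustration. The sketch assumes Euclidean distance, series whose length is a power of two, and a single reference series for the triangle inequality.

```python
import math


def haar_transform(series):
    """Orthonormal Haar wavelet transform (assumes a power-of-two length).

    Coefficients are ordered coarse to fine: overall approximation first,
    then detail coefficients from the coarsest to the finest level.
    """
    n = len(series)
    if n == 0 or n & (n - 1):
        raise ValueError("series length must be a power of two")
    approx = [float(x) for x in series]
    details = []
    root2 = math.sqrt(2.0)
    while len(approx) > 1:
        pairs = range(0, len(approx), 2)
        diffs = [(approx[i] - approx[i + 1]) / root2 for i in pairs]
        approx = [(approx[i] + approx[i + 1]) / root2 for i in pairs]
        details = diffs + details  # coarser details go in front
    return approx + details


def euclidean(a, b):
    """Euclidean distance between two equal-length sequences."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def first_k_lower_bound(coeffs_a, coeffs_b, k):
    """Distance over the first k Haar coefficients.

    The transform is orthonormal, so the full coefficient distance equals
    the original Euclidean distance; dropping terms can only shrink it.
    """
    return euclidean(coeffs_a[:k], coeffs_b[:k])


def range_query(query, database, epsilon, k=2, reference_index=0):
    """Return indices of series within epsilon of the query.

    Illustrative only: the per-series features are rebuilt on every call,
    whereas an index would compute and store them in advance.
    """
    coeffs = [haar_transform(s) for s in database]
    reference = database[reference_index]
    dist_to_ref = [euclidean(s, reference) for s in database]

    query_coeffs = haar_transform(query)
    query_to_ref = euclidean(query, reference)

    hits = []
    for i, candidate in enumerate(database):
        # Filter 1: first-k lower bound already exceeds the radius.
        if first_k_lower_bound(query_coeffs, coeffs[i], k) > epsilon:
            continue
        # Filter 2: triangle inequality, |d(q, r) - d(c, r)| <= d(q, c).
        if abs(query_to_ref - dist_to_ref[i]) > epsilon:
            continue
        # Refinement: exact distance on the surviving candidates.
        if euclidean(query, candidate) <= epsilon:
            hits.append(i)
    return hits
```

Both filters are lower bounds on the true distance, so in exact arithmetic they discard only series that cannot be within the query radius; the final exact check removes any false positives that survive.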
Pages: 360 - 373
Page count: 14
Related papers
50 items in total
  • [21] An Expanding Clustering Algorithm Based on Density Searching
    Tan, Liguo
    Liu, Yang
    Chen, Xinglin
    INFORMATION AND MANAGEMENT ENGINEERING, PT VI, 2011, 236 : 110 - 116
  • [22] MDL-based time series clustering
    Thanawin Rakthanmanon
    Eamonn J. Keogh
    Stefano Lonardi
    Scott Evans
    Knowledge and Information Systems, 2012, 33 : 371 - 399
  • [23] An Improved Algorithm of Similarity Based on Clustering in XML
    Wang, Puqing
    PROCEEDINGS OF THE 2016 2ND WORKSHOP ON ADVANCED RESEARCH AND TECHNOLOGY IN INDUSTRY APPLICATIONS, 2016, 81 : 837 - 841
  • [24] MDL-based time series clustering
    Rakthanmanon, Thanawin
    Keogh, Eamonn J.
    Lonardi, Stefano
    Evans, Scott
    KNOWLEDGE AND INFORMATION SYSTEMS, 2012, 33 (02) : 371 - 399
  • [25] Clustering time-series by a novel slope-based similarity measure considering particle swarm optimization
    Kamalzadeh, Hossein
    Ahmadi, Abbas
    Mansour, Saeed
    APPLIED SOFT COMPUTING, 2020, 96
  • [26] Context-aware edge similarity segmentation algorithm of time series
    Wang, Lei
    Xu, Lingyu
    Yu, Jie
    Xue, Yunlan
    Zhang, Gaowei
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2016, 19 (03) : 1421 - 1436
  • [27] Context-aware edge similarity segmentation algorithm of time series
    Lei Wang
    Lingyu Xu
    Jie Yu
    Yunlan Xue
    Gaowei Zhang
    Cluster Computing, 2016, 19 : 1421 - 1436
  • [28] Set-based Similarity Search for Time Series
    Peng, Jinglin
    Wang, Hongzhi
    Li, Jianzhong
    Gao, Hong
    SIGMOD'16: PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2016, : 2039 - 2052
  • [29] Incremental Clustering for Time Series Data based on an Improved Leader Algorithm
    Huynh Thi Thu Thuy
    Duong Tuan Anh
    Vo Thi Ngoc Chau
    2019 IEEE - RIVF INTERNATIONAL CONFERENCE ON COMPUTING AND COMMUNICATION TECHNOLOGIES (RIVF), 2019, : 13 - 18
  • [30] Fuzzy clustering algorithm for time series based on adaptive incremental learning
    Wang, Wei
    Hu, Xiaohui
    Wang, Mingye
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2020, 38 (04) : 3991 - 3998