An efficient approach to mine flexible periodic patterns in time series databases

被引:24
作者
Chanda, Ashis Kumar [1 ]
Saha, Swapnil [1 ]
Nishi, Manziba Akanda [2 ]
Samiullah, Md. [1 ]
Ahmed, Chowdhury Farhan [3 ]
机构
[1] Univ Dhaka, Dept Comp Sci & Engn, Dhaka, Bangladesh
[2] Bangladesh Open Univ, Dept Comp Sci & Engn, Dhaka, Bangladesh
[3] Univ Strasbourg, ICube Lab, Strasbourg, France
关键词
Data mining; Time series databases; Periodic pattern; Suffix tree; Flexible patterns; Knowledge discovery; ONLINE;
D O I
10.1016/j.engappai.2015.04.014
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Periodic pattern mining in time series databases is one of the most interesting data mining problems that is frequently appeared in many real-life applications. Some of the existing approaches find fixed length periodic patterns by using suffix tree structure, i.e., unable to mine flexible patterns. One of the existing approaches generates periodic patterns by skipping intermediate events, i.e., flexible patterns, using apriori based sequential pattern mining approach. Since, apriori based approaches suffer from the issues of huge amount of candidate generation and large percentage of false pattern pruning, we propose an efficient algorithm FPPM (Flexible Periodic Pattern Mining) using suffix trie data structure. The proposed algorithm can capture more effective variable length flexible periodic patterns by neglecting unimportant or undesired events and considering only the important events in an efficient way. To the best of our knowledge, ours is the first approach that simultaneously handles various starting position throughout the sequences, flexibility among events in the mined patterns and interactive tuning of period values on the go. Complexity analysis of the proposed approach and comparison with existing approaches along with analytical comparison on various issues have been performed. As well as extensive experimental analyses are conducted to evaluate the performance of proposed FPPM algorithm using real-life datasets. The proposed approach outperforms existing algorithms in terms of processing time, scalability, and quality of mined patterns. (C) 2015 Elsevier Ltd. All rights reserved.
引用
收藏
页码:46 / 63
页数:18
相关论文
共 42 条