A New Data Structure to Enhance the Speed of Frequent Pattern Mining

被引:0
作者
Derakhshan, Reza [1 ]
Ahmadi, Ali [1 ]
机构
[1] KN Toosi Univ Technol, Fac Comp Engn, Tehran, Iran
来源
2017 25TH IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE) | 2017年
关键词
data mining; frequent pattern mining; fast mining; data structure; PSC; ALGORITHMS; ITEMSETS;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Frequent pattern mining is a crucial task in data mining and plays an important role in data mining applications. The PrePost(+) algorithm is one of the most leading algorithms in this field which has taken an important step toward frequent pattern mining by utilizing PPC data structure in the tree and lists. In this paper, we present a novel data structure, namely, Pre-order Size Code (PSC) to replace PPC data structure with so as to speed up tree building and completion phases and consequently the speed of frequent pattern mining. In contrast to PPC data structure, there is no need for separate pre-order and post-order tree traversals to complete the PSC in tree nodes. To evaluate the performance of the PrePost(+) with the PSC proposed data structure which named PreSize algorithm, we have conducted experiments to compare it with three state-of-the-art algorithms, on a variety of real data sets. The experimental results show that PreSize is a high performance on running time.
引用
收藏
页码:2128 / 2133
页数:6
相关论文
共 13 条
[1]  
Agrawal R., 1993, SIGMOD Record, V22, P207, DOI 10.1145/170036.170072
[2]  
Agrawal R., P 20 INT C VERY LARG
[3]   MAFIA: A maximal frequent itemset algorithm [J].
Burdick, D ;
Calimlim, M ;
Flannick, J ;
Gehrke, J ;
Yiu, TM .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2005, 17 (11) :1490-1504
[4]   PrePost+: An efficient N-lists-based algorithm for mining frequent itemsets via Children-Parent Equivalence pruning [J].
Deng, Zhi-Hong ;
Lv, Sheng-Long .
EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (13) :5424-5432
[5]   Fast mining frequent itemsets using Nodesets [J].
Deng, Zhi-Hong ;
Lv, Sheng-Long .
EXPERT SYSTEMS WITH APPLICATIONS, 2014, 41 (10) :4505-4512
[6]   A new algorithm for fast mining frequent itemsets using N-lists [J].
Deng ZhiHong ;
Wang ZhongHui ;
Jiang JiaJian .
SCIENCE CHINA-INFORMATION SCIENCES, 2012, 55 (09) :2008-2030
[7]  
Deng ZH, 2010, INT J COMPUT INT SYS, V3, P733
[8]   Fast algorithms for frequent itemset mining using FP-trees [J].
Grahne, G ;
Zhu, JF .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2005, 17 (10) :1347-1362
[9]  
HAN J, 2000, P 2000 ACM SIGMOD IN, P1, DOI DOI 10.1145/342009.335372
[10]  
LI Q, 2001, P VLDB