An improved frequent pattern growth method for mining association rules

被引:63
作者
Lin, Ke-Chung [1 ]
Liao, I-En [1 ]
Chen, Zhi-Sheng [1 ]
机构
[1] Natl Chung Hsing Univ, Dept Comp Sci & Engn, Taichung 40227, Taiwan
关键词
FP-tree; Frequent itemset mining; Association rules;
D O I
10.1016/j.eswa.2010.10.047
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Many algorithms have been proposed to efficiently mine association rules. One of the most important approaches is FP-growth. Without candidate generation, FP-growth proposes an algorithm to compress information needed for mining frequent itemsets in FP-tree and recursively constructs FP-trees to find all frequent itemsets. Performance results have demonstrated that the FP-growth method performs extremely well. In this paper, we propose the IFP-growth (improved FP-growth) algorithm to improve the performance of FP-growth. There are three major features of IFP-growth. First, it employs an address-table structure to lower the complexity of forming the entire FP-tree. Second, it uses a new structure called FP-tree(+) to reduce the need for building conditional FP-trees recursively. Third, by using address-table and FP-tree the proposed algorithm has less memory requirement and better performance in comparison with FP-tree based algorithms. The experimental results show that the IFP-growth requires relatively little memory space during the mining process. Even when the minimum support is low, the space needed by IFP-growth is about one half of that of FP-growth and about one fourth of that of nonordfp algorithm. As to the execution time, our method outperforms FP-growth by one to 300 times under different minimum supports. The proposed algorithm also outperforms nonordfp algorithm in most cases. As a result, IFP-growth is very suitable for high performance applications. (C) 2010 Elsevier Ltd. All rights reserved.
引用
收藏
页码:5154 / 5161
页数:8
相关论文
共 10 条
[1]  
Agrawal R., 1993, SIGMOD Record, V22, P207, DOI 10.1145/170036.170072
[2]  
Agrawal R., 1994, VLDB 1994, P487
[3]   Fast algorithms for frequent itemset mining using FP-trees [J].
Grahne, G ;
Zhu, JF .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2005, 17 (10) :1347-1362
[4]  
HAN J, 2000, P 2000 ACM SIGMOD IN, P1, DOI DOI 10.1145/342009.335372
[5]   Mining frequent patterns without candidate generation: A frequent-pattern tree approach [J].
Han, JW ;
Pei, J ;
Yin, YW ;
Mao, RY .
DATA MINING AND KNOWLEDGE DISCOVERY, 2004, 8 (01) :53-87
[6]   Mining frequent itemsets over data streams using efficient window sliding techniques [J].
Li, Hua-Fu ;
Lee, Suh-Yin .
EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (02) :1466-1477
[7]  
ORLANDO S., 2003, P IEEE ICDM WORKSH F
[8]  
Park JS, 1997, IEEE T KNOWL DATA EN, V9, P813, DOI 10.1109/69.634757
[9]  
RACZ B, 2004, P IEEE ICDM WORKSH F
[10]  
Zaki M. J., 1997, Proceedings of the Third International Conference on Knowledge Discovery and Data Mining, P283