An improved frequent pattern growth method for mining association rules

被引：63

作者：

Lin, Ke-Chung ^{[1
]}

Liao, I-En ^{[1
]}

Chen, Zhi-Sheng ^{[1
]}

机构：

[1] Natl Chung Hsing Univ, Dept Comp Sci & Engn, Taichung 40227, Taiwan

来源：

EXPERT SYSTEMS WITH APPLICATIONS | 2011年 / 38卷 / 05期

关键词：

FP-tree; Frequent itemset mining; Association rules;

D O I：

10.1016/j.eswa.2010.10.047

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Many algorithms have been proposed to efficiently mine association rules. One of the most important approaches is FP-growth. Without candidate generation, FP-growth proposes an algorithm to compress information needed for mining frequent itemsets in FP-tree and recursively constructs FP-trees to find all frequent itemsets. Performance results have demonstrated that the FP-growth method performs extremely well. In this paper, we propose the IFP-growth (improved FP-growth) algorithm to improve the performance of FP-growth. There are three major features of IFP-growth. First, it employs an address-table structure to lower the complexity of forming the entire FP-tree. Second, it uses a new structure called FP-tree(+) to reduce the need for building conditional FP-trees recursively. Third, by using address-table and FP-tree the proposed algorithm has less memory requirement and better performance in comparison with FP-tree based algorithms. The experimental results show that the IFP-growth requires relatively little memory space during the mining process. Even when the minimum support is low, the space needed by IFP-growth is about one half of that of FP-growth and about one fourth of that of nonordfp algorithm. As to the execution time, our method outperforms FP-growth by one to 300 times under different minimum supports. The proposed algorithm also outperforms nonordfp algorithm in most cases. As a result, IFP-growth is very suitable for high performance applications. (C) 2010 Elsevier Ltd. All rights reserved.

引用

页码：5154 / 5161

页数：8

共 10 条

[1]

Agrawal R., 1993, SIGMOD Record, V22, P207, DOI 10.1145/170036.170072

[2]

Agrawal R., 1994, VLDB 1994, P487

[3] Fast algorithms for frequent itemset mining using FP-trees [J].

Grahne, G ;

Zhu, JF .

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2005, 17 (10) :1347-1362

[4]

HAN J, 2000, P 2000 ACM SIGMOD IN, P1, DOI DOI 10.1145/342009.335372

[5] Mining frequent patterns without candidate generation: A frequent-pattern tree approach [J].

Han, JW ;

Pei, J ;

Yin, YW ;

Mao, RY .

DATA MINING AND KNOWLEDGE DISCOVERY, 2004, 8 (01) :53-87

[6] Mining frequent itemsets over data streams using efficient window sliding techniques [J].

Li, Hua-Fu ;

Lee, Suh-Yin .

EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (02) :1466-1477

[7]

ORLANDO S., 2003, P IEEE ICDM WORKSH F

[8]

Park JS, 1997, IEEE T KNOWL DATA EN, V9, P813, DOI 10.1109/69.634757

[9]

RACZ B, 2004, P IEEE ICDM WORKSH F

[10]

Zaki M. J., 1997, Proceedings of the Third International Conference on Knowledge Discovery and Data Mining, P283

← 1 →