Maximum item first pattern growth for mining frequent patterns

被引:0
作者
Fan, HJ [1 ]
Fan, M
Wang, BZ
机构
[1] Univ Melbourne, Dept CSSE, Parkville, Vic 3052, Australia
[2] Zhengzhou Univ, Dept Comp Sci, Zhengzhou, Peoples R China
来源
ROUGH SETS, FUZZY SETS, DATA MINING, AND GRANULAR COMPUTING | 2003年 / 2639卷
关键词
data mining; frequent patterns; FP-tree; association rules;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Frequent pattern mining plays an essential role in many important data mining tasks. FP-Growth, an algorithm which mines frequent patterns in the frequent pattern tree (FP-tree), is very efficient. However, it still encounters performance bottlenecks when creating conditional FP-trees recursively during the mining process. In this work, we propose a new algorithm, called Maximum-Item-First Pattern Growth (MIFPG), for mining frequent patterns. MIFPG searches the FP-tree in the depth-first, top-down manner, as opposed to the bottom-up order of FP-Growth. Its key idea is that maximum items are always considered first when the current pattern grows. In this way, no concrete realization of conditional pattern bases is needed and the major operations of mining are counting and link adjusting, which are usually inexpensive. Experiments show that, in comparison with FP-Growth, our algorithm is about three times faster and consumes less memory space; it also has good time and space scalability with the number of transactions.
引用
收藏
页码:515 / 523
页数:9
相关论文
共 11 条
  • [1] AGARWA RC, 2001, J PARALLEL DISTRIBUT, V61
  • [2] AGRAWAL R, P ICDE 95 TAIP
  • [3] Agrawal R., 1994, P 20 INT C VER LARG, V1215, P487
  • [4] [Anonymous], P 1998 ACM SIGMOD IN
  • [5] [Anonymous], 1999, Proceedings of the fifth ACM SIGKDD international conference on knowledge discovery and data mining p, DOI [10.1145/312129., DOI 10.1145/312129, 10.1145/312129, 10.1145/312129.312191]
  • [6] BRIN S, 1998, DATA MINING KNOWLEDG, V2
  • [7] FAN H, P PAKDD 02 TAIP
  • [8] HAN J, 2000, P 2000 ACM SIGMOD IN, P1, DOI DOI 10.1145/342009.335372
  • [9] Efficient mining of partial periodic patterns in time series database
    Han, JW
    Dong, GZ
    Yin, YW
    [J]. 15TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 1999, : 106 - 115
  • [10] PEI J, P ICDM 01 SAN JOS CA