One Database Pass Algorithms of Mining Top-k Frequent Closed Itemsets

被引:0
作者
Qiu, Yong [1 ]
Lan, Yong-Jie [1 ]
机构
[1] Shandong Inst Business & Technol, Sch Comp Sience & Technol, Yantai 264005, Peoples R China
来源
ICCSSE 2009: PROCEEDINGS OF 2009 4TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE & EDUCATION | 2009年
关键词
Data mining; Frequent Closed Itemsets; Top-k;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The FP-growth algorithm is a powerful algorithm to mine frequent patterns and it is non-candidate generation algorithm using a special structure FP-tree. Many algorithms proposed recently are based on FP-tree. These algorithms include all frequent itemsets mining, closed frequent itemsets mining and top-k closed frequent itemsets mining. However, it still requires two database scans, Although it is not a problem for static database, it is not efficient for frequent pattern mining, interactive and incremental mining. In order to enhance the efficiency of FP-tree based algorithms, propose a novel algorithm called QFPC which can create FP-tree with one database pass. Also propose a novel algorithm PFPTC to create FP-tree parallelly.
引用
收藏
页码:828 / 833
页数:6
相关论文
共 17 条
[1]  
Agrawal R., VLDB 94, P487
[2]  
[Anonymous], 2000, P 2000 ACM SIGMOD IN
[3]  
BAYARDO RJ, SIGMOD 98, P85
[4]  
Brin S., 1997, SIGMOD Record, V26, P265, DOI [10.1145/253262.253325, 10.1145/253262.253327]
[5]  
GRAHNE G, 2006, IEEE T KNOWLEDGE DAT, V17
[6]  
Grahne Gosta., 2003, P SIAM 03 WORKSHOP H, P135
[7]  
LIN DI, 1998, P 6 EUR C EXT DAT TE
[8]  
PARK JS, 1996, P 1995 ACM SIGMOD C, P175
[9]  
PASQUIER N, 2005, P INT C DAT THEOR JA, P398
[10]  
PEI J, 2000, P 2000 ACM SIGMOD IN