Efficiently mining frequent itemsets with compact FP-tree

被引:0
作者
Qin, LX [1 ]
Luo, P [1 ]
Shi, ZZ [1 ]
机构
[1] Chinese Acad Sci, Comp Technol Inst, Key Lab Intelligent Informat Proc, Beijing 100080, Peoples R China
来源
INTELLIGENT INFORMATION PROCESSING II | 2005年 / 163卷
关键词
association rule; frequent patterns; compact FP-tree;
D O I
10.1007/0-387-23152-8_51
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
FP-growth algorithm is an efficient algorithm for mining frequent patterns. It scans database only twice and does not need to generate and test the candidate sets that is quite time consuming. The efficiency of the FP-growth algorithm outperforms previously developed algorithms. But, it must recursively generate huge number of conditional FP-trees that requires much more memory and costs more time. In this paper, we present an algorithm, CFPmine, that is inspired by several previous works. CFPmine algorithm combines several advantages of existing techniques. One is using constrained subtrees of a compact FP-tree to mine frequent pattern, so that it is doesn't need to construct conditional FP-trees in the mining process. Second is using an array-based technique to reduce the traverse time to the CFP-tree. And an unified memeory management is also implemented in the algorithm. The experimental evaluation shows that CFPmine algorithm is a high performance algorithm. It outperforms Apriori, Eclat and FP-growth and requires less memory than FP-growth.
引用
收藏
页码:397 / 406
页数:10
相关论文
共 50 条
[41]   Cherry: An algorithm for mining frequent closed itemsets without subset checking [J].
Tao, Li-Min ;
Huang, Lin-Peng .
Ruan Jian Xue Bao/Journal of Software, 2008, 19 (02) :379-388
[42]   Mining frequent patterns with the pattern tree [J].
Huang, H ;
Wu, XD ;
Relue, R .
NEW GENERATION COMPUTING, 2005, 23 (04) :315-337
[43]   Frequent tree pattern mining: A survey [J].
Jimenez, Aida ;
Berzal, Fernando ;
Cubero, Juan-Carlos .
INTELLIGENT DATA ANALYSIS, 2010, 14 (06) :603-622
[44]   Mining frequent patterns with the pattern tree [J].
Hao Huang ;
Xindong Wu ;
Richard Relue .
New Generation Computing, 2005, 23 :315-337
[45]   Mining frequent itemsets in data streams using the weighted sliding window model [J].
Tsai, Pauray S. M. .
EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (09) :11617-11625
[46]   Approximation to expected support of frequent itemsets in mining probabilistic sets of uncertain data [J].
Cuzzocrea, Alfredo ;
Leung, Carson K. ;
MacKinnon, Richard Kyle .
KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS 19TH ANNUAL CONFERENCE, KES-2015, 2015, 60 :613-622
[47]   A fast Parallel Association Rule Mining Algorithm Based on the Probability of Frequent Itemsets [J].
Mohamed, Marghny H. ;
Refaat, Hosam E. .
INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2011, 11 (05) :152-162
[48]   MINING FREQUENT PATTERNS BY THE GROUPING COMPRESSION TREE [J].
Chang, Ray-I ;
Chuang, Chi-Cheng .
INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2013, 9 (08) :3115-3132
[49]   Cluster based bit vector mining algorithm for finding frequent itemsets in temporal databases [J].
Krishnamurthy, M. ;
Kannan, A. ;
Baskaran, R. ;
Kavitha, M. .
WORLD CONFERENCE ON INFORMATION TECHNOLOGY (WCIT-2010), 2011, 3
[50]   Mining Top-k Frequent-regular Itemsets from Incremental Transactional Database [J].
Tagmatcha, Bandit ;
Amphawan, Komate .
2018 5TH INTERNATIONAL CONFERENCE ON ADVANCED INFORMATICS: CONCEPTS, THEORY AND APPLICATIONS (ICAICTA 2018), 2018, :231-237