Upper bounds for can-tree and FP-tree

被引:0
作者
Shahbazi, Nima [1 ]
Gryz, Jarek [2 ]
机构
[1] Mindle Inc, Toronto, ON, Canada
[2] York Univ, Toronto, ON, Canada
关键词
Data mining; Frequent item sets; Pattern mining; Upper bound;
D O I
10.1007/s10844-021-00673-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Two efficient tree structures known as Can-tree (Leung et al., Knowledge and Information Systems, 11(3), 287-311, 2007) and FP-tree (Han et al., 2000) are used to store a database in memory for mining frequent patterns. However, there has been no discussion on tight upper bound for the number of nodes in these trees. Instead, a very loose upper bound of 2(n) (where n is the number of distinct items in the database) is used. In this paper, we provide a tighter upper bound for the number of nodes in Can-tree and FP-tree. The upper bound on the number of nodes is provided through a greedy algorithm for the Can-tree and a closed form solution is derived for the FP-tree. These results are illustrated by examples both in graphical and mathematical form.
引用
收藏
页码:197 / 222
页数:26
相关论文
共 10 条
[1]   PrePost+: An efficient N-lists-based algorithm for mining frequent itemsets via Children-Parent Equivalence pruning [J].
Deng, Zhi-Hong ;
Lv, Sheng-Long .
EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (13) :5424-5432
[2]   A new algorithm for fast mining frequent itemsets using N-lists [J].
Deng ZhiHong ;
Wang ZhongHui ;
Jiang JiaJian .
SCIENCE CHINA-INFORMATION SCIENCES, 2012, 55 (09) :2008-2030
[3]  
Han JW, 2000, SIGMOD RECORD, V29, P1
[4]   CanTree: a canonical-order tree for incremental frequent-pattern mining [J].
Leung, Carson Kai-Sang ;
Khan, Quamrul I. ;
Li, Zhan ;
Hoque, Tariqul .
KNOWLEDGE AND INFORMATION SYSTEMS, 2007, 11 (03) :287-311
[5]   CFP-tree: A compact disk-based structure for storing and querying frequent itemsets [J].
Liu, Guimei ;
Lu, Honjun ;
Yu, Jeffrey Xu .
INFORMATION SYSTEMS, 2007, 32 (02) :295-319
[6]   H-mine: Hyper-structure mining of frequent patterns in large databases [J].
Pei, J ;
Han, JW ;
Lu, HJ ;
Nishio, S ;
Tang, SW ;
Yang, DQ .
2001 IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2001, :441-448
[7]  
Perner P., 2018, LECT NOTES COMPUTER, V10935, P16
[8]  
Schlegel B., 2011, P 14 INT C EXT DAT T, P461
[9]  
Shahbazil N, 2020, LECT NOTES COMPUT SC, V12245, P23, DOI 10.1007/978-3-030-54832-2_4
[10]  
Zaki, 2004, CEUR WORKSHOP PROC, V126