CanTree: a canonical-order tree for incremental frequent-pattern mining

被引:99
作者
Leung, Carson Kai-Sang [1 ]
Khan, Quamrul I. [1 ]
Li, Zhan [1 ]
Hoque, Tariqul [1 ]
机构
[1] Univ Manitoba, Dept Comp Sci, Winnipeg, MB R3T 2N2, Canada
关键词
knowledge discovery and data mining; tree structure; frequent sets; incremental mining; constrained mining; interactive mining;
D O I
10.1007/s10115-006-0032-8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Since its introduction, frequent-pattern mining has been the subject of numerous studies, including incremental updating. Many existing incremental mining algorithms are Apriori-based, which are not easily adoptable to FP-tree-based frequent-pattern mining. In this paper, we propose a novel tree structure, called CanTree (canonical-order tree), that captures the content of the transaction database and orders tree nodes according to some canonical order. By exploiting its nice properties, the CanTree can be easily maintained when database transactions are inserted, deleted, and/or modified. For example, the CanTree does not require adjustment, merging, and/or splitting of tree nodes during maintenance. No rescan of the entire updated database or reconstruction of a new tree is needed for incremental updating. Experimental results show the effectiveness of our CanTree in the incremental mining of frequent patterns. Moreover, the applicability of CanTrees is not confined to incremental mining; CanTrees can also be applicable to other frequent-pattern mining tasks including constrained mining and interactive mining.
引用
收藏
页码:287 / 311
页数:25
相关论文
共 38 条
[1]  
Agrawal R., 1993, SIGMOD Record, V22, P207, DOI 10.1145/170036.170072
[2]  
Agrawal R., 1994, Proceedings of the 20th International Conference on Very Large Data Bases. VLDB'94, P487
[3]  
[Anonymous], P 1998 ACM SIGMOD IN
[4]  
[Anonymous], P 1996 ACM SIGMOD IN
[5]  
[Anonymous], P 1998 ACM SIGMOD IN
[6]  
[Anonymous], P ACM SIGMOD 98
[7]  
[Anonymous], 1999, KDD
[8]  
Blake C.L., 1998, UCI repository of machine learning databases
[9]   On closed constrained frequent pattern mining [J].
Bonchi, F ;
Lucchese, C .
FOURTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2004, :35-42
[10]   Efficient breadth-first mining of frequent pattern with monotone constraints [J].
Bonchi, F ;
Giannotti, F ;
Mazzanti, A ;
Pedreschi, D .
KNOWLEDGE AND INFORMATION SYSTEMS, 2005, 8 (02) :131-153