Improvements in the data partitioning approach for frequent itemsets mining

被引:0
作者
Nguyen, SN [1 ]
Orlowska, ME [1 ]
机构
[1] Univ Queensland, Sch Informat Technol & Elect Engn, Brisbane, Qld 4072, Australia
来源
KNOWLEDGE DISCOVERY IN DATABASES: PKDD 2005 | 2005年 / 3721卷
关键词
association rules; frequent itemset; partition; performance;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Frequent Itemsets mining is well explored for various data types, and its computational complexity is well understood. There are methods to deal effectively with computational problems. This paper shows another approach to further performance enhancements of frequent items sets computation. We have made a series of observations that led us to inventing data preprocessing methods such that the final step of the Partition algorithm, where a combination of all local candidate sets must be processed, is executed on substantially smaller input data. The paper shows results from several experiments that confirmed our general and formally presented observations.
引用
收藏
页码:625 / 633
页数:9
相关论文
共 13 条
[1]  
Agrawal R, 1994, P 20 INT C VER LARG, V1215, P487
[2]  
[Anonymous], 1993, P 1993 ACM SIGMOD IN
[3]  
Brin S., 1997, SIGMOD Record, V26, P255, DOI [10.1145/253262.253327, 10.1145/253262.253325]
[4]  
Frasca, 2000, SIGKDD EXPLORATIONS, V2, P86, DOI [10.1145/380995.381033, DOI 10.1145/380995.381033]
[5]  
GOETHALS B, 2002, SURVEY FREQUENT PATT
[6]   Mining frequent patterns without candidate generation: A frequent-pattern tree approach [J].
Han, JW ;
Pei, J ;
Yin, YW ;
Mao, RY .
DATA MINING AND KNOWLEDGE DISCOVERY, 2004, 8 (01) :53-87
[7]  
LIN JL, 1998, P 14 IEEE INT C DAT
[8]  
Mueller A., 1995, CSTR3515 U MAR
[9]  
NGUYEN SN, 2005, DATA PARTITIONING AP
[10]  
PUDI V, 2003, WORKSH FREQ IT MIN I