An efficient compression technique for frequent itemset generation in association rule mining

被引:0
作者
Ashrafi, MZ [1 ]
Taniar, D [1 ]
Smith, K [1 ]
机构
[1] Monash Univ, Sch Business Syst, Clayton, Vic 3800, Australia
来源
ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS | 2005年 / 3518卷
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Association Rule mining is one of the widely used data mining techniques. To achieve a better performance, many efficient algorithms have been proposed. Despite these efforts, we are often unable to complete a mining task because these algorithms require a large amount of main memory to enumerate all frequent itemsets, especially when dataset is large or the user-specified support is low. Thus, it becomes apparent that we need to have an efficient main memory handling technique, which allows association rule mining algorithms to handle larger datasets in main memory. To achieve this goal, in this paper we propose an algorithm for vertical association rule mining that compresses a vertical dataset in an efficient manner, using bit vectors. Our performance evaluations show that the compression ratio attained by our proposed technique is better than those of the other well known techniques.
引用
收藏
页码:125 / 135
页数:11
相关论文
共 15 条
[1]  
Agrawal R., 1993, SIGMOD Record, V22, P207, DOI 10.1145/170036.170072
[2]  
Agrawal R., 1994, P 20 INT C VER LARG, V1215, P407
[3]  
ASHOKA S, 1995, P VLDB ZUR SWIZ, P432
[4]  
Blake C.L., 1998, UCI repository of machine learning databases
[5]  
DOUG B, 2001, P 17 INT C DAT ENG H, P443
[6]  
ELHAJJ M, 2003, P 9 ACM SIGKDD
[7]  
GEOTHALS B, 2003, P ACM SAC
[8]  
GOETHALS B, 2003, FIMI REPOSITORY
[9]  
GOETHALS B, SURVEY FREQUENT PATT
[10]  
HAN J, 2000, P 2000 ACM SIGMOD IN, P1, DOI DOI 10.1145/342009.335372