A New Hybrid Algorithm for Association Rule Mining

被引:0
作者
张敏聪
燕存良
朱开玉
机构
[1] NationalDieandMoldCADEngineeringResearchCenter,ShanghaiJiaotongUniversity
关键词
association rule; data mining; hashing; database analysis;
D O I
10.19884/j.1672-5220.2007.05.006
中图分类号
TP301.6 [算法理论];
学科分类号
081202 ;
摘要
HA(hashing array),a new algorithm,for mining frequent itemsets of large database is proposed.It employs a structure hash array,ItemArray() to store the information of database and then uses it instead of database in later iteration.By this improvement,only twice scanning of the whole database is necessary,thereby the computational cost can be reduced significantly.To overcome the performance bottleneck of frequent 2-itemsets mining,a modified algorithm of HA,DHA(direct-addressing hashing and array) is proposed,which combines HA with direct-addressing hashing technique.The new hybrid algorithm,DHA,not only overcomes the performance bottleneck but also inherits the advantages of HA.Extensive simulations are conducted in this paper to evaluate the performance of the proposed new algorithm,and the results prove the new algorithm is more efficient and reasonable.
引用
收藏
页码:598 / 603
页数:6
相关论文
共 2 条
[1]   关联规则挖掘Apriori算法的改进与实现 [J].
陈文庆 ;
许棠 .
微机发展, 2005, (08) :155-157
[2]   挖掘关联规则中Apriori算法的改进 [J].
马盈仓 .
计算机应用与软件, 2004, (11) :82-84