An efficient distributed algorithm for mining association rules

被引:0
作者
Farzanyar, Zahra [1 ]
Kangavari, Mohammadreza [2 ]
Hashemi, Sattar [2 ]
机构
[1] Iran Univ Sci & Technol, SECOMP Lab, Dept Comp & IT, Tehran, Iran
[2] Iran Univ Sci & Technol, Dept Comp & IT, Tehran, Iran
来源
PARALLEL AND DISTRIBUTED PROCESSING AND APPLICATIONS | 2006年 / 4330卷
关键词
distributed data mining; association rules; distributed databases;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Association Rule Mining (ARM) is an active data mining research area. However, most ARM algorithms cater to a centralized environment where no external communication is required. Distributed Association Rule Mining (DARM) algorithms aim to generate rules from different datasets spread over various geographical sites; hence, they require external communications throughout the entire processor. A direct application of sequential algorithms to distributed databases is not effective, because it requires a large amount of communication overhead. DARM algorithms must reduce communication costs. In this paper, a new solution is proposed to reduce the size of message exchanges. Our solution also reduces the size of average transactions and datasets that leads to reduction of scan time, which is very effective in increasing the performance of the proposed algorithm. Our performance study shows that this solution has a better performance over the direct application of a typical sequential algorithm.
引用
收藏
页码:383 / +
页数:3
相关论文
共 24 条
  • [1] Parallel mining of association rules
    Agrawal, R
    Shafer, JC
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 1996, 8 (06) : 962 - 969
  • [2] AGRAWAL R, 1994, P 20 INTL C VER LARG, P40
  • [3] [Anonymous], 1996, Advances in Knowledge Discovery and Data Mining, DOI DOI 10.1007/978-3-319-31750-2.
  • [4] [Anonymous], P INT C VER LARG DAT
  • [5] ASHRAFI MZ, 1955, IEEE DISTRIBUTED SYS, V5
  • [6] Blake C.L., 1998, UCI repository of machine learning databases
  • [7] CHEUNG D, 1996, IEEE T KNOWLEDGE DAT
  • [8] Cheung DW, 1996, PROCEEDINGS OF THE FOURTH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED INFORMATION SYSTEMS, P31, DOI 10.1109/PDIS.1996.568665
  • [9] GANTI V, 1999, MINING VERY LARGE DA
  • [10] HAN EH, 2000, IEEE T KNOWLEDGE DAT