Communication-efficient distributed mining of association rules

Cited by: 0
Authors
Schuster, A [1]
Wolff, R [1]
Affiliations
[1] Technion Israel Inst Technol, Dept Comp Sci, IL-32000 Haifa, Israel
DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Mining for associations between items in large transactional databases is a central problem in the field of knowledge discovery. When the database is partitioned among several share-nothing machines, the problem can be addressed using distributed data mining algorithms. One such algorithm, called CD, was proposed by Agrawal and Shafer in [1] and was later enhanced by the FDM algorithm of Cheung, Han et al. [5]. The main problem with these algorithms is that they do not scale well with the number of partitions. They are thus impractical for use in modern distributed environments such as peer-to-peer systems, in which hundreds or thousands of computers may interact. In this paper we present a set of new algorithms that solve the Distributed Association Rule Mining problem using far less communication. In addition to being very efficient, the new algorithms are also extremely robust. Unlike existing algorithms, they continue to be efficient even when the data is skewed or the partition sizes are imbalanced. We present both experimental and theoretical results concerning the behavior of these algorithms and explain how they can be implemented in different settings.
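As a rough illustration of the count-sharing approach the abstract attributes to CD, the following minimal sketch has each partition count candidate itemsets locally and then sums the counts to decide which itemsets are globally frequent. The function names, the toy data, and the absolute `min_count` threshold are assumptions made for illustration; this is not the algorithm presented in the paper, and the in-process loop merely stands in for the exchange of counts between machines.

```python
from collections import Counter
from itertools import combinations


def local_counts(transactions, k):
    """Count every k-itemset that occurs in one partition's transactions."""
    counts = Counter()
    for transaction in transactions:
        # Sort and deduplicate so each itemset has one canonical tuple form.
        for itemset in combinations(sorted(set(transaction)), k):
            counts[itemset] += 1
    return counts


def globally_frequent(partitions, k, min_count):
    """Sum per-partition counts and keep itemsets with global count >= min_count."""
    total = Counter()
    for partition in partitions:
        # In a real deployment each partition would send its counts over the
        # network; here the exchange is simulated by a local merge (assumption).
        total.update(local_counts(partition, k))
    return {itemset: c for itemset, c in total.items() if c >= min_count}


# Hypothetical toy database split across two share-nothing partitions.
partitions = [
    [["a", "b"], ["a", "c"], ["a", "b", "c"]],
    [["b", "c"], ["a", "b"]],
]
result = globally_frequent(partitions, k=2, min_count=3)
print(result)
```

In this sketch every partition reports a count for every candidate it sees, so the volume of exchanged counts grows with the number of partitions, which is one plausible reading of the scaling problem the abstract describes.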
Pages: 473-484
Page count: 12