Collective Mining of Bayesian Networks from Distributed Heterogeneous Data

被引:0
作者
R. Chen
K. Sivakumar
H. Kargupta
机构
[1] Washington State University,School of Electrical Engineering and Computer Science
[2] University of Maryland Baltimore County,Department of Computer Science and Electrical Engineering
来源
Knowledge and Information Systems | 2004年 / 6卷
关键词
Bayesian network; Collective data mining; Distributed data mining; Heterogeneous data; Web log mining;
D O I
暂无
中图分类号
学科分类号
摘要
We present a collective approach to learning a Bayesian network from distributed heterogeneous data. In this approach, we first learn a local Bayesian network at each site using the local data. Then each site identifies the observations that are most likely to be evidence of coupling between local and non-local variables and transmits a subset of these observations to a central site. Another Bayesian network is learnt at the central site using the data transmitted from the local site. The local and central Bayesian networks are combined to obtain a collective Bayesian network, which models the entire data. Experimental results and theoretical justification that demonstrate the feasibility of our approach are presented.
引用
收藏
页码:164 / 187
页数:23
相关论文
共 50 条
  • [31] Distributed Execution Environment for Data Mining as Service
    Kholod, Ivan
    Borisenko, Konstantin
    PROCEEDINGS OF THE 2016 IEEE NORTH WEST RUSSIA SECTION YOUNG RESEARCHERS IN ELECTRICAL AND ELECTRONIC ENGINEERING CONFERENCE (ELCONRUSNW), 2016, : 236 - 241
  • [32] Distributed data mining for e-business
    Liu, Bin
    Cao, Shu Gui
    He, Wu
    INFORMATION TECHNOLOGY & MANAGEMENT, 2011, 12 (02) : 67 - 79
  • [33] DXCS: an XCS system for distributed data mining
    Dam, Hai H.
    Abbass, Hussein A.
    Lokan, Chris
    GECCO 2005: GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, VOLS 1 AND 2, 2005, : 1883 - 1890
  • [34] Distributed data mining services leveraging WSRF
    Congiusta, Antonio
    Talia, Domenico
    Trunfio, Paolo
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF GRID COMPUTING THEORY METHODS AND APPLICATIONS, 2007, 23 (01): : 34 - 41
  • [35] Deep Bayesian network architecture for Big Data mining
    Njah, Hasna
    Jamoussi, Salma
    Mahdi, Walid
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2019, 31 (02)
  • [36] From parallel data mining to grid-enabled distributed knowledge discovery
    Cesario, Eugenio
    Talia, Domenico
    ROUGH SETS, FUZZY SETS, DATA MINING AND GRANULAR COMPUTING, PROCEEDINGS, 2007, 4482 : 25 - +
  • [37] Deploying mobile agents in distributed data mining
    Li, Xining
    Ni, Jinglo
    EMERGING TECHNOLOGIES IN KNOWLEDGE DISCOVERY AND DATA MINING, 2007, 4819 : 322 - +
  • [38] Distributed data mining for e-business
    Bin Liu
    Shu Gui Cao
    Wu He
    Information Technology and Management, 2011, 12 : 67 - 79
  • [39] A communication efficient and scalable distributed data mining for the astronomical data
    Govada, A.
    Sahay, S. K.
    ASTRONOMY AND COMPUTING, 2016, 16 : 166 - 173
  • [40] Distributed data mining in grid computing environment
    Xue, Huifang
    AGRO FOOD INDUSTRY HI-TECH, 2017, 28 (01): : 2719 - 2723