Collective Mining of Bayesian Networks from Distributed Heterogeneous Data

被引:0
|
作者
R. Chen
K. Sivakumar
H. Kargupta
机构
[1] Washington State University,School of Electrical Engineering and Computer Science
[2] University of Maryland Baltimore County,Department of Computer Science and Electrical Engineering
来源
Knowledge and Information Systems | 2004年 / 6卷
关键词
Bayesian network; Collective data mining; Distributed data mining; Heterogeneous data; Web log mining;
D O I
暂无
中图分类号
学科分类号
摘要
We present a collective approach to learning a Bayesian network from distributed heterogeneous data. In this approach, we first learn a local Bayesian network at each site using the local data. Then each site identifies the observations that are most likely to be evidence of coupling between local and non-local variables and transmits a subset of these observations to a central site. Another Bayesian network is learnt at the central site using the data transmitted from the local site. The local and central Bayesian networks are combined to obtain a collective Bayesian network, which models the entire data. Experimental results and theoretical justification that demonstrate the feasibility of our approach are presented.
引用
收藏
页码:164 / 187
页数:23
相关论文
共 50 条
  • [21] Trajectory Data Mining in Distributed Sensor Networks
    Qiao, Shaojie
    Jin, Huidong
    Gao, Yunjun
    Tang, Lu-An
    Xing, Huanlai
    INTERNATIONAL JOURNAL OF DISTRIBUTED SENSOR NETWORKS, 2015,
  • [22] Distributed Bayesian Matrix Decomposition for Big Data Mining and Clustering
    Zhang, Chihao
    Yang, Yang
    Zhou, Wei
    Zhang, Shihua
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2022, 34 (08) : 3701 - 3713
  • [23] Data mining of Bayesian networks using cooperative coevolution
    Wong, ML
    Lee, SY
    Leung, KS
    DECISION SUPPORT SYSTEMS, 2004, 38 (03) : 451 - 472
  • [24] Data mining based Bayesian networks for best classification
    Ouali, Abdelaziz
    Cherif, Amar Ramdane
    Krebs, Marie-Odile
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2006, 51 (02) : 1278 - 1292
  • [25] Application of Bayesian networks and data mining to biomedical problems
    Kammerdiner, Alla R.
    Gupal, Anatoliy M.
    Pardalos, Panos M.
    DATA MINING, SYSTEMS ANALYSIS, AND OPTIMIZATION IN BIOMEDICINE, 2007, 953 : 132 - +
  • [26] Research and application of structure learning algorithm for Bayesian networks from distributed data
    Zhang, SZ
    Ding, H
    Wang, XK
    Liu, H
    2003 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-5, PROCEEDINGS, 2003, : 1667 - 1671
  • [27] Intelligent Data Mining in Autonomous Heterogeneous Distributed Bio Databases
    Shamim, Azra
    Shaikh, Maqbool Uddin
    Malik, Saif Ur Rehman
    2010 SECOND INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING AND APPLICATIONS: ICCEA 2010, PROCEEDINGS, VOL 1, 2010, : 6 - 10
  • [28] Distributed Data Mining for Multiple Sourced Heterogeneous Datasets: A Survey
    Li, Xing-ying
    Li, Shan-zi
    Wu, Yi-xuan
    He, Ai-jia
    Huang, Xiao-ya
    Zhao, Xin
    2018 3RD INTERNATIONAL CONFERENCE ON COMPUTATIONAL MODELING, SIMULATION AND APPLIED MATHEMATICS (CMSAM 2018), 2018, 310 : 329 - 337
  • [29] Mining collective pair data from the web
    Fan, Cong
    Jiang, Long
    Zhou, Ming
    Wang, Shi-Long
    PROCEEDINGS OF 2007 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2007, : 3997 - +
  • [30] Distributed data mining in peer-to-peer networks
    Datta, Souptik
    Bhaduri, Kanishka
    Giannella, Chris
    Kargupta, Hillol
    Wolff, Ran
    IEEE INTERNET COMPUTING, 2006, 10 (04) : 18 - 26