Learning Bayesian networks from incomplete data based on EMI method

被引:0
|
作者
Tian, FZ [1 ]
Zhang, HW [1 ]
Lu, YC [1 ]
机构
[1] Tsing Hua Univ, Dept Comp Sci & Technol, Beijing 100084, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Currently, there are few efficient methods in practice for learning Bayesian networks from incomplete data, which affects their use in real world data mining applications. This paper presents a general-duty method that estimates the (Conditional) Mutual Information directly from incomplete datasets, EMI. EMI starts by computing the interval estimates of a joint probability of a variable set, which are obtained from the possible completions of the incomplete dataset. And then computes a point estimate via a convex combination of the extreme points, with weights depending on the assumed pattern of missing data. Finally, based on these point estimates, EMI gets the estimated (conditional) Mutual Information. This paper also applies EMI to the dependency analysis based learning algorithm by J. Cheng so as to efficiently learn BNs with incomplete data. The experimental results on Asia and Alarm networks show that EMI based algorithm is much more efficient than two search & scoring based algorithms, SEM and EM-EA algorithms. In terms of accuracy, EMI based algorithm is more accurate than SEM algorithm, and comparable with EM-EA algorithm.
引用
收藏
页码:323 / 330
页数:8
相关论文
共 50 条
  • [21] Rehabilitating of Incomplete Data Sets Based on Bayesian networks
    Li, Xiaoyi
    Xu, Zhaodi
    Li, Zhenpeng
    2008 CHINESE CONTROL AND DECISION CONFERENCE, VOLS 1-11, 2008, : 2595 - 2598
  • [22] LEARNING BAYESIAN NETWORKS FOR REGRESSION FROM INCOMPLETE DATABASES
    Fernandez, Antonio
    Nielsen, Jens D.
    Salmeron, Antonio
    INTERNATIONAL JOURNAL OF UNCERTAINTY FUZZINESS AND KNOWLEDGE-BASED SYSTEMS, 2010, 18 (01) : 69 - 86
  • [23] METHOD OF PROBABILISTIC INFERENCE FROM LEARNING DATA IN BAYESIAN NETWORKS
    Terent'yev, A. N.
    Biduk, P. I.
    CYBERNETICS AND SYSTEMS ANALYSIS, 2007, 43 (03) : 391 - 396
  • [24] Scalable Structure Learning of Continuous-Time Bayesian Networks from Incomplete Data
    Linzner, Dominik
    Schmidt, Michael
    Koeppl, Heinz
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [25] A simulated annealing-based method for learning Bayesian networks from statistical data
    Janzura, M
    Nielsen, J
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2006, 21 (03) : 335 - 348
  • [26] Efficient learning of bounded-treewidth Bayesian networks from complete and incomplete data sets
    Scanagatta, Mauro
    Corani, Giorgio
    Zaffalon, Marco
    Yoo, Jaemin
    Kang, U.
    INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2018, 95 : 152 - 166
  • [27] Efficient learning of bounded-treewidth Bayesian networks from complete and incomplete data sets
    Scanagatta, Mauro (mauro@idsia.ch), 1600, Elsevier Inc. (95):
  • [28] Learning Bayesian network equivalence classes from incomplete data
    Borchani, Hanen
    Ben Amor, Nahla
    Mellouli, Khaled
    DISCOVERY SCIENCE, PROCEEDINGS, 2006, 4265 : 291 - 295
  • [29] Study of the Case of Learning Bayesian Network from Incomplete Data
    Cao Yonghui
    2009 INTERNATIONAL CONFERENCE ON INFORMATION MANAGEMENT, INNOVATION MANAGEMENT AND INDUSTRIAL ENGINEERING, VOL 4, PROCEEDINGS, 2009, : 66 - 69
  • [30] An experimental comparison of methods for handling incomplete data in learning parameters of Bayesian networks
    Onisko, A
    Druzdzel, MJ
    Wasyluk, H
    INTELLIGENT INFORMATION SYSTEMS 2002, PROCEEDINGS, 2002, 17 : 351 - 360