Learning the Structure of Bayesian Networks from Incomplete Data Using a Mixture Model

被引:0
作者
Salman I. [1 ]
Vomlel J. [2 ]
机构
[1] Faculty of Nuclear Sciences and Physical Engineering, Czech Technical University in Prague, Trojanova 13, Prague
[2] Institute of Information Theory and Automation of the CA, Pod Vodárenskou věží 4, Prague
来源
Informatica (Slovenia) | 2023年 / 47卷 / 01期
关键词
Bayesian network; belief-noisy-OR; EM-mixture; incomplete data; structure learning;
D O I
10.31449/inf.v47i1.4497
中图分类号
学科分类号
摘要
In this paper, we provide an approach to learning optimal Bayesian network (BN) structures from incomplete data based on the BIC score function using a mixture model to handle missing values. We have compared the proposed approach with other methods. Our experiments have been conducted on different models, some of them Belief Noisy-Or (BNO) ones. We have performed experiments using datasets with values missing completely at random having different missingness rates and data sizes. We have analyzed the significance of differences between the algorithm performance levels using the Wilcoxon test. The new approach typically learns additional edges in the case of Belief Noisy-or models. We have analyzed this issue using the Chi-square test of independence between the variables in the true models; this approach reveals that additional edges can be explained by strong dependence in generated data. An important property of our new method for learning BNs from incomplete data is that it can learn not only optimal general BNs but also specific Belief Noisy-Or models which is using in many applications such as medical application. © 2023 Slovene Society Informatika. All rights reserved.
引用
收藏
页码:81 / 94
页数:13
相关论文
共 19 条
  • [1] Friedman Nir, Geiger Dan, Goldszmidt Moises, Bayesian network classifiers, Machine Learning, 20, 2-3, pp. 131-163, (1997)
  • [2] de Campos Cassio P, Scanagatta Mauro, Corani Giorgio, Zaffalon Marco, Entropy-based pruning for learning Bayesian networks using BIC, Artificial Intelligence, 260, pp. 42-50, (2018)
  • [3] Ruggieri Andrea, Stranieri Francesco, Stella Fabio, Scutari Marco, Hard and soft EM in Bayesian network learning from incomplete data, Algorithms, 13, 12, (2020)
  • [4] Pearl Judea, Probabilistic reasoning in intelligent systems: networks of plausible inference, (1988)
  • [5] Friedman Nir, Goldszmidt Moises, Learning Bayesian networks with local structure, Learning in graphical models, pp. 421-459, (1998)
  • [6] Liu Zhifa, Malone Brandon, Yuan Changhe, Empirical evaluation of scoring functions for Bayesian network model selection, Proceedings of the Ninth Annual MCBIOS Conference. Dealing with the Omics Data Deluge, (2012)
  • [7] Song Poh Choo, Chong Hui Yee, Ong Hong Choon, Looi Sing Yan, A model of Bayesian network analysis of the factors affecting student's higher level study decision: The private institution case, journal of Telecommunication, Electronic and Computer Engineering (JTEC), 8, 2, pp. 105-109, (2016)
  • [8] de Campos Cassio P, Zeng Zhi, Ji Qiang, Structure learning of Bayesian networks using constraints, Proceedings of the 26th Annual International Conference on Machine Learning, ICML'09, pp. 113-120, (2009)
  • [9] Cussens James, Bayesian network learning with cutting planes, Proceedings of the Twenty-Seventh Conference on Uncertainty in Artificial Intelligence, pp. 153-160, (2011)
  • [10] Dempster Arthur P, Laird Nan M, Rubin Donald B, Maximum likelihood from incomplete data via the EM algorithm, J. Roy. Statist. Soc. B, 39, pp. 1-38, (1977)