Learning Bayesian networks from incomplete databases using a novel evolutionary algorithm

被引:34
作者
Wong, Man Leung [1 ]
Guo, Yuan Yuan [1 ]
机构
[1] Lingnan Univ, Dept Computing & Decis Sci, Tuen Mun, Hong Kong, Peoples R China
关键词
data mining; machine learning; Bayesian networks; evolutionary algorithms;
D O I
10.1016/j.dss.2008.01.002
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes a novel method for learning Bayesian networks from incomplete databases in the presence of missing values, which combines an evolutionary algorithm with the traditional Expectation Maximization (EM) algorithm. A data completing procedure is presented for learning and evaluating the candidate networks. Moreover, a strategy is introduced to obtain better initial networks to facilitate the method. The new method can also overcome the problem of getting stuck in sub-optimal solutions which occurs in most existing learning algorithms. The experimental results on the databases generated from several benchmark networks illustrate that the new method has better performance than some state-of-the-art algorithms. We also apply the method to a data mining problem and compare the performance of the discovered Bayesian networks with the models generated by other learning algorithms. The results demonstrate that our method outperforms other algorithms. (c) 2008 Elsevier B.V All rights reserved.
引用
收藏
页码:368 / 383
页数:16
相关论文
共 54 条
[1]   Decision support for real-time telemarketing operations through Bayesian network learning [J].
Ahn, JH ;
Ezawa, KJ .
DECISION SUPPORT SYSTEMS, 1997, 21 (01) :17-27
[2]  
ANDREASSEN S, 1987, P 10 INT JOINT C ART, P366
[3]  
[Anonymous], 1988, PROBABILISTIC REASON, DOI DOI 10.1016/C2009-0-27609-4
[4]  
Beaumont GP, 1996, STAT TESTS INTRO MIN
[5]  
BEINLICH IA, 1989, P 2 EUR C ART INT ME, P247
[6]  
Bhattacharyya S., 1998, Proceedings Fourth International Conference on Knowledge Discovery and Data Mining, P144
[7]   Learning Bayesian networks in the space of structures by estimation of distribution algorithms [J].
Blanco, R ;
Inza, I ;
Larrañga, P .
INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2003, 18 (02) :205-220
[8]   Microarray missing value imputation by iterated local least squares [J].
Cai, ZP ;
Heydari, M ;
Lin, GH .
PROCEEDINGS OF THE 4TH ASIA-PACIFIC BIOINFORMATICS CONFERENCE, 2006, 3 :159-168
[9]   Learning Bayesian networks from data: An information-theory based approach [J].
Cheng, J ;
Greiner, R ;
Kelly, J ;
Bell, D ;
Liu, WR .
ARTIFICIAL INTELLIGENCE, 2002, 137 (1-2) :43-90
[10]   Learning equivalence classes of Bayesian-network structures [J].
Chickering, DM .
JOURNAL OF MACHINE LEARNING RESEARCH, 2002, 2 (03) :445-498