A novel algorithm for scalable and accurate Bayesian network learning

被引:0
作者
Brown, LE [1 ]
Tsamardinos, L [1 ]
Aliferis, CF [1 ]
机构
[1] Vanderbilt Univ, Dept Biomed Informat, Discovery Syst Lab, Nashville, TN USA
来源
MEDINFO 2004: PROCEEDINGS OF THE 11TH WORLD CONGRESS ON MEDICAL INFORMATICS, PT 1 AND 2 | 2004年 / 107卷
关键词
expert systems; causal discovery; artificial intelligence; algorithms; Bayesian analysis; Bayesian Networks;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Bayesian Networks (BN) is a knowledge representation formalism that has been proven to be valuable in biomedicine for constructing decision support systems and for generating causal hypotheses from data. Given the emergence of datasets in medicine with thousands of variables and that current algorithms do not scale more than a few hundred variables in practical domains, new efficient and accurate algorithms are needed to learn high quality BNs from data. We present a new algorithm called Max-Min Hill-Climbing (MMHC) that builds upon and improves the Sparse Candidate (SC) algorithm; a state-of-the-art algorithm that scales up to datasets involving hundreds of variables provided the generating networks are sparse. Compared to the SC, on a number of datasets from medicine and biology, (a) MMHC discovers BNs that are structurally closer to the data-generating BN, (b) the discovered networks are more probable given the data, (c) MMHC is computationally more efficient and scalable than SC, and (d) the generating networks are not required to be uniformly sparse nor is the user of MMHC required to guess correctly the network connectivity.
引用
收藏
页码:711 / 715
页数:5
相关论文
共 14 条
[1]  
Andreassen S., 1989, COMPUTER AIDED ELECT
[2]  
[Anonymous], 1999, Probabilistic Networks and Expert Systems
[3]  
BAEZEYATES R, 1999, MODERN INFORMATION
[4]  
BEINLICH IA, 1989, 2 EUR C ART INT MED
[5]   A comparison of classification algorithms to automatically identify chest X-ray reports that support pneumonia [J].
Chapman, WW ;
Fizman, M ;
Chapman, BE ;
Huag, PJ .
JOURNAL OF BIOMEDICAL INFORMATICS, 2001, 34 (01) :4-14
[6]   Using Bayesian networks to analyze expression data [J].
Friedman, N ;
Linial, M ;
Nachman, I ;
Pe'er, D .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2000, 7 (3-4) :601-620
[7]  
Friedman N., 1999, 15 C UNC ART INT UAI
[8]   LEARNING BAYESIAN NETWORKS - THE COMBINATION OF KNOWLEDGE AND STATISTICAL-DATA [J].
HECKERMAN, D ;
GEIGER, D ;
CHICKERING, DM .
MACHINE LEARNING, 1995, 20 (03) :197-243
[9]  
Singh M., 1993, Uncertainty in Artificial Intelligence. Proceedings of the Ninth Conference (1993), P259
[10]   Comprehensive identification of cell cycle-regulated genes of the yeast Saccharomyces cerevisiae by microarray hybridization [J].
Spellman, PT ;
Sherlock, G ;
Zhang, MQ ;
Iyer, VR ;
Anders, K ;
Eisen, MB ;
Brown, PO ;
Botstein, D ;
Futcher, B .
MOLECULAR BIOLOGY OF THE CELL, 1998, 9 (12) :3273-3297