Scoring Bayesian networks of mixed variables

被引:41
作者
Andrews, Bryan [1 ]
Ramsey, Joseph [2 ]
Cooper, Gregory F. [1 ]
机构
[1] Univ Pittsburgh, Pittsburgh, PA 15260 USA
[2] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
基金
美国国家科学基金会;
关键词
Bayesian network structure learning; Mixed variables; Continuous and discrete variables;
D O I
10.1007/s41060-017-0085-7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we outline two novel scoring methods for learning Bayesian networks in the presence of both continuous and discrete variables, that is, mixed variables. While much work has been done in the domain of automated Bayesian network learning, few studies have investigated this task in the presence of both continuous and discrete variables while focusing on scalability. Our goal is to provide two novel and scalable scoring functions capable of handling mixed variables. The first method, the Conditional Gaussian (CG) score, provides a highly efficient option. The second method, the Mixed Variable Polynomial (MVP) score, allows for a wider range of modeled relationships, including nonlinearity, but it is slower than CG. Both methods calculate log likelihood and degrees of freedom terms, which are incorporated into a Bayesian Information Criterion (BIC) score. Additionally, we introduce a structure prior for efficient learning of large networks and a simplification in scoring the discrete case which performs well empirically. While the core of this work focuses on applications in the search and score paradigm, we also show how the introduced scoring functions may be readily adapted as conditional independence tests for constraint-based Bayesian network learning algorithms. Lastly, we describe ways to simulate networks of mixed variable types and evaluate our proposed methods on such simulations.
引用
收藏
页码:3 / 18
页数:16
相关论文
共 27 条
[1]   STRONG CONSISTENCY OF LEAST-SQUARES ESTIMATES IN NORMAL LINEAR-REGRESSION [J].
ANDERSON, TW ;
TAYLOR, JB .
ANNALS OF STATISTICS, 1976, 4 (04) :788-790
[2]  
Bishop C. M., 2006, PATTERN RECOGN
[3]  
Bottcher S. G., 2004, THESIS
[4]   EXTENDED BIC FOR SMALL-n-LARGE-P SPARSE GLM [J].
Chen, Jiahua ;
Chen, Zehua .
STATISTICA SINICA, 2012, 22 (02) :555-574
[5]  
Chickering D. M., 2003, Journal of Machine Learning Research, V3, P507, DOI 10.1162/153244303321897717
[6]   Learning Bayesian networks: approaches and issues [J].
Daly, Ronan ;
Shen, Qiang ;
Aitken, Stuart .
KNOWLEDGE ENGINEERING REVIEW, 2011, 26 (02) :99-157
[7]  
Fan RE, 2008, J MACH LEARN RES, V9, P1871
[8]  
Heckerman D., 1995, Uncertainty in Artificial Intelligence. Proceedings of the Eleventh Conference (1995), P274
[9]  
Hsia C.-Y., 2017, ASIAN C MACHINE LEAR, V77, P33
[10]   MODEL SELECTION FOR GAUSSIAN MIXTURE MODELS [J].
Huang, Tao ;
Peng, Heng ;
Zhang, Kun .
STATISTICA SINICA, 2017, 27 (01) :147-169