Hierarchical estimation of parameters in Bayesian networks

被引:17
作者
Azzimonti, Laura [1 ]
Corani, Giorgio [1 ]
Zaffalon, Marco [1 ]
机构
[1] IDSIA SUPSI NSI, Manno, Switzerland
基金
瑞士国家科学基金会;
关键词
Hierarchical Bayesian modelling; Bayesian networks; Variational inference; Multi-domain classification; CLASSIFIERS;
D O I
10.1016/j.csda.2019.02.004
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
A novel approach for parameter estimation in Bayesian networks is presented. The main idea is to introduce a hyper-prior in the Multinomial-Dirichlet model, traditionally used for conditional distribution estimation in Bayesian networks. The resulting hierarchical model jointly estimates different conditional distributions belonging to the same conditional probability table, thus borrowing statistical strength from each other. An analytical study of the dependence structure a priori induced by the hierarchical model is performed and an ad hoc variational algorithm for fast and accurate inference is derived. The proposed hierarchical model yields a major performance improvement in classification with Bayesian networks compared to traditional models. The proposed variational algorithm reduces by two orders of magnitude the computational time, with the same accuracy in parameter estimation, compared to traditional MCMC methods. Moreover, motivated by a real case study, the hierarchical model is applied to the estimation of Bayesian networks parameters by borrowing strength from related domains. (C) 2019 Elsevier B.V. All rights reserved.
引用
收藏
页码:67 / 91
页数:25
相关论文
共 38 条
[1]  
[Anonymous], 2017, ARXIV170306777
[2]   Integer Linear Programming for the Bayesian network structure learning problem [J].
Bartlett, Mark ;
Cussens, James .
ARTIFICIAL INTELLIGENCE, 2017, 244 :258-271
[3]   Variational Inference: A Review for Statisticians [J].
Blei, David M. ;
Kucukelbir, Alp ;
McAuliffe, Jon D. .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2017, 112 (518) :859-877
[4]   Latent Dirichlet allocation [J].
Blei, DM ;
Ng, AY ;
Jordan, MI .
JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) :993-1022
[5]   The use of the area under the roc curve in the evaluation of machine learning algorithms [J].
Bradley, AP .
PATTERN RECOGNITION, 1997, 30 (07) :1145-1159
[6]  
Bucher D, 2016, ACSR ADV COMPUT, V46, P89
[7]   Stan: A Probabilistic Programming Language [J].
Carpenter, Bob ;
Gelman, Andrew ;
Hoffman, Matthew D. ;
Lee, Daniel ;
Goodrich, Ben ;
Betancourt, Michael ;
Brubaker, Marcus A. ;
Guo, Jiqiang ;
Li, Peter ;
Riddell, Allen .
JOURNAL OF STATISTICAL SOFTWARE, 2017, 76 (01) :1-29
[8]   Assessing Robustness of Intrinsic Tests of Independence in Two-Way Contingency Tables [J].
Casella, George ;
Moreno, Elias .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2009, 104 (487) :1261-1271
[9]  
Cellina F., 2016, 4 EUR C BEH EN EFF B, P8
[10]  
Darwiche A, 2009, MODELING AND REASONING WITH BAYESIAN NETWORKS, P1, DOI 10.1017/CBO9780511811357