Objective Bayesian Search of Gaussian Directed Acyclic Graphical Models for Ordered Variables with Non-Local Priors

Cited by: 25
Authors
Altomare, Davide [1]
Consonni, Guido [2]
La Rocca, Luca [3]
Affiliations
[1] Univ Pavia, Dipartimento Matemat, I-27100 Pavia, Italy
[2] Univ Cattolica Sacro Cuore, Dipartimento Sci Stat, I-20123 Milan, Italy
[3] Univ Modena & Reggio Emilia, Dipartimento Sci Fis Informat & Matemat, I-41125 Modena, Italy
Keywords
Directed acyclic graph; Fractional Bayes factor; Gaussian graphical model; High-dimensional sparse graph; Moment prior; Non-local prior; Objective Bayes; Regulatory network; Stochastic search; Structural learning; SELECTION; PROBABILITIES; NETWORKS
DOI
10.1111/biom.12018
Chinese Library Classification number
Q [Biological sciences]
Discipline classification code
07; 0710; 09
Abstract
Directed acyclic graphical (DAG) models are increasingly employed in the study of physical and biological systems to model direct influences between variables. Identifying the graph from data is a challenging endeavor, which can be more reasonably tackled if the variables are assumed to satisfy a given ordering; in this case we simply have to estimate the presence or absence of each potential edge. Working under this assumption, we propose an objective Bayesian method for searching the space of Gaussian DAG models, which provides a rich output from minimal input. We base our analysis on non-local parameter priors, which are especially suited for learning sparse graphs, because they allow a faster learning rate, relative to ordinary local parameter priors, when the true unknown sampling distribution belongs to a simple model. We implement an efficient stochastic search algorithm, which deals effectively with data sets having sample size smaller than the number of variables, and apply our method to a variety of simulated and real data sets. Our approach compares favorably, in terms of the ROC curve for edge hit rate versus false alarm rate, to current state-of-the-art frequentist methods relying on the assumption of ordered variables; under this assumption it exhibits a competitive advantage over the PC-algorithm, which can be considered as a frequentist benchmark for unordered variables. Importantly, we find that our method is still at an advantage for learning the skeleton of the DAG, when the ordering of the variables is only moderately mis-specified. Prospectively, our method could be coupled with a strategy to learn the order of the variables, thus dropping the known ordering assumption.
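To make the abstract's setting concrete, here is a minimal sketch, not taken from the paper, of what a known ordering buys and of how the two ROC coordinates are computed. It assumes edges point from earlier to later variables in the ordering; the function names, variable labels, and the toy true and estimated edge sets are all hypothetical.

```python
from itertools import combinations


def potential_edges(order):
    """All edges u -> v allowed by the ordering (assumed: u precedes v)."""
    return [(u, v) for u, v in combinations(order, 2)]


def hit_and_false_alarm_rates(true_edges, estimated_edges, order):
    """Edge hit rate and false alarm rate over the potential edges."""
    candidates = set(potential_edges(order))
    true_edges = set(true_edges) & candidates
    estimated_edges = set(estimated_edges) & candidates
    absent = candidates - true_edges  # potential edges not in the true DAG

    hits = len(estimated_edges & true_edges)
    false_alarms = len(estimated_edges & absent)

    hit_rate = hits / len(true_edges) if true_edges else float("nan")
    false_alarm_rate = false_alarms / len(absent) if absent else float("nan")
    return hit_rate, false_alarm_rate


# Hypothetical toy example with q = 4 ordered variables.
order = ["x1", "x2", "x3", "x4"]
true_edges = {("x1", "x2"), ("x2", "x3"), ("x1", "x4")}
estimated = {("x1", "x2"), ("x2", "x3"), ("x3", "x4")}

n_candidates = len(potential_edges(order))  # q(q-1)/2 potential edges
n_dags = 2 ** n_candidates                  # each edge is present or absent
rates = hit_and_false_alarm_rates(true_edges, estimated, order)
```

With the ordering fixed, every subset of the q(q-1)/2 potential edges is a valid DAG, so the search is over edge indicators only. Sweeping a threshold on per-edge evidence (for example, posterior edge inclusion probabilities) and recomputing the two rates at each threshold would trace out an ROC curve of the kind the abstract mentions.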
Pages: 478-487
Number of pages: 10