Network inference using informative priors

被引:124
作者
Mukherjee, Sach [1 ,2 ]
Speed, Terence P. [3 ,4 ]
机构
[1] Univ Warwick, Dept Stat, Coventry CV4 7AL, W Midlands, England
[2] Univ Warwick, Ctr Complex Sci, Coventry CV4 7AL, W Midlands, England
[3] Univ Calif Berkeley, Dept Stat, Berkeley, CA 94720 USA
[4] Walter & Eliza Hall Inst Med Res, Parkville, Vic 3050, Australia
关键词
Bayesian networks; biological networks; graphical models; protein signaling;
D O I
10.1073/pnas.0802272105
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Recent years have seen much interest in the study of systems characterized by multiple interacting components. A class of statistical models called graphical models, in which graphs are used to represent probabilistic relationships between variables, provides a framework for formal inference regarding such systems. In many settings, the object of inference is the network structure itself. This problem of "network inference" is well known to be a challenging one. However, in scientific settings there is very often existing information regarding network connectivity. A natural idea then is to take account of such information during inference. This article addresses the question of incorporating prior information into network inference. We focus on directed models called Bayesian networks, and use Markov chain Monte Carlo to draw samples from posterior distributions over network structures. We introduce prior distributions on graphs capable of capturing information regarding network features including edges, classes of edges, degree distributions, and sparsity. We illustrate our approach in the context of systems biology, applying our methods to network inference in cancer signaling.
引用
收藏
页码:14313 / 14318
页数:6
相关论文
共 21 条
[1]  
[Anonymous], 1988, PROBABILISTIC REASON, DOI DOI 10.1016/C2009-0-27609-4
[2]  
Bernard A, 2005, PACIFIC SYMPOSIUM ON BIOCOMPUTING 2005, P459
[3]   Being Bayesian about network structure. A Bayesian approach to structure discovery in Bayesian networks [J].
Friedman, N ;
Koller, D .
MACHINE LEARNING, 2003, 50 (1-2) :95-125
[4]   Improving Markov Chain Monte Carlo model search for data mining [J].
Giudici, P ;
Castelo, R .
MACHINE LEARNING, 2003, 50 (1-2) :127-158
[5]   LEARNING BAYESIAN NETWORKS - THE COMBINATION OF KNOWLEDGE AND STATISTICAL-DATA [J].
HECKERMAN, D ;
GEIGER, D ;
CHICKERING, DM .
MACHINE LEARNING, 1995, 20 (03) :197-243
[6]   Signal transduction via platelet-derived growth factor receptors [J].
Heldin, CH ;
Östman, A ;
Rönnstrand, L .
BIOCHIMICA ET BIOPHYSICA ACTA-REVIEWS ON CANCER, 1998, 1378 (01) :F79-F113
[7]  
Jeffreys H., 1998, The Theory of Probability
[8]   BAYESIAN VARIABLE SELECTION AND DATA INTEGRATION FOR BIOLOGICAL REGULATORY NETWORKS [J].
Jensen, Shane T. ;
Chen, Guang ;
Stoeckert, Christian J., Jr. .
ANNALS OF APPLIED STATISTICS, 2007, 1 (02) :612-633
[9]   Experiments in stochastic computation for high-dimensional graphical models [J].
Jones, B ;
Carvalho, C ;
Dobra, A ;
Hans, C ;
Carter, C ;
West, M .
STATISTICAL SCIENCE, 2005, 20 (04) :388-400
[10]   Graphical models [J].
Jordan, MI .
STATISTICAL SCIENCE, 2004, 19 (01) :140-155