Inferring Regulatory Networks by Combining Perturbation Screens and Steady State Gene Expression Profiles

被引:18
作者
Shojaie, Ali [1 ]
Jauhiainen, Alexandra [2 ]
Kallitsis, Michael [3 ]
Michailidis, George [3 ]
机构
[1] Univ Washington, Dept Biostat, Seattle, WA 98195 USA
[2] Karolinska Inst, Dept Med Epidemiol & Biostat, Stockholm, Sweden
[3] Univ Michigan, Dept Stat, Ann Arbor, MI 48109 USA
基金
美国国家卫生研究院; 美国国家科学基金会;
关键词
MODELS; CAUSALITY; SYSTEMS; RECONSTRUCTION; DISCOVERY;
D O I
10.1371/journal.pone.0082393
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Reconstructing transcriptional regulatory networks is an important task in functional genomics. Data obtained from experiments that perturb genes by knockouts or RNA interference contain useful information for addressing this reconstruction problem. However, such data can be limited in size and/or are expensive to acquire. On the other hand, observational data of the organism in steady state (e.g., wild-type) are more readily available, but their informational content is inadequate for the task at hand. We develop a computational approach to appropriately utilize both data sources for estimating a regulatory network. The proposed approach is based on a three-step algorithm to estimate the underlying directed but cyclic network, that uses as input both perturbation screens and steady state gene expression data. In the first step, the algorithm determines causal orderings of the genes that are consistent with the perturbation data, by combining an exhaustive search method with a fast heuristic that in turn couples a Monte Carlo technique with a fast search algorithm. In the second step, for each obtained causal ordering, a regulatory network is estimated using a penalized likelihood based method, while in the third step a consensus network is constructed from the highest scored ones. Extensive computational experiments show that the algorithm performs well in reconstructing the underlying network and clearly outperforms competing approaches that rely only on a single data source. Further, it is established that the algorithm produces a consistent estimate of the regulatory network.
引用
收藏
页数:16
相关论文
共 43 条
[1]   Modeling the temporal interplay of molecular signaling and gene expression by using dynamic nested effects models [J].
Anchang, Benedict ;
Sadeh, Mohammad J. ;
Jacob, Juby ;
Tresch, Achim ;
Vlad, Marcel O. ;
Oefner, Peter J. ;
Spang, Rainer .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2009, 106 (16) :6447-6452
[2]  
[Anonymous], 2006, INTERJOURNALCOMPLEX
[3]  
[Anonymous], BIOINFORMATICS
[4]   CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING [J].
BENJAMINI, Y ;
HOCHBERG, Y .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) :289-300
[5]  
Dasgupta Sanjoy, 2008, Algorithms
[6]   Being Bayesian about network structure. A Bayesian approach to structure discovery in Bayesian networks [J].
Friedman, N ;
Koller, D .
MACHINE LEARNING, 2003, 50 (1-2) :95-125
[7]   Modeling gene expression regulatory networks with the sparse vector autoregressive model [J].
Fujita, Andre ;
Sato, Joao R. ;
Garay-Malpartida, Humberto M. ;
Yamaguchi, Rui ;
Miyano, Satoru ;
Sogayar, Mari C. ;
Ferreira, Carlos E. .
BMC SYSTEMS BIOLOGY, 2007, 1
[8]   DREAM4: Combining Genetic and Dynamic Information to Identify Biological Networks and Dynamical Models [J].
Greenfield, Alex ;
Madar, Aviv ;
Ostrer, Harry ;
Bonneau, Richard .
PLOS ONE, 2010, 5 (10)
[9]   Joint estimation of multiple graphical models [J].
Guo, Jian ;
Levina, Elizaveta ;
Michailidis, George ;
Zhu, Ji .
BIOMETRIKA, 2011, 98 (01) :1-15
[10]   Genetic reconstruction of a functional transcriptional regulatory network [J].
Hu, Zhanzhi ;
Killion, Patrick J. ;
Iyer, Vishwanath R. .
NATURE GENETICS, 2007, 39 (05) :683-687