Fast Bayesian inference for gene regulatory networks using ScanBMA

被引:61
作者
Young, William Chad [1 ]
Raftery, Adrian E. [1 ]
Yeung, Ka Yee [2 ]
机构
[1] Univ Washington, Dept Stat, Seattle, WA 98195 USA
[2] Univ Washington, Dept Microbiol, Seattle, WA 98195 USA
基金
爱尔兰科学基金会;
关键词
Bayesian inference; Bayesian model averaging; Gene regulatory networks; TIME-COURSE DATA; SELECTION; REGULARIZATION; REGRESSION; CONSTRUCTION; MODELS;
D O I
10.1186/1752-0509-8-47
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background: Genome-wide time-series data provide a rich set of information for discovering gene regulatory relationships. As genome-wide data for mammalian systems are being generated, it is critical to develop network inference methods that can handle tens of thousands of genes efficiently, provide a systematic framework for the integration of multiple data sources, and yield robust, accurate and compact gene-to-gene relationships. Results: We developed and applied ScanBMA, a Bayesian inference method that incorporates external information to improve the accuracy of the inferred network. In particular, we developed a new strategy to efficiently search the model space, applied data transformations to reduce the effect of spurious relationships, and adopted the g-prior to guide the search for candidate regulators. Our method is highly computationally efficient, thus addressing the scalability issue with network inference. The method is implemented as the ScanBMA function in the networkBMA Bioconductor software package. Conclusions: We compared ScanBMA to other popular methods using time series yeast data as well as time-series simulated data from the DREAM competition. We found that ScanBMA produced more compact networks with a greater proportion of true positives than the competing methods. Specifically, ScanBMA generally produced more favorable areas under the Receiver-Operating Characteristic and Precision-Recall curves than other regression-based methods and mutual-information based methods. In addition, ScanBMA is competitive with other network inference methods in terms of running time.
引用
收藏
页数:11
相关论文
共 45 条
  • [1] [Anonymous], 2007, EM ALGORITHM EXTENSI
  • [2] Inference of gene regulatory networks and compound mode of action from time course gene expression profiles
    Bansal, M
    Della Gatta, G
    di Bernardo, D
    [J]. BIOINFORMATICS, 2006, 22 (07) : 815 - 822
  • [3] Reverse engineering of regulatory networks in human B cells
    Basso, K
    Margolin, AA
    Stolovitzky, G
    Klein, U
    Dalla-Favera, R
    Califano, A
    [J]. NATURE GENETICS, 2005, 37 (04) : 382 - 390
  • [4] Evolutionary Stochastic Search for Bayesian Model Exploration
    Bottolo, Leonard
    Richardson, Sylvia
    [J]. BAYESIAN ANALYSIS, 2010, 5 (03): : 583 - 618
  • [5] Model uncertainty
    Clyde, M
    George, EI
    [J]. STATISTICAL SCIENCE, 2004, 19 (01) : 81 - 94
  • [6] D'haeseleer P, 1999, Pac Symp Biocomput, P41
  • [7] MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM
    DEMPSTER, AP
    LAIRD, NM
    RUBIN, DB
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01): : 1 - 38
  • [8] Large-scale mapping and validation of Escherichia coli transcriptional regulation from a compendium of expression profiles
    Faith, Jeremiah J.
    Hayete, Boris
    Thaden, Joshua T.
    Mogno, Ilaria
    Wierzbowski, Jamey
    Cottarel, Guillaume
    Kasif, Simon
    Collins, James J.
    Gardner, Timothy S.
    [J]. PLOS BIOLOGY, 2007, 5 (01): : 54 - 66
  • [9] Regularization Paths for Generalized Linear Models via Coordinate Descent
    Friedman, Jerome
    Hastie, Trevor
    Tibshirani, Rob
    [J]. JOURNAL OF STATISTICAL SOFTWARE, 2010, 33 (01): : 1 - 22
  • [10] Topological and causal structure of the yeast transcriptional regulatory network
    Guelzim, N
    Bottani, S
    Bourgine, P
    Képès, F
    [J]. NATURE GENETICS, 2002, 31 (01) : 60 - 63