Fast Bayesian inference for gene regulatory networks using ScanBMA

被引：61

作者：

Young, William Chad ^{[1
]}

Raftery, Adrian E. ^{[1
]}

Yeung, Ka Yee ^{[2
]}

机构：

[1] Univ Washington, Dept Stat, Seattle, WA 98195 USA

[2] Univ Washington, Dept Microbiol, Seattle, WA 98195 USA

来源：

BMC SYSTEMS BIOLOGY | 2014年 / 8卷

基金：

爱尔兰科学基金会;

关键词：

Bayesian inference; Bayesian model averaging; Gene regulatory networks; TIME-COURSE DATA; SELECTION; REGULARIZATION; REGRESSION; CONSTRUCTION; MODELS;

D O I：

10.1186/1752-0509-8-47

中图分类号：

Q [生物科学];

学科分类号：

07 ; 0710 ; 09 ;

摘要：

Background: Genome-wide time-series data provide a rich set of information for discovering gene regulatory relationships. As genome-wide data for mammalian systems are being generated, it is critical to develop network inference methods that can handle tens of thousands of genes efficiently, provide a systematic framework for the integration of multiple data sources, and yield robust, accurate and compact gene-to-gene relationships. Results: We developed and applied ScanBMA, a Bayesian inference method that incorporates external information to improve the accuracy of the inferred network. In particular, we developed a new strategy to efficiently search the model space, applied data transformations to reduce the effect of spurious relationships, and adopted the g-prior to guide the search for candidate regulators. Our method is highly computationally efficient, thus addressing the scalability issue with network inference. The method is implemented as the ScanBMA function in the networkBMA Bioconductor software package. Conclusions: We compared ScanBMA to other popular methods using time series yeast data as well as time-series simulated data from the DREAM competition. We found that ScanBMA produced more compact networks with a greater proportion of true positives than the competing methods. Specifically, ScanBMA generally produced more favorable areas under the Receiver-Operating Characteristic and Precision-Recall curves than other regression-based methods and mutual-information based methods. In addition, ScanBMA is competitive with other network inference methods in terms of running time.

引用

页数：11

共 45 条

[1]

[Anonymous], 2007, EM ALGORITHM EXTENSI

[2] Inference of gene regulatory networks and compound mode of action from time course gene expression profiles [J].

Bansal, M ;

Della Gatta, G ;

di Bernardo, D .

BIOINFORMATICS, 2006, 22 (07) :815-822

[3] Reverse engineering of regulatory networks in human B cells [J].

Basso, K ;

Margolin, AA ;

Stolovitzky, G ;

Klein, U ;

Dalla-Favera, R ;

Califano, A .

NATURE GENETICS, 2005, 37 (04) :382-390

[4] Evolutionary Stochastic Search for Bayesian Model Exploration [J].

Bottolo, Leonard ;

Richardson, Sylvia .

BAYESIAN ANALYSIS, 2010, 5 (03) :583-618

[5] Model uncertainty [J].

Clyde, M ;

George, EI .

STATISTICAL SCIENCE, 2004, 19 (01) :81-94

[6]

D'haeseleer P, 1999, Pac Symp Biocomput, P41

[7] MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J].

DEMPSTER, AP ;

LAIRD, NM ;

RUBIN, DB .

JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01) :1-38

[8] Large-scale mapping and validation of Escherichia coli transcriptional regulation from a compendium of expression profiles [J].

Faith, Jeremiah J. ;

Hayete, Boris ;

Thaden, Joshua T. ;

Mogno, Ilaria ;

Wierzbowski, Jamey ;

Cottarel, Guillaume ;

Kasif, Simon ;

Collins, James J. ;

Gardner, Timothy S. .

PLOS BIOLOGY, 2007, 5 (01) :54-66

[9] Regularization Paths for Generalized Linear Models via Coordinate Descent [J].

Friedman, Jerome ;

Hastie, Trevor ;

Tibshirani, Rob .

JOURNAL OF STATISTICAL SOFTWARE, 2010, 33 (01) :1-22

[10] Topological and causal structure of the yeast transcriptional regulatory network [J].

Guelzim, N ;

Bottani, S ;

Bourgine, P ;

Képès, F .

NATURE GENETICS, 2002, 31 (01) :60-63

← 1 2 3 4 5 →