High-Dimensional Bayesian Network Inference From Systems Genetics Data Using Genetic Node Ordering

被引:12
作者
Wang, Lingfei [1 ,2 ,3 ]
Audenaert, Pieter [4 ,5 ]
Michoel, Tom [1 ,6 ]
机构
[1] Univ Edinburgh, Roslin Inst, Div Genet & Genom, Easter Bush Campus, Edinburgh, Midlothian, Scotland
[2] Broad Inst MIT & Harvard, Cambridge, MA 02142 USA
[3] Massachusetts Gen Hosp, Dept Mol Biol, Boston, MA 02114 USA
[4] Univ Ghent, IDLab, IMEC, Ghent, Belgium
[5] Univ Ghent, Bioinformat Inst Ghent, Ghent, Belgium
[6] Univ Bergen, Dept Informat, Computat Biol Unit, Bergen, Norway
基金
英国生物技术与生命科学研究理事会;
关键词
systems genetics; network inference; Bayesian network; expression quantitative trait loci analysis; gene expression; INTEGRATIVE GENOMICS APPROACH; COMPLEX TRAITS; EXPRESSION; RECONSTRUCTION; ARCHITECTURE; SELECTION; MAP;
D O I
10.3389/fgene.2019.01196
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Studying the impact of genetic variation on gene regulatory networks is essential to understand the biological mechanisms by which genetic variation causes variation in phenotypes. Bayesian networks provide an elegant statistical approach for multi-trait genetic mapping and modelling causal trait relationships. However, inferring Bayesian gene networks from high-dimensional genetics and genomics data is challenging, because the number of possible networks scales super-exponentially with the number of nodes, and the computational cost of conventional Bayesian network inference methods quickly becomes prohibitive. We propose an alternative method to infer high-quality Bayesian gene networks that easily scales to thousands of genes. Our method first reconstructs a node ordering by conducting pairwise causal inference tests between genes, which then allows to infer a Bayesian network via a series of independent variable selection problems, one for each gene. We demonstrate using simulated and real systems genetics data that this results in a Bayesian network with equal, and sometimes better, likelihood than the conventional methods, while having a significantly higher overlap with groundtruth networks and being orders of magnitude faster. Moreover our method allows for a unified false discovery rate control across genes and individual edges, and thus a rigorous and easily interpretable way for tuning the sparsity level of the inferred network. Bayesian network inference using pairwise node ordering is a highly efficient approach for reconstructing gene regulatory networks when prior information for the inclusion of edges exists or can be inferred from the available data.
引用
收藏
页数:13
相关论文
共 64 条
  • [1] Biophysically Motivated Regulatory Network Inference: Progress and Prospects
    Aeijoe, Tarmo
    Bonneau, Richard
    [J]. HUMAN HEREDITY, 2016, 81 (02) : 62 - 77
  • [2] Genetic effects on gene expression across human tissues
    Aguet, Francois
    Brown, Andrew A.
    Castel, Stephane E.
    Davis, Joe R.
    He, Yuan
    Jo, Brian
    Mohammadi, Pejman
    Park, Yoson
    Parsana, Princy
    Segre, Ayellet V.
    Strober, Benjamin J.
    Zappala, Zachary
    Cummings, Beryl B.
    Gelfand, Ellen T.
    Hadley, Kane
    Huang, Katherine H.
    Lek, Monkol
    Li, Xiao
    Nedzel, Jared L.
    Nguyen, Duyen Y.
    Noble, Michael S.
    Sullivan, Timothy J.
    Tukiainen, Taru
    MacArthur, Daniel G.
    Getz, Gad
    Management, Nih Program
    Addington, Anjene
    Guan, Ping
    Koester, Susan
    Little, A. Roger
    Lockhart, Nicole C.
    Moore, Helen M.
    Rao, Abhi
    Struewing, Jeffery P.
    Volpi, Simona
    Collection, Biospecimen
    Brigham, Lori E.
    Hasz, Richard
    Hunter, Marcus
    Johns, Christopher
    Johnson, Mark
    Kopen, Gene
    Leinweber, William F.
    Lonsdale, John T.
    McDonald, Alisa
    Mestichelli, Bernadette
    Myer, Kevin
    Roe, Bryan
    Salvatore, Michael
    Shad, Saboor
    [J]. NATURE, 2017, 550 (7675) : 204 - +
  • [3] The role of regulatory variation in complex traits and disease
    Albert, Frank W.
    Kruglyak, Leonid
    [J]. NATURE REVIEWS GENETICS, 2015, 16 (04) : 197 - 212
  • [4] Network inference, analysis, and modeling in systems biology
    Albert, Reke
    [J]. PLANT CELL, 2007, 19 (11) : 3327 - 3338
  • [5] How to infer gene networks from expression profiles
    Bansal, Mukesh
    Belcastro, Vincenzo
    Ambesi-Impiombato, Alberto
    di Bernardo, Diego
    [J]. MOLECULAR SYSTEMS BIOLOGY, 2007, 3 (1)
  • [6] Beckmann N. D., 2018, bioRxiv, DOI 10.1101/458430
  • [7] An Expanded View of Complex Traits: From Polygenic to Omnigenic
    Boyle, Evan A.
    Li, Yang I.
    Pritchard, Jonathan K.
    [J]. CELL, 2017, 169 (07) : 1177 - 1186
  • [8] Predictive modeling of genome-wide mRNA expression: From modules to molecules
    Bussemaker, Harmen J.
    Foat, Barrett C.
    Ward, Lucas D.
    [J]. ANNUAL REVIEW OF BIOPHYSICS AND BIOMOLECULAR STRUCTURE, 2007, 36 : 329 - 347
  • [9] Harnessing naturally randomized transcription to infer regulatory relationships among genes
    Chen, Lin S.
    Emmert-Streib, Frank
    Storey, John D.
    [J]. GENOME BIOLOGY, 2007, 8 (10)
  • [10] Systems genetics approaches to understand complex traits
    Civelek, Mete
    Lusis, Aldons J.
    [J]. NATURE REVIEWS GENETICS, 2014, 15 (01) : 34 - 48