A Multiple-Trait Bayesian Lasso for Genome-Enabled Analysis and Prediction of Complex Traits

被引:20
作者
Gianola, Daniel [1 ,2 ,3 ,4 ]
Fernando, Rohan L. [3 ]
机构
[1] Univ Wisconsin, Dept Anim Sci, Madison, WI 53706 USA
[2] Univ Wisconsin, Dept Dairy Sci, Madison, WI 53706 USA
[3] Iowa State Univ, Dept Anim Sci, Ames, IA 50011 USA
[4] TUM, Sch Life Sci, Dept Plant Sci, D-85354 Freising Weihenstephan, Germany
关键词
Bayesian Lasso; complex traits; GWAS; multiple-traits; prediction; quantitative genetics; quantitative genomics; GENETIC VALUE; QUANTITATIVE TRAITS; MOLECULAR MARKERS; WIDE ASSOCIATION; SELECTION; REGRESSION; PLANT; HERITABILITY; MODELS; VALUES;
D O I
10.1534/genetics.119.302934
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
A multiple-trait Bayesian LASSO (MBL) for genome-based analysis and prediction of quantitative traits is presented and applied to two real data sets. The data-generating model is a multivariate linear Bayesian regression on possibly a huge number of molecular markers, and with a Gaussian residual distribution posed. Each (one per marker) of the Tx1 vectors of regression coefficients (T: number of traits) is assigned the same T-variate Laplace prior distribution, with a null mean vector and unknown scale matrix sigma. The multivariate prior reduces to that of the standard univariate Bayesian LASSO when T=1. The covariance matrix of the residual distribution is assigned a multivariate Jeffreys prior, and sigma is given an inverse-Wishart prior. The unknown quantities in the model are learned using a Markov chain Monte Carlo sampling scheme constructed using a scale-mixture of normal distributions representation. MBL is demonstrated in a bivariate context employing two publicly available data sets using a bivariate genomic best linear unbiased prediction model (GBLUP) for benchmarking results. The first data set is one where wheat grain yields in two different environments are treated as distinct traits. The second data set comes from genotyped Pinus trees, with each individual measured for two traits: rust bin and gall volume. In MBL, the bivariate marker effects are shrunk differentially, i.e., "short" vectors are more strongly shrunk toward the origin than in GBLUP; conversely, "long" vectors are shrunk less. A predictive comparison was carried out as well in wheat, where the comparators of MBL were bivariate GBLUP and bivariate Bayes C pi-a variable selection procedure. A training-testing layout was used, with 100 random reconstructions of training and testing sets. For the wheat data, all methods produced similar predictions. In Pinus, MBL gave better predictions that either a Bayesian bivariate GBLUP or the single trait Bayesian LASSO. MBL has been implemented in the Julia language package JWAS, and is now available for the scientific community to explore with different traits, species, and environments. It is well known that there is no universally best prediction machine, and MBL represents a new resource in the armamentarium for genome-enabled analysis and prediction of complex traits.
引用
收藏
页码:305 / 331
页数:27
相关论文
共 77 条
  • [51] Beyond Missing Heritability: Prediction of Complex Traits
    Makowsky, Robert
    Pajewski, Nicholas M.
    Klimentidis, Yann C.
    Vazquez, Ana I.
    Duarte, Christine W.
    Allison, David B.
    de los Campos, Gustavo
    [J]. PLOS GENETICS, 2011, 7 (04):
  • [52] Finding the missing heritability of complex diseases
    Manolio, Teri A.
    Collins, Francis S.
    Cox, Nancy J.
    Goldstein, David B.
    Hindorff, Lucia A.
    Hunter, David J.
    McCarthy, Mark I.
    Ramos, Erin M.
    Cardon, Lon R.
    Chakravarti, Aravinda
    Cho, Judy H.
    Guttmacher, Alan E.
    Kong, Augustine
    Kruglyak, Leonid
    Mardis, Elaine
    Rotimi, Charles N.
    Slatkin, Montgomery
    Valle, David
    Whittemore, Alice S.
    Boehnke, Michael
    Clark, Andrew G.
    Eichler, Evan E.
    Gibson, Greg
    Haines, Jonathan L.
    Mackay, Trudy F. C.
    McCarroll, Steven A.
    Visscher, Peter M.
    [J]. NATURE, 2009, 461 (7265) : 747 - 753
  • [53] Meuwissen THE, 2001, GENETICS, V157, P1819
  • [54] Predictive ability of genome-assisted statistical models under various forms of gene action
    Momen, Mehdi
    Mehrgardi, Ahmad Ayatollahi
    Sheikhi, Ayyub
    Kranis, Andreas
    Tusell, Llibert
    Morota, Gota
    Rosa, Guilherme J. M.
    Gianola, Daniel
    [J]. SCIENTIFIC REPORTS, 2018, 8
  • [55] Multi-environment Genomic Prediction of Plant Traits Using Deep Learners With Dense Architecture
    Montesinos-Lopez, Abelardo
    Montesinos-Lopez, Osval A.
    Gianola, Daniel
    Crossa, Jose
    Hernandez-Suarez, Carlos M.
    [J]. G3-GENES GENOMES GENETICS, 2018, 8 (12): : 3813 - 3828
  • [56] New Deep Learning Genomic-Based Prediction Model for Multiple Traits with Binary, Ordinal, and Continuous Phenotypes
    Montesinos-Lopez, Osval A.
    Martin-Vallejo, Javier
    Crossa, Jose
    Gianola, Daniel
    Hernandez-Suarez, Carlos M.
    Montesinos-Lopez, Abelardo
    Juliana, Philomin
    Singh, Ravi
    [J]. G3-GENES GENOMES GENETICS, 2019, 9 (05): : 1545 - 1556
  • [57] A Benchmarking Between Deep Learning, Support Vector Machine and Bayesian Threshold Best Linear Unbiased Prediction for Predicting Ordinal Traits in Plant Breeding
    Montesinos-Lopez, Osval A.
    Martin-Vallejo, Javier
    Crossa, Jose
    Gianola, Daniel
    Hernandez-Suarez, Carlos M.
    Montesinos-Lopez, Abelardo
    Juliana, Philomin
    Singh, Ravi
    [J]. G3-GENES GENOMES GENETICS, 2019, 9 (02): : 601 - 618
  • [58] Multi-trait, Multi-environment Deep Learning Modeling for Genomic-Enabled Prediction of Plant Traits
    Montesinos-Lopez, Osval A.
    Montesinos-Lopez, Abelardo
    Crossa, Jose
    Gianola, Daniel
    Hernandez-Suarez, Carlos M.
    Martin-Vallejo, Javier
    [J]. G3-GENES GENOMES GENETICS, 2018, 8 (12): : 3829 - 3840
  • [59] Kernel-based whole-genome prediction of complex traits: a review
    Morota, Gota
    Gianola, Daniel
    [J]. FRONTIERS IN GENETICS, 2014, 5
  • [60] Simultaneous Discovery, Estimation and Prediction Analysis of Complex Traits Using a Bayesian Mixture Model
    Moser, Gerhard
    Lee, Sang Hong
    Hayes, Ben J.
    Goddard, Michael E.
    Wray, Naomi R.
    Visscher, Peter M.
    [J]. PLOS GENETICS, 2015, 11 (04):