Genomic Prediction for Quantitative Traits Is Improved by Mapping Variants to Gene Ontology Categories in Drosophila melanogaster

被引:93
作者
Edwards, Stefan M. [1 ,2 ,3 ]
Sorensen, Izel F. [1 ]
Sarup, Pernille [1 ]
Mackay, Trudy F. C. [4 ,5 ]
Sorensen, Peter [1 ]
机构
[1] Aarhus Univ, Dept Mol Biol & Genet, Ctr Quantitat Genet & Genom, DK-8830 Tjele, Denmark
[2] Univ Edinburgh, Roslin Inst, Easter Bush EH25 9RG, Midlothian, Scotland
[3] Univ Edinburgh, Royal Dick Sch Vet Studies, Easter Bush EH25 9RG, Midlothian, Scotland
[4] North Carolina State Univ, Dept Biol Sci, Raleigh, NC 27695 USA
[5] North Carolina State Univ, Genet Program, Raleigh, NC 27695 USA
基金
美国国家卫生研究院;
关键词
genomic feature models; best linear unbiased prediction; Drosophila Genetic Reference Population; startle response; starvation resistance; chill coma recovery time; genomic selection; GenPred; shared data resource; PARTITIONING HERITABILITY; COMPLEX TRAITS; ARCHITECTURE; POPULATIONS; SELECTION; LOCI; GWAS;
D O I
10.1534/genetics.116.187161
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Predicting individual quantitative trait phenotypes from high-resolution genomic polymorphism data is important for personalized medicine in humans, plant and animal breeding, and adaptive evolution. However, this is difficult for populations of unrelated individuals when the number of causal variants is low relative to the total number of polymorphisms and causal variants individually have small effects on the traits. We hypothesized that mapping molecular polymorphisms to genomic features such as genes and their gene ontology categories could increase the accuracy of genomic prediction models. We developed a genomic feature best linear unbiased prediction (GFBLUP) model that implements this strategy and applied it to three quantitative traits (startle response, starvation resistance, and chill coma recovery) in the unrelated, sequenced inbred lines of the Drosophila melanogaster Genetic Reference Panel. Our results indicate that subsetting markers based on genomic features increases the predictive ability relative to the standard genomic best linear unbiased prediction (GBLUP) model. Both models use all markers, but GFBLUP allows differential weighting of the individual genetic marker relationships, whereas GBLUP weighs the genetic marker relationships equally. Simulation studies show that it is possible to further increase the accuracy of genomic prediction for complex traits using this model, provided the genomic features are enriched for causal variants. Our GFBLUP model using prior information on genomic features enriched for causal variants can increase the accuracy of genomic predictions in populations of unrelated individuals and provides a formal statistical framework for leveraging and evaluating information across multiple experimental studies to provide novel insights into the genetic architecture of complex traits.
引用
收藏
页码:1871 / +
页数:15
相关论文
共 57 条
[11]   Prediction of Complex Human Traits Using the Genomic Best Linear Unbiased Predictor [J].
de los Campos, Gustavo ;
Vazquez, Ana I. ;
Fernando, Rohan ;
Klimentidis, Yann C. ;
Sorensen, Daniel .
PLOS GENETICS, 2013, 9 (07)
[12]   Reliability of Genomic Predictions Across Multiple Populations [J].
de Roos, A. P. W. ;
Hayes, B. J. ;
Goddard, M. E. .
GENETICS, 2009, 183 (04) :1545-1553
[13]   Partitioning of genomic variance reveals biological pathways associated with udder health and milk production traits in dairy cattle [J].
Edwards, Stefan M. ;
Thomsen, Bo ;
Madsen, Per ;
Sorensen, Peter .
GENETICS SELECTION EVOLUTION, 2015, 47
[14]   Improving accuracy of genomic predictions within and between dairy cattle breeds with imputed high-density single nucleotide polymorphism panels [J].
Erbe, M. ;
Hayes, B. J. ;
Matukumalli, L. K. ;
Goswami, S. ;
Bowman, P. J. ;
Reich, C. M. ;
Mason, B. A. ;
Goddard, M. E. .
JOURNAL OF DAIRY SCIENCE, 2012, 95 (07) :4114-4129
[15]   Genome-wide patterns of latitudinal differentiation among populations of Drosophila melanogaster from North America [J].
Fabian, Daniel K. ;
Kapun, Martin ;
Nolte, Viola ;
Kofler, Robert ;
Schmidt, Paul S. ;
Schloetterer, Christian ;
Flatt, Thomas .
MOLECULAR ECOLOGY, 2012, 21 (19) :4748-4769
[16]  
Falconer D. S., 1996, Introduction to quantitative genetics.
[17]   Why do insects enter and recover from chill coma? Low temperature and high extracellular potassium compromise muscle function in Locusta migratoria [J].
Findsen, Anders ;
Pedersen, Thomas Holm ;
Petersen, Asbjorn Graver ;
Nielsen, Ole Baekgaard ;
Overgaard, Johannes .
JOURNAL OF EXPERIMENTAL BIOLOGY, 2014, 217 (08) :1297-1306
[18]   Partitioning heritability by functional annotation using genome-wide association summary statistics [J].
Finucane, Hilary K. ;
Bulik-Sullivan, Brendan ;
Gusev, Alexander ;
Trynka, Gosia ;
Reshef, Yakir ;
Loh, Po-Ru ;
Anttila, Verneri ;
Xu, Han ;
Zang, Chongzhi ;
Farh, Kyle ;
Ripke, Stephan ;
Day, Felix R. ;
Purcell, Shaun ;
Stahl, Eli ;
Lindstrom, Sara ;
Perry, John R. B. ;
Okada, Yukinori ;
Raychaudhuri, Soumya ;
Daly, Mark J. ;
Patterson, Nick ;
Neale, Benjamin M. ;
Price, Alkes L. .
NATURE GENETICS, 2015, 47 (11) :1228-+
[19]   Partitioning Heritability of Regulatory and Cell-Type-Specific Variants across 11 Common Diseases [J].
Gusev, Alexander ;
Lee, S. Hong ;
Trynka, Gosia ;
Finucane, Hilary ;
Vilhjalmsson, Bjarni J. ;
Xu, Han ;
Zang, Chongzhi ;
Ripke, Stephan ;
Bulik-Sullivan, Brendan ;
Stahl, Eli ;
Kaehler, Anna K. ;
Hultman, Christina M. ;
Purcell, Shaun M. ;
McCarroll, Steven A. ;
Daly, Mark ;
Pasaniuc, Bogdan ;
Sullivan, Patrick F. ;
Neale, Benjamin M. ;
Wray, Naomi R. ;
Raychaudhuri, Soumya ;
Price, Alkes L. .
AMERICAN JOURNAL OF HUMAN GENETICS, 2014, 95 (05) :535-552
[20]   The impact of genetic relationship information on genome-assisted breeding values [J].
Habier, D. ;
Fernando, R. L. ;
Dekkers, J. C. M. .
GENETICS, 2007, 177 (04) :2389-2397