Components of the accuracy of genomic prediction in a multi-breed sheep population

被引:119
作者
Daetwyler, H. D. [1 ,2 ]
Kemper, K. E. [1 ,3 ]
van der Werf, J. H. J. [2 ,4 ]
Hayes, B. J. [1 ,2 ,5 ]
机构
[1] Dept Primary Ind, Biosci Res Div, Bundoora, Vic 3083, Australia
[2] Cooperat Res Ctr Sheep Ind Innovat, Armidale, NSW 2351, Australia
[3] Univ Melbourne, Fac Land & Environm, Parkville, Vic 3010, Australia
[4] Univ New England, Sch Environm & Rural Sci, Armidale, NSW 2351, Australia
[5] La Trobe Univ, Bundoora, Vic 3086, Australia
关键词
genomic prediction; genomic selection; principal component analysis; population structure; relationship; sheep; GENETIC-RELATIONSHIP INFORMATION; QUANTITATIVE TRAIT LOCI; COMPLEX TRAITS; VALUES; STRATIFICATION; IMPACT; MEAT; DETECT; SCAN; WOOL;
D O I
10.2527/jas.2011-4557
中图分类号
S8 [畜牧、 动物医学、狩猎、蚕、蜂];
学科分类号
0905 ;
摘要
In genome-wide association studies, failure to remove variation due to population structure results in spurious associations. In contrast, for predictions of future phenotypes or estimated breeding values from dense SNP data, exploiting population structure arising from relatedness can actually increase the accuracy of prediction in some cases, for example, when the selection candidates are offspring of the reference population where the prediction equation was derived. In populations with large effective population size or with multiple breeds and strains, it has not been demonstrated whether and when accounting for or removing variation due to population structure will affect the accuracy of genomic prediction. Our aim in this study was to determine whether accounting for population structure would increase the accuracy of genomic predictions, both within and across breeds. First, we have attempted to decompose the accuracy of genomic prediction into contributions from population structure or linkage disequilibrium (LD) between markers and QTL using a diverse multi-breed sheep (Ovis aries) data set, genotyped for 48,640 SNP. We demonstrate that SNP from a single chromosome can achieve up to 86% of the accuracy for genomic predictions using all SNP. This result suggests that most of the prediction accuracy is due to population structure, because a single chromosome is expected to capture relationships but is unlikely to contain all QTL. We then explored principal component analysis (PCA) as an approach to disentangle the respective contributions of population structure and LD between SNP and QTL to the accuracy of genomic predictions. Results showed that fitting an increasing number of principle components (PC; as covariates) decreased within breed accuracy until a lower plateau was reached. We speculate that this plateau is a measure of the accuracy due to LD. In conclusion, a large proportion of the accuracy for genomic predictions in our data was due to variation associated with population structure. Surprisingly, accounting for this structure generally decreased the accuracy of across breed genomic predictions.
引用
收藏
页码:3375 / 3384
页数:10
相关论文
共 30 条
[1]  
[Anonymous], 2010, R LANG ENV STAT COMP
[2]   A Unified Approach to Genotype Imputation and Haplotype-Phase Inference for Large Data Sets of Trios and Unrelated Individuals [J].
Browning, Brian L. ;
Browning, Sharon R. .
AMERICAN JOURNAL OF HUMAN GENETICS, 2009, 84 (02) :210-223
[3]   A genome scan to detect quantitative trait loci for economically important traits in Holstein cattle using two methods and a dense single nucleotide polymorphism map [J].
Daetwyler, H. D. ;
Schenkel, F. S. ;
Sargolzaei, M. ;
Robinson, J. A. B. .
JOURNAL OF DAIRY SCIENCE, 2008, 91 (08) :3225-3236
[4]   Accuracy of estimated genomic breeding values for wool and meat traits in a multi-breed sheep population [J].
Daetwyler, H. D. ;
Hickey, J. M. ;
Henshall, J. M. ;
Dominik, S. ;
Gredler, B. ;
van der Werf, J. H. J. ;
Hayes, B. J. .
ANIMAL PRODUCTION SCIENCE, 2010, 50 (11-12) :1004-1010
[5]   Reliability of Genomic Predictions Across Multiple Populations [J].
de Roos, A. P. W. ;
Hayes, B. J. ;
Goddard, M. E. .
GENETICS, 2009, 183 (04) :1545-1553
[6]   Assessing the impact of population stratification on genetic association studies [J].
Freedman, ML ;
Reich, D ;
Penney, KL ;
McDonald, GJ ;
Mignault, AA ;
Patterson, N ;
Gabriel, SB ;
Topol, EJ ;
Smoller, JW ;
Pato, CN ;
Pato, MT ;
Petryshen, TYL ;
Kolonel, LN ;
Lander, ES ;
Sklar, P ;
Henderson, B ;
Hirschhorn, JN ;
Altshuler, D .
NATURE GENETICS, 2004, 36 (04) :388-393
[7]  
Gilmour A.R., 2009, 2009 ASReml User Guide Release 3.0
[8]   Mapping genes for complex traits in domestic animals and their use in breeding programmes [J].
Goddard, Michael E. ;
Hayes, Ben J. .
NATURE REVIEWS GENETICS, 2009, 10 (06) :381-391
[9]   The impact of genetic relationship information on genome-assisted breeding values [J].
Habier, D. ;
Fernando, R. L. ;
Dekkers, J. C. M. .
GENETICS, 2007, 177 (04) :2389-2397
[10]   The impact of genetic relationship information on genomic breeding values in German Holstein cattle [J].
Habier, David ;
Tetens, Jens ;
Seefried, Franz-Reinhold ;
Lichtner, Peter ;
Thaller, Georg .
GENETICS SELECTION EVOLUTION, 2010, 42