A Function Accounting for Training Set Size and Marker Density to Model the Average Accuracy of Genomic Prediction

Cited by: 45
Authors
Erbe, Malena [1]
Gredler, Birgit [2]
Seefried, Franz Reinhold [2]
Bapst, Beat [2]
Simianer, Henner [1]
Affiliations
[1] Univ Gottingen, Dept Anim Sci, Anim Breeding & Genet Grp, Gottingen, Germany
[2] Qualitas AG, Zug, Switzerland
Keywords
BREEDING VALUES; LINKAGE DISEQUILIBRIUM; RELATIONSHIP MATRIX; SELECTION; IMPACT;
DOI
10.1371/journal.pone.0081046
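Chinese Library Classification (CLC) codes
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences]
Discipline classification codes
07; 0710; 09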
Abstract
Prediction of genomic breeding values is of major practical relevance in dairy cattle breeding. Deterministic equations based on training set size, reliability of phenotypes, and the number of independent chromosome segments (M_e) have been suggested to predict the accuracy of genomic breeding values in a given design. The aim of our study was to find a general deterministic equation for the average accuracy of genomic breeding values that also accounts for marker density and can be fitted empirically. Two data sets were available: 5,698 Holstein Friesian bulls genotyped with 50 K SNPs, and 1,332 Brown Swiss bulls genotyped with 50 K SNPs and imputed to ~600 K SNPs. Different k-fold (k = 2-10, 15, 20) cross-validation scenarios (50 replicates, random assignment) were performed using a genomic BLUP approach. A maximum likelihood approach was used to estimate the parameters of different prediction equations. The highest likelihood was obtained when using a modified form of the deterministic equation of Daetwyler et al. (2010), augmented by a weighting factor (w) based on the assumption that the maximum achievable accuracy is w < 1. The proportion of genetic variance captured by the complete SNP sets (w²) was 0.76 to 0.82 for Holstein Friesian and 0.72 to 0.75 for Brown Swiss. When modifying the number of SNPs, w was found to be proportional to the log of the marker density up to a limit which is population and trait specific and was found to be reached with ~20,000 SNPs in the Brown Swiss population studied.
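The abstract does not print the prediction equation itself. As a rough sketch, assuming the commonly cited Daetwyler-type form r = sqrt(N·r²/(N·r² + M_e)) scaled by the weighting factor w, the expected accuracy under k-fold cross-validation could be computed as below. The function names and all numeric inputs (total sample size, reliability, M_e, w) are hypothetical and chosen for illustration only; they are not values taken from the paper.

```python
import math


def training_size_kfold(n_total, k):
    """Training set size when one of k equally sized folds is held out."""
    if k < 2:
        raise ValueError("k must be at least 2")
    return n_total * (k - 1) / k


def expected_accuracy(n_train, reliability, m_e, w=1.0):
    """Assumed form: w * sqrt(N * r2 / (N * r2 + M_e)).

    n_train     -- training set size N
    reliability -- reliability of the phenotypes (r2), in (0, 1]
    m_e         -- number of independent chromosome segments
    w           -- weighting factor, the upper bound on achievable accuracy
    """
    if not 0.0 < w <= 1.0:
        raise ValueError("w must lie in (0, 1]")
    signal = n_train * reliability
    return w * math.sqrt(signal / (signal + m_e))


if __name__ == "__main__":
    # Hypothetical inputs, for illustration only.
    n_total, reliability, m_e, w = 1000, 0.8, 500, 0.9
    for k in (2, 5, 10, 20):
        n_train = training_size_kfold(n_total, k)
        acc = expected_accuracy(n_train, reliability, m_e, w)
        print(f"k={k:2d}  N={n_train:6.1f}  expected accuracy={acc:.3f}")
```

Under this assumed form, accuracy rises with k because larger k leaves more animals in the training set, and it approaches w rather than 1 as N grows, which matches the abstract's reading of w as the maximum achievable accuracy.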
Pages: 11
Related papers
32 in total
[1]  
[Anonymous], 2013, R LANG ENV STAT COMP
[2]   Rapid and accurate haplotype phasing and missing-data inference for whole-genome association studies by use of localized haplotype clustering [J].
Browning, Sharon R. ;
Browning, Brian L. .
AMERICAN JOURNAL OF HUMAN GENETICS, 2007, 81 (05) :1084-1097
[3]   The Impact of Genetic Architecture on Genome-Wide Evaluation Methods [J].
Daetwyler, Hans D. ;
Pong-Wong, Ricardo ;
Villanueva, Beatriz ;
Woolliams, John A. .
GENETICS, 2010, 185 (03) :1021-1031
[4]   Accuracy of Predicting the Genetic Risk of Disease Using a Genome-Wide Approach [J].
Daetwyler, Hans D. ;
Villanueva, Beatriz ;
Woolliams, John A. .
PLOS ONE, 2008, 3 (10)
[5]  
Daetwyler HD, 2009, THESIS WAGENINGEN U
[6]   Prediction of Complex Human Traits Using the Genomic Best Linear Unbiased Predictor [J].
de los Campos, Gustavo ;
Vazquez, Ana I. ;
Fernando, Rohan ;
Klimentidis, Yann C. ;
Sorensen, Daniel .
PLOS GENETICS, 2013, 9 (07)
[7]   Prediction of response to marker-assisted and genomic selection using selection index theory [J].
Dekkers, J. C. M. .
JOURNAL OF ANIMAL BREEDING AND GENETICS, 2007, 124 (06) :331-341
[8]   Improving accuracy of genomic predictions within and between dairy cattle breeds with imputed high-density single nucleotide polymorphism panels [J].
Erbe, M. ;
Hayes, B. J. ;
Matukumalli, L. K. ;
Goswami, S. ;
Bowman, P. J. ;
Reich, C. M. ;
Mason, B. A. ;
Goddard, M. E. .
JOURNAL OF DAIRY SCIENCE, 2012, 95 (07) :4114-4129
[9]  
Gilmour A.R., 2009, ASReml User Guide Release 3.0
[10]   Using the genomic relationship matrix to predict the accuracy of genomic selection [J].
Goddard, M. E. ;
Hayes, B. J. ;
Meuwissen, T. H. E. .
JOURNAL OF ANIMAL BREEDING AND GENETICS, 2011, 128 (06) :409-421