A Function Accounting for Training Set Size and Marker Density to Model the Average Accuracy of Genomic Prediction

被引:42
作者
Erbe, Malena [1 ]
Gredler, Birgit [2 ]
Seefried, Franz Reinhold [2 ]
Bapst, Beat [2 ]
Simianer, Henner [1 ]
机构
[1] Univ Gottingen, Dept Anim Sci, Anim Breeding & Genet Grp, Gottingen, Germany
[2] Qualitas AG, Zug, Switzerland
来源
PLOS ONE | 2013年 / 8卷 / 12期
关键词
BREEDING VALUES; LINKAGE DISEQUILIBRIUM; RELATIONSHIP MATRIX; SELECTION; IMPACT;
D O I
10.1371/journal.pone.0081046
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Prediction of genomic breeding values is of major practical relevance in dairy cattle breeding. Deterministic equations have been suggested to predict the accuracy of genomic breeding values in a given design which are based on training set size, reliability of phenotypes, and the number of independent chromosome segments (Me). The aim of our study was to find a general deterministic equation for the average accuracy of genomic breeding values that also accounts for marker density and can be fitted empirically. Two data sets of similar to 698 Holstein Friesian bulls genotyped with 50 K SNPs and 19332 Brown Swiss bulls genotyped with 50 K SNPs and imputed to,600 K SNPs were available. Different k-fold (k = 2-10, 15, 20) cross-validation scenarios (50 replicates, random assignment) were performed using a genomic BLUP approach. A maximum likelihood approach was used to estimate the parameters of different prediction equations. The highest likelihood was obtained when using a modified form of the deterministic equation of Daetwyler et al. (2010), augmented by a weighting factor (w) based on the assumption that the maximum achievable accuracy is w < 1. The proportion of genetic variance captured by the complete SNP sets (w(2)) was 0.76 to 0.82 for Holstein Friesian and 0.72 to 0.75 for Brown Swiss. When modifying the number of SNPs, w was found to be proportional to the log of the marker density up to a limit which is population and trait specific and was found to be reached with,209000 SNPs in the Brown Swiss population studied.
引用
收藏
页数:11
相关论文
共 50 条
  • [41] Effect of quality control, density and allele frequency of markers on the accuracy of genomic prediction for complex traits in Nellore cattle
    Bresolin, Tiago
    Rosa, Guilherme Jordao de Magalhaes
    Valente, Bruno Dourado
    Espigolan, Rafael
    Mansan Gordo, Daniel Gustavo
    Braz, Camila Urbano
    Fernandes Junior, Gerardo Alves
    Braga Magalhaes, Ana Fabricia
    Garcia, Diogo Anastacio
    Frezarim, Gabriela Bonfa
    Carneiro Leao, Guilherme Fonseca
    Carvalheiro, Roberto
    Baldi, Fernando
    de Oliveira, Henrique Nunes
    de Albuquerque, Lucia Galvao
    [J]. ANIMAL PRODUCTION SCIENCE, 2019, 59 (01) : 48 - 54
  • [42] Quality monitoring in petroleum refinery with regression neural network: Improving prediction accuracy with appropriate design of training set
    Singh, Harshvardhan
    Pani, Ajaya Kumar
    Mohanta, Hare Krishna
    [J]. MEASUREMENT, 2019, 134 : 698 - 709
  • [43] Development of a Genomic Prediction Pipeline for Maintaining Comparable Sample Sizes in Training and Testing Sets across Prediction Schemes Accounting for the Genotype-by-Environment Interaction
    Persa, Reyna
    Grondona, Martin
    Jarquin, Diego
    [J]. AGRICULTURE-BASEL, 2021, 11 (10):
  • [44] Extending the Marker x Environment Interaction Model for Genomic-Enabled Prediction and Genome-Wide Association Analysis in Durum Wheat
    Crossa, Jose
    de los Campos, Gustavo
    Maccaferri, Marco
    Tuberosa, Roberto
    Burgueno, J.
    Perez-Rodriguez, Paulino
    [J]. CROP SCIENCE, 2016, 56 (05) : 2193 - 2209
  • [45] Optimizing genomic prediction using low-density marker panels for streptococcosis resistance in red tilapia (Oreochromis spp.)
    Sukhavachana, S.
    Tongyoo, P.
    Luengnaruemitchai, A.
    Poompuang, S.
    [J]. ANIMAL GENETICS, 2021, 52 (05) : 667 - 674
  • [46] Genomic Prediction Accuracy Using Haplotypes Defined by Size and Hierarchical Clustering Based on Linkage Disequilibrium (vol 11, 134, 2020)
    Won, Sohyoung
    Park, Jong-Eun
    Son, Ju-Hwan
    Lee, Seung-Hwan
    Park, Byeong Ho
    Park, Mina
    Park, Won-Chul
    Chai, Han-Ha
    Kim, Heebal
    Lee, Jungjae
    Lim, Dajeong
    [J]. FRONTIERS IN GENETICS, 2021, 12
  • [47] The accuracy of prediction of genomic selection in elite hybrid rye populations surpasses the accuracy of marker-assisted selection and is equally augmented by multiple field evaluation locations and test years
    Wang, Yu
    Mette, Michael Florian
    Miedaner, Thomas
    Gottwald, Marlen
    Wilde, Peer
    Reif, Jochen C.
    Zhao, Yusheng
    [J]. BMC GENOMICS, 2014, 15
  • [48] Effect of minor allele frequency and density of single nucleotide polymorphism marker arrays on imputation performance and prediction ability using the single-step genomic Best Linear Unbiased Prediction in a simulated beef cattle population
    Rodriguez, Juan Diego
    Peripolli, Elisa
    Londono-Gil, Marisol
    Espigolan, Rafael
    Lobo, Raysildo Barbosa
    Lopez-Correa, Rodrigo
    Aguilar, Ignacio
    Baldi, Fernando
    [J]. ANIMAL PRODUCTION SCIENCE, 2023, 63 (09) : 844 - 852
  • [49] Optimizing Training Population Size and Genotyping Strategy for Genomic Prediction Using Association Study Results and Pedigree Information. A Case of Study in Advanced Wheat Breeding Lines
    Cericola, Fabio
    Jahoor, Ahmed
    Orabi, Jihad
    Andersen, Jeppe R.
    Janss, Luc L.
    Jensen, Just
    [J]. PLOS ONE, 2017, 12 (01):
  • [50] Accuracy of genomic prediction of host resistance to salmon lice in Atlantic salmon (Salmo salar) using imputed high-density genotypes
    Kjetsa, M. H.
    Odegard, J.
    Meuwissen, T. H. E.
    [J]. AQUACULTURE, 2020, 526