A Function Accounting for Training Set Size and Marker Density to Model the Average Accuracy of Genomic Prediction

被引:43
作者
Erbe, Malena [1 ]
Gredler, Birgit [2 ]
Seefried, Franz Reinhold [2 ]
Bapst, Beat [2 ]
Simianer, Henner [1 ]
机构
[1] Univ Gottingen, Dept Anim Sci, Anim Breeding & Genet Grp, Gottingen, Germany
[2] Qualitas AG, Zug, Switzerland
来源
PLOS ONE | 2013年 / 8卷 / 12期
关键词
BREEDING VALUES; LINKAGE DISEQUILIBRIUM; RELATIONSHIP MATRIX; SELECTION; IMPACT;
D O I
10.1371/journal.pone.0081046
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Prediction of genomic breeding values is of major practical relevance in dairy cattle breeding. Deterministic equations have been suggested to predict the accuracy of genomic breeding values in a given design which are based on training set size, reliability of phenotypes, and the number of independent chromosome segments (Me). The aim of our study was to find a general deterministic equation for the average accuracy of genomic breeding values that also accounts for marker density and can be fitted empirically. Two data sets of similar to 698 Holstein Friesian bulls genotyped with 50 K SNPs and 19332 Brown Swiss bulls genotyped with 50 K SNPs and imputed to,600 K SNPs were available. Different k-fold (k = 2-10, 15, 20) cross-validation scenarios (50 replicates, random assignment) were performed using a genomic BLUP approach. A maximum likelihood approach was used to estimate the parameters of different prediction equations. The highest likelihood was obtained when using a modified form of the deterministic equation of Daetwyler et al. (2010), augmented by a weighting factor (w) based on the assumption that the maximum achievable accuracy is w < 1. The proportion of genetic variance captured by the complete SNP sets (w(2)) was 0.76 to 0.82 for Holstein Friesian and 0.72 to 0.75 for Brown Swiss. When modifying the number of SNPs, w was found to be proportional to the log of the marker density up to a limit which is population and trait specific and was found to be reached with,209000 SNPs in the Brown Swiss population studied.
引用
收藏
页数:11
相关论文
共 50 条
  • [21] Training Set Construction for Genomic Prediction in Auto-Tetraploids: An Example in Potato
    Wilson, Stefan
    Malosetti, Marcos
    Maliepaard, Chris
    Mulder, Han A.
    Visser, Richard G. F.
    van Eeuwijk, Fred
    FRONTIERS IN PLANT SCIENCE, 2021, 12
  • [22] The trade-off between density marker panels size and predictive ability of genomic prediction for agronomic traits in Coffea canephora
    de Sousa, Ithalo Coelho
    Barreto, Cynthia Aparecida Valiati
    Caixeta, Eveline Teixeira
    Nascimento, Ana Carolina Campana
    Azevedo, Camila Ferreira
    Alkimim, Emilly Ruas
    Nascimento, Moyses
    EUPHYTICA, 2024, 220 (04)
  • [23] Genomic prediction in hybrid breeding: I. Optimizing the training set design
    Melchinger, Albrecht E. E.
    Fernando, Rohan
    Stricker, Christian
    Schoen, Chris-Carolin
    Auinger, Hans-Juergen
    THEORETICAL AND APPLIED GENETICS, 2023, 136 (08)
  • [24] Accuracy of genomic prediction for seed oil concentration in high-oleic soybean populations using a low-density marker panel
    Hemingway, Joel
    Schnebly, Steve R.
    Rajcan, Istvan
    CROP SCIENCE, 2021, 61 (06) : 4012 - 4021
  • [25] Accuracy of Genomic Prediction in Switchgrass (Panicum virgatum L.) Improved by Accounting for Linkage Disequilibrium
    Ramstein, Guillaume P.
    Evans, Joseph
    Kaeppler, Shawn M.
    Mitchell, Robert B.
    Vogel, Kenneth P.
    Buell, C. Robin
    Casler, Michael D.
    G3-GENES GENOMES GENETICS, 2016, 6 (04): : 1049 - 1062
  • [26] The impact of information quantity and strength of relationship between training set and validation set on accuracy of genomic estimated breeding values
    Saatchi, M.
    Miraei-Ashtiani, S. R.
    Javaremi, A. Nejati
    Moradi-Shahrebabak, M.
    Mehrabani-Yeghaneh, H.
    AFRICAN JOURNAL OF BIOTECHNOLOGY, 2010, 9 (04): : 438 - 442
  • [27] Accuracy of genotype imputation based on reference population size and marker density in Hanwoo cattle
    Lee, DooHo
    Kim, Yeongkuk
    Chung, Yoonji
    Lee, Dongjae
    Seo, Dongwon
    Choi, Tae Jeong
    Lim, Dajeong
    Yoon, Duhak
    Lee, Seung Hwan
    JOURNAL OF ANIMAL SCIENCE AND TECHNOLOGY, 2021, 63 (06) : 1232 - 1246
  • [28] Haplotype analysis of genomic prediction by incorporating genomic pathway information based on high-density SNP marker in Chinese yellow-feathered chicken
    Ye, Haoqiang
    Xu, Zhenqiang
    Bello, Semiu Folaniyi
    Zhu, Qianghui
    Kong, Shaofen
    Zheng, Ming
    Fang, Xiang
    Jia, Xinzheng
    Xu, Haiping
    Zhang, Xiquan
    Nie, Qinghua
    POULTRY SCIENCE, 2023, 102 (05)
  • [29] Impact of Training Set Size and Lead Time on Early Tomato Crop Mapping Accuracy
    Croci, Michele
    Impollonia, Giorgio
    Blandinieres, Henri
    Colauzzi, Michele
    Amaducci, Stefano
    REMOTE SENSING, 2022, 14 (18)
  • [30] The effect of marker types and density on genomic prediction and GWAS of key performance traits in tetraploid potato
    Aalborg, Trine
    Sverrisdottir, Elsa
    Kristensen, Heidi Thorgaard
    Nielsen, Kare Lehmann
    FRONTIERS IN PLANT SCIENCE, 2024, 15