Impact of Marker Pruning Strategies Based on Different Measurements of Marker Distance on Genomic Prediction in Dairy Cattle

被引:3
作者
Ren, Duanyang [1 ]
Teng, Jinyan [1 ]
Diao, Shuqi [1 ]
Lin, Qing [1 ]
Li, Jiaqi [1 ]
Zhang, Zhe [1 ]
机构
[1] South China Agr Univ, Coll Anim Sci, Guangdong Lab Lingnan Modern Agr, Guangdong Prov Key Lab Agroanim Genom & Mol Breed, Guangzhou 510642, Peoples R China
来源
ANIMALS | 2021年 / 11卷 / 07期
基金
中国国家自然科学基金;
关键词
genomic prediction; marker density; genetic distance; physical distance; Holstein dairy cattle; high-density SNP data; SINGLE NUCLEOTIDE POLYMORPHISM; GENETIC ARCHITECTURE; DENSITY; ACCURACY; IMPUTATION; SELECTION; SNP; ASSOCIATION; EFFICIENT; VARIANCE;
D O I
10.3390/ani11071992
中图分类号
S8 [畜牧、 动物医学、狩猎、蚕、蜂];
学科分类号
0905 ;
摘要
With the availability of high-density single-nucleotide polymorphism (SNP) data and the development of genotype imputation methods, high-density panel-based genomic prediction (GP) has become possible in livestock breeding. It is generally considered that the genomic estimated breeding value (GEBV) accuracy increases with the marker density, while studies have shown that the GEBV accuracy does not increase or even decrease when high-density panels were used. Therefore, in addition to the SNP number, other measurements of 'marker density' seem to have impacts on the GEBV accuracy, and exploring the relationship between the GEBV accuracy and the measurements of 'marker density' based on high-density SNP or whole-genome sequence data is important for the field of GP. In this study, we constructed different SNP panels with certain SNP numbers (e.g., 1 k) by using the physical distance (PhyD), genetic distance (GenD) and random distance (RanD) between SNPs respectively based on the high-density SNP data of a Germany Holstein dairy cattle population. Therefore, there are three different panels at a certain SNP number level. These panels were used to construct GP models to predict fat percentage, milk yield and somatic cell score. Meanwhile, the mean ((d) over bar) and variance (sigma(2)(d)) of the physical distance between SNPs and the mean (r (2) over bar) and variance (sigma(2)(r2)) of the genetic distance between SNPs in each panel were used as marker density-related measurements and their influence on the GEBV accuracy was investigated. At the same SNP number level, the d of all panels is basically the same, but the sigma(2)(d), r((2) over bar) and sigma(2)(r2) are different. Therefore, we only investigated the effects of sigma(2)(d), r((2) over bar) and sigma(2)(r2) on the GEBV accuracy. The results showed that at a certain SNP number level, the GEBV accuracy was negatively correlated with sigma(2)(d), but not with r((2) over bar) and sigma(2)(r2). Compared with GenD and RanD, the sigma(2)(d) of panels constructed by PhyD is smaller. The low and moderate-density panels (< 50 k) constructed by RanD or GenD have large sigma(2)(d), which is not conducive to genomic prediction. The GEBV accuracy of the low and moderate-density panels constructed by PhyD is 3.8 similar to 34.8% higher than that of the low and moderate-density panels constructed by RanD and GenD. Panels with 20-30 k SNPs constructed by PhyD can achieve the same or slightly higher GEBV accuracy than that of high-density SNP panels for all three traits. In summary, the smaller the variation degree of physical distance between adjacent SNPs, the higher the GEBV accuracy. The low and moderate-density panels construct by physical distance are beneficial to genomic prediction, while pruning high-density SNP data based on genetic distance is detrimental to genomic prediction. The results provide suggestions for the development of SNP panels and the research of genome prediction based on whole-genome sequence data.
引用
收藏
页数:15
相关论文
共 45 条
  • [1] The feasibility of using low-density marker panels for genotype imputation and genomic prediction of crossbred dairy cattle of East Africa
    Aliloo, H.
    Mrode, R.
    Okeyo, A. M.
    Ni, G.
    Goddard, M. E.
    Gibson, J. P.
    [J]. JOURNAL OF DAIRY SCIENCE, 2018, 101 (10) : 9108 - 9127
  • [2] Genomic Predictions Using Low-Density SNP Markers, Pedigree and GWAS Information: A Case Study with the Non-Model Species Eucalyptus cladocalyx
    Ballesta, Paulina
    Bush, David
    Silva, Fabyano Fonseca
    Mora, Freddy
    [J]. PLANTS-BASEL, 2020, 9 (01):
  • [3] Imputation of genotypes from low- to high-density genotyping platforms and implications for genomic selection
    Berry, D. P.
    Kearney, J. F.
    [J]. ANIMAL, 2011, 5 (08) : 1162 - 1169
  • [4] Design of a low-density SNP chip for the main Australian sheep breeds and its effect on imputation and genomic prediction accuracy
    Bolormaa, S.
    Gore, K.
    van der Werf, J. H. J.
    Hayes, B. J.
    Daetwyler, H. D.
    [J]. ANIMAL GENETICS, 2015, 46 (05) : 544 - 556
  • [5] Quantitative trait loci markers derived from whole genome sequence data increases the reliability of genomic prediction
    Brondum, R. F.
    Su, G.
    Janss, L.
    Sahana, G.
    Guldbrandtsen, B.
    Boichard, D.
    Lund, M. S.
    [J]. JOURNAL OF DAIRY SCIENCE, 2015, 98 (06) : 4107 - 4116
  • [6] Strategies for imputation to whole genome sequence using a single or multi-breed reference population in cattle
    Brondum, Rasmus Froberg
    Guldbrandtsen, Bernt
    Sahana, Goutam
    Lund, Mogens Sando
    Su, Guosheng
    [J]. BMC GENOMICS, 2014, 15
  • [7] Rapid and accurate haplotype phasing and missing-data inference for whole-genome association studies by use of localized haplotype clustering
    Browning, Sharon R.
    Browning, Brian L.
    [J]. AMERICAN JOURNAL OF HUMAN GENETICS, 2007, 81 (05) : 1084 - 1097
  • [8] Accuracy of breeding values when using and ignoring the polygenic effect in genomic breeding value estimation with a marker density of one SNP per cM
    Calus, M. P. L.
    Veerkamp, R. F.
    [J]. JOURNAL OF ANIMAL BREEDING AND GENETICS, 2007, 124 (06) : 362 - 368
  • [9] Whole-genome sequencing of 234 bulls facilitates mapping of monogenic and complex traits in cattle
    Daetwyler, Hans D.
    Capitan, Aurelien
    Pausch, Hubert
    Stothard, Paul
    Van Binsbergen, Rianne
    Brondum, Rasmus F.
    Liao, Xiaoping
    Djari, Anis
    Rodriguez, Sabrina C.
    Grohs, Cecile
    Esquerre, Diane
    Bouchez, Olivier
    Rossignol, Marie-Noelle
    Klopp, Christophe
    Rocha, Dominique
    Fritz, Sebastien
    Eggen, Andre
    Bowman, Phil J.
    Coote, David
    Chamberlain, Amanda J.
    Anderson, Charlotte
    VanTassell, Curt P.
    Hulsegge, Ina
    Goddard, Mike E.
    Guldbrandtsen, Bernt
    Lund, Mogens S.
    Veerkamp, Roel F.
    Boichard, Didier A.
    Fries, Ruedi
    Hayes, Ben J.
    [J]. NATURE GENETICS, 2014, 46 (08) : 858 - 865
  • [10] Toward genomic prediction from whole-genome sequence data: impact of sequencing design on genotype imputation and accuracy of predictions
    Druet, T.
    Macleod, I. M.
    Hayes, B. J.
    [J]. HEREDITY, 2014, 112 (01) : 39 - 47