Imputation to whole-genome sequence using multiple pig populations and its use in genome-wide association studies

被引:54
作者
van den Berg, Sanne [1 ,2 ]
Vandenplas, Jeremie [1 ]
van Eeuwijk, Fred A. [2 ]
Bouwman, Aniek C. [1 ]
Lopes, Marcos S. [3 ,4 ]
Veerkamp, Roel F. [1 ]
机构
[1] Wageningen Univ & Res, Anim Breeding & Genom, POB 338, NL-6700 AH Wageningen, Netherlands
[2] Wageningen Univ & Res, Biometris, POB 16, NL-6700 AA Wageningen, Netherlands
[3] Topigs Norsvin Res Ctr, NL-6640 AA Beuningen, Netherlands
[4] Topigs Norsvin, BR-80420190 Curitiba, Parana, Brazil
关键词
QUANTITATIVE TRAIT LOCI; GENOTYPE IMPUTATION; LINKAGE DISEQUILIBRIUM; GENETIC DIVERSITY; TEAT NUMBER; ACCURACY; HOLSTEIN; INFERENCE; MARKERS; CATTLE;
D O I
10.1186/s12711-019-0445-y
中图分类号
S8 [畜牧、 动物医学、狩猎、蚕、蜂];
学科分类号
0905 ;
摘要
BackgroundUse of whole-genome sequence data (WGS) is expected to improve identification of quantitative trait loci (QTL). However, this requires imputation to WGS, often with a limited number of sequenced animals for the target population. The objective of this study was to investigate imputation to WGS in two pig lines using a multi-line reference population and, subsequently, to investigate the effect of using these imputed WGS (iWGS) for GWAS.MethodsPhenotypes and genotypes were available on 12,184 Large White pigs (LW-line) and 4943 Dutch Landrace pigs (DL-line). Imputed 660K and 80K genotypes for the LW-line and DL-line, respectively, were imputed to iWGS using Beagle v.4.1. Since only 32 LW-line and 12 DL-line boars were sequenced, 142 animals from eight commercial lines were added. GWAS were performed for each line using the 80K and 660K SNPs, the genotype scores of iWGS SNPs that had an imputation accuracy (Beagle R-2) higher than 0.6, and the dosage scores of all iWGS SNPs.ResultsFor the DL-line (LW-line), imputation of 80K genotypes to iWGS resulted in an average Beagle R-2 of 0.39 (0.49). After quality control, 2.5x10(6) (3.5x10(6)) SNPs had a Beagle R-2 higher than 0.6, resulting in an average Beagle R-2 of 0.83 (0.93). Compared to the 80K and 660K genotypes, using iWGS led to the identification of 48.9 and 64.4% more QTL regions, for the DL-line and LW-line, respectively, and the most significant SNPs in the QTL regions explained a higher proportion of phenotypic variance. Using dosage instead of genotype scores improved the identification of QTL, because the model accounted for uncertainty of imputation, and all SNPs were used in the analysis.ConclusionsImputation to WGS using the multi-line reference population resulted in relatively poor imputation, especially when imputing from 80K (DL-line). In spite of the poor imputation accuracies, using iWGS instead of a lower density SNP chip increased the number of detected QTL and the estimated proportion of phenotypic variance explained by these QTL, especially when dosage scores were used instead of genotype scores. Thus, iWGS, even with poor imputation accuracy, can be used to identify possible interesting regions for fine mapping.
引用
收藏
页数:13
相关论文
共 59 条
  • [1] [Anonymous], ALIGNING SEQUENCE RE, DOI DOI 10.48550/ARXIV.1303.3997
  • [2] Estimation of linkage disequilibrium in four US pig breeds
    Badke, Yvonne M.
    Bates, Ronald O.
    Ernst, Catherine W.
    Schwab, Clint
    Steibel, Juan P.
    [J]. BMC GENOMICS, 2012, 13
  • [3] Detection of quantitative trait loci for teat number and female reproductive traits in Meishan X Large White F2 pigs
    Bidanel, J. P.
    Rosendo, A.
    Iannuccelli, N.
    Riquet, J.
    Gilbert, H.
    Caritez, J. C.
    Billon, Y.
    Amigues, Y.
    Prunier, A.
    Milan, D.
    [J]. ANIMAL, 2008, 2 (06) : 813 - 820
  • [4] Adaptation of Maize to Temperate Climates: Mid-Density Genome-Wide Association Genetics and Diversity Patterns Reveal Key Genomic Regions, with a Major Contribution of the Vgt2 (ZCN8) Locus
    Bouchet, Sophie
    Servin, Bertrand
    Bertin, Pascal
    Madur, Delphine
    Combes, Valerie
    Dumas, Fabrice
    Brunel, Dominique
    Laborde, Jacques
    Charcosset, Alain
    Nicolas, Stephane
    [J]. PLOS ONE, 2013, 8 (08):
  • [5] Consequences of splitting whole-genome sequencing effort over multiple breeds on imputation accuracy
    Bouwman, Aniek C.
    Veerkamp, Roel F.
    [J]. BMC GENETICS, 2014, 15
  • [6] Quantitative trait loci markers derived from whole genome sequence data increases the reliability of genomic prediction
    Brondum, R. F.
    Su, G.
    Janss, L.
    Sahana, G.
    Guldbrandtsen, B.
    Boichard, D.
    Lund, M. S.
    [J]. JOURNAL OF DAIRY SCIENCE, 2015, 98 (06) : 4107 - 4116
  • [7] Strategies for imputation to whole genome sequence using a single or multi-breed reference population in cattle
    Brondum, Rasmus Froberg
    Guldbrandtsen, Bernt
    Sahana, Goutam
    Lund, Mogens Sando
    Su, Guosheng
    [J]. BMC GENOMICS, 2014, 15
  • [8] Genotype Imputation with Millions of Reference Samples
    Browning, Brian L.
    Browning, Sharon R.
    [J]. AMERICAN JOURNAL OF HUMAN GENETICS, 2016, 98 (01) : 116 - 126
  • [9] A Unified Approach to Genotype Imputation and Haplotype-Phase Inference for Large Data Sets of Trios and Unrelated Individuals
    Browning, Brian L.
    Browning, Sharon R.
    [J]. AMERICAN JOURNAL OF HUMAN GENETICS, 2009, 84 (02) : 210 - 223
  • [10] Rapid and accurate haplotype phasing and missing-data inference for whole-genome association studies by use of localized haplotype clustering
    Browning, Sharon R.
    Browning, Brian L.
    [J]. AMERICAN JOURNAL OF HUMAN GENETICS, 2007, 81 (05) : 1084 - 1097