A multi-breed reference panel and additional rare variants maximize imputation accuracy in cattle

被引:31
|
作者
Rowan, Troy N. [1 ]
Hoff, Jesse L. [1 ]
Crum, Tamar E. [1 ]
Taylor, Jeremy F. [1 ]
Schnabel, Robert D. [1 ,2 ]
Decker, Jared E. [1 ,2 ]
机构
[1] Univ Missouri, Div Anim Sci, Columbia, MO 65211 USA
[2] Univ Missouri, Informat Inst, Columbia, MO 65211 USA
基金
美国食品与农业研究所;
关键词
DAIRY-CATTLE; GENOMIC SELECTION; IMPROVING ACCURACY; MILK-YIELD; NUCLEOTIDE; IDENTIFICATION; ASSOCIATION; PREDICTIONS; MUTATIONS; QTL;
D O I
10.1186/s12711-019-0519-x
中图分类号
S8 [畜牧、 动物医学、狩猎、蚕、蜂];
学科分类号
0905 ;
摘要
Background: During the last decade, the use of common-variant array-based single nucleotide polymorphism (SNP) genotyping in the beef and dairy industries has produced an astounding amount of medium-to-low density genomic data. Although low-density assays work well in the context of genomic prediction, they are less useful for detecting and mapping causal variants and the effects of rare variants are not captured. The objective of this project was to maximize the accuracies of genotype imputation from medium- and low-density assays to the marker set obtained by combining two high-density research assays (similar to 850,000 SNPs), the Illumina BovineHD and the GGP-F250 assays, which contains a large proportion of rare and potentially functional variants and for which the assay design is described here. This 850 K SNP set is useful for both imputation to sequence-level genotypes and direct downstream analysis. Results: We found that a large multi-breed composite imputation reference panel that includes 36,131 samples with either BovineHD and/or GGP-F250 genotypes significantly increased imputation accuracy compared with a within-breed reference panel, particularly at variants with low minor allele frequencies. Individual animal imputation accuracies were maximized when more genetically similar animals were represented in the composite reference panel, particularly with complete 850 K genotypes. The addition of rare variants from the GGP-F250 assay to our composite reference panel significantly increased the imputation accuracy of rare variants that are exclusively present on the BovineHD assay. In addition, we show that an assay marker density of 50 K SNPs balances cost and accuracy for imputation to 850 K. Conclusions: Using high-density genotypes on all available individuals in a multi-breed reference panel maximized imputation accuracy for tested cattle populations. Admixed animals or those from breeds with a limited representation in the composite reference panel were still imputed at high accuracy, which is expected to further increase as the reference panel expands. We anticipate that the addition of rare variants from the GGP-F250 assay will increase the accuracy of imputation to sequence level.
引用
收藏
页数:16
相关论文
共 50 条
  • [1] A multi-breed reference panel and additional rare variants maximize imputation accuracy in cattle
    Troy N. Rowan
    Jesse L. Hoff
    Tamar E. Crum
    Jeremy F. Taylor
    Robert D. Schnabel
    Jared E. Decker
    Genetics Selection Evolution, 51
  • [2] A large multi-breed reference panel and maximizes genotype imputation accuracy in cattle.
    Rowan, Troy
    Hoff, Jesse L.
    Crum, Tamar
    Taylor, Jerry F.
    Schnabel, Robert
    Decker, Jared E.
    JOURNAL OF ANIMAL SCIENCE, 2019, 97 : 18 - 18
  • [3] Strategies for imputation to whole genome sequence using a single or multi-breed reference population in cattle
    Rasmus Froberg Brøndum
    Bernt Guldbrandtsen
    Goutam Sahana
    Mogens Sandø Lund
    Guosheng Su
    BMC Genomics, 15
  • [4] Assessing accuracy of imputation using different SNP panel densities in a multi-breed sheep population
    Ventura, Ricardo V.
    Miller, Stephen P.
    Dodds, Ken G.
    Auvray, Benoit
    Lee, Michael
    Bixley, Matthew
    Clarke, Shannon M.
    McEwan, John C.
    GENETICS SELECTION EVOLUTION, 2016, 48
  • [5] Assessing accuracy of imputation using different SNP panel densities in a multi-breed sheep population
    Ricardo V. Ventura
    Stephen P. Miller
    Ken G. Dodds
    Benoit Auvray
    Michael Lee
    Matthew Bixley
    Shannon M. Clarke
    John C. McEwan
    Genetics Selection Evolution, 48
  • [6] Strategies for imputation to whole genome sequence using a single or multi-breed reference population in cattle
    Brondum, Rasmus Froberg
    Guldbrandtsen, Bernt
    Sahana, Goutam
    Lund, Mogens Sando
    Su, Guosheng
    BMC GENOMICS, 2014, 15
  • [7] A comprehensive evaluation of factors affecting the accuracy of pig genotype imputation using a single or multi-breed reference population
    ZHANG Kai-li
    PENG Xia
    ZHANG Sai-xian
    ZHAN Hui-wen
    LU Jia-hui
    XIE Sheng-song
    ZHAO Shu-hong
    LI Xin-yun
    MA Yun-long
    JournalofIntegrativeAgriculture, 2022, 21 (02) : 486 - 495
  • [8] A comprehensive evaluation of factors affecting the accuracy of pig genotype imputation using a single or multi-breed reference population
    Zhang Kai-li
    Peng Xia
    Zhang Sai-xian
    Zhan Hui-wen
    Lu Jia-hui
    Xie Sheng-song
    Zhao Shu-hong
    Li Xin-yun
    Ma Yun-long
    JOURNAL OF INTEGRATIVE AGRICULTURE, 2022, 21 (02) : 486 - 495
  • [9] Accuracy of genomic breeding values in multi-breed dairy cattle populations
    Hayes, Ben J.
    Bowman, Phillip J.
    Chamberlain, Amanda C.
    Verbyla, Klara
    Goddard, Mike E.
    GENETICS SELECTION EVOLUTION, 2009, 41
  • [10] Accuracy of genomic breeding values in multi-breed dairy cattle populations
    Ben J Hayes
    Phillip J Bowman
    Amanda C Chamberlain
    Klara Verbyla
    Mike E Goddard
    Genetics Selection Evolution, 41