Accuracy of imputation to whole-genome sequence in sheep

被引:40
|
作者
Bolormaa, Sunduimijid [1 ,2 ]
Chamberlain, Amanda J. [1 ]
Khansefid, Majid [1 ,2 ]
Stothard, Paul [3 ]
Swan, Andrew A. [2 ,4 ]
Mason, Brett [1 ]
Prowse-Wilkins, Claire P. [1 ]
Duijvesteijn, Naomi [2 ,5 ]
Moghaddar, Nasir [2 ,5 ]
van der Werf, Julius H. [2 ,5 ]
Daetwyler, Hans D. [1 ,2 ,6 ]
MacLeod, Iona M. [1 ,2 ]
机构
[1] Agr Victoria, Ctr AgriBiosci, AgriBio, 5 Ring Rd, Bundoora, Vic 3083, Australia
[2] Cooperat Res Ctr Sheep Ind Innovat, Armidale, NSW 2351, Australia
[3] Univ Alberta, Fac Agr Life & Environm Sci, Edmonton, AB T6G 2R3, Canada
[4] Univ New England, Anim Genet & Breeding Unit, Armidale, NSW 2351, Australia
[5] Univ New England, Sch Environm & Rural Sci, Armidale, NSW 2351, Australia
[6] La Trobe Univ, Sch Appl Syst Biol, Bundoora, Vic 3086, Australia
关键词
GENOTYPE IMPUTATION; PREDICTIONS; BREEDS; RELIABILITY; VARIANTS; IMPROVE; DESIGN;
D O I
10.1186/s12711-018-0443-5
中图分类号
S8 [畜牧、 动物医学、狩猎、蚕、蜂];
学科分类号
0905 ;
摘要
BackgroundThe use of whole-genome sequence (WGS) data for genomic prediction and association studies is highly desirable because the causal mutations should be present in the data. The sequencing of 935 sheep from a range of breeds provides the opportunity to impute sheep genotyped with single nucleotide polymorphism (SNP) arrays to WGS. This study evaluated the accuracy of imputation from SNP genotypes to WGS using this reference population of 935 sequenced sheep.ResultsThe accuracy of imputation from the Ovine Infinium((R)) HD BeadChip SNP (similar to 500k) to WGS was assessed for three target breeds: Merino, Poll Dorset and F1 Border LeicesterxMerino. Imputation accuracy was highest for the Poll Dorset breed, although there were more Merino individuals in the sequenced reference population than Poll Dorset individuals. In addition, empirical imputation accuracies were higher (by up to 1.7%) when using larger multi-breed reference populations compared to using a smaller single-breed reference population. The mean accuracy of imputation across target breeds using the Minimac3 or the FImpute software was 0.94. The empirical imputation accuracy varied considerably across the genome; six chromosomes carried regions of one or more Mb with a mean imputation accuracy of <0.7. Imputation accuracy in five variant annotation classes ranged from 0.87 (missense) up to 0.94 (intronic variants), where lower accuracy corresponded to higher proportions of rare alleles. The imputation quality statistic reported from Minimac3 (R-2) had a clear positive relationship with the empirical imputation accuracy. Therefore, by first discarding imputed variants with an R-2 below 0.4, the mean empirical accuracy across target breeds increased to 0.97. Although accuracy of genomic prediction was less affected by filtering on R-2 in a multi-breed population of sheep with imputed WGS, the genomic heritability clearly tended to be lower when using variants with an R-2 0.4.ConclusionsThe mean imputation accuracy was high for all target breeds and was increased by combining smaller breed sets into a multi-breed reference. We found that the Minimac3 software imputation quality statistic (R-2) was a useful indicator of empirical imputation accuracy, enabling removal of very poorly imputed variants before downstream analyses.
引用
收藏
页数:17
相关论文
共 50 条
  • [41] Imputation to whole-genome sequence and its use in genome-wide association studies for pork colour traits in crossbred and purebred pigs
    Heidaritabar, Marzieh
    Huisman, Abe
    Krivushin, Kirill
    Stothard, Paul
    Dervishi, Elda
    Charagu, Patrick
    Bink, Marco C. A. M.
    Plastow, Graham S.
    FRONTIERS IN GENETICS, 2022, 13
  • [42] Genome-Wide Association Study on Reproductive Traits Using Imputation-Based Whole-Genome Sequence Data in Yorkshire Pigs
    Sun, Jingchun
    Xiao, Jinhong
    Jiang, Yifan
    Wang, Yaxin
    Cao, Minghao
    Wei, Jialin
    Yu, Taiyong
    Ding, Xiangdong
    Yang, Gongshe
    GENES, 2023, 14 (04)
  • [43] Whole-genome bisulfite sequencing with improved accuracy and cost
    Suzuki, Masako
    Liao, Will
    Wos, Frank
    Johnston, Andrew D.
    DeGrazia, Justin
    Ishii, Jennifer
    Bloom, Toby
    Zody, Michael C.
    Germer, Soren
    Greally, John M.
    GENOME RESEARCH, 2018, 28 (09) : 1364 - 1371
  • [45] Using imputation-based whole-genome sequencing data to improve the accuracy of genomic prediction for combined populations in pigs
    Hailiang Song
    Shaopan Ye
    Yifan Jiang
    Zhe Zhang
    Qin Zhang
    Xiangdong Ding
    Genetics Selection Evolution, 51
  • [46] Using imputation-based whole-genome sequencing data to improve the accuracy of genomic prediction for combined populations in pigs
    Song, Hailiang
    Ye, Shaopan
    Jiang, Yifan
    Zhang, Zhe
    Zhang, Qin
    Ding, Xiangdong
    GENETICS SELECTION EVOLUTION, 2019, 51 (01)
  • [47] Whole-genome sequence of Macaca fascicularis: liver tissue
    Seo, Eun-Hye
    Kim, Jeong-Hwan
    Kim, Da-Hee
    Oh, Jung-Hwa
    Kim, Seon-Young
    BMC GENOMIC DATA, 2023, 24 (01):
  • [48] Addressing chromosome evolution in the whole-genome sequence era
    Faraut, Thomas
    CHROMOSOME RESEARCH, 2008, 16 (01) : 5 - 16
  • [49] Whole-genome sequence annotation: 'Going wrong with confidence'
    Kyrpides, NC
    Ouzounis, CA
    MOLECULAR MICROBIOLOGY, 1999, 32 (04) : 886 - 887
  • [50] Whole-Genome Sequence of the Mycoplasma mucosicanis Type Strain
    Tallmadge, Rebecca L.
    Mitchell, Patrick K.
    Anderson, Renee
    Franklin-Guild, Rebecca
    Goodman, Laura B.
    MICROBIOLOGY RESOURCE ANNOUNCEMENTS, 2019, 8 (41):