On the use of whole-genome sequence data for across-breed genomic prediction and fine-scale mapping of QTL

Cited by: 25
Authors
Meuwissen, Theo [1]
van den Berg, Irene [2]
Goddard, Mike [2, 3]
Affiliations
[1] Norwegian Univ Life Sci, Box 5003, N-1432 As, Norway
[2] Agr Victoria, Bundoora, Vic, Australia
[3] Univ Melbourne, Fac Vet & Agr Sci, Parkville, Vic, Australia
DOI
10.1186/s12711-021-00607-4
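Chinese Library Classification (CLC) number
S8 [Animal husbandry, veterinary medicine, hunting, sericulture, apiculture]
Discipline classification code
0905
Abstract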
Background: Whole-genome sequence (WGS) data are increasingly available on large numbers of individuals in animal and plant breeding and in human genetics through second-generation resequencing technologies, 1000 genomes projects, and large-scale genotype imputation from lower marker densities. Here, we present a computationally fast implementation of a variable selection genomic prediction method that can handle WGS data on more than 35,000 individuals, test its accuracy for across-breed predictions, and assess its quantitative trait locus (QTL) mapping precision.
Methods: The Monte Carlo Markov chain (MCMC) variable selection model (Bayes GC) simultaneously fits a genomic best linear unbiased prediction (GBLUP) term, i.e. a polygenic effect whose correlations are described by a genomic relationship matrix (G), and a Bayes C term, i.e. a set of single nucleotide polymorphisms (SNPs) with large effects selected by the model. Computational speed is improved by a Metropolis-Hastings sampling that directs computations to the SNPs that are, a priori, most likely to be included in the model. Speed is also improved by running many relatively short MCMC chains. Memory requirements are reduced by storing the genotype matrix in binary form. The model was tested on a WGS dataset containing Holstein, Jersey and Australian Red cattle. The data contained 4,809,520 genotypes on 35,549 individuals together with their milk, fat and protein yields, and fat and protein percentage traits.
Results: The prediction accuracies of the Jersey individuals improved by 1.5% when using across-breed GBLUP compared to within-breed predictions. Using WGS instead of 600k SNP-chip data yielded on average a 3% accuracy improvement for Australian Red cows. QTL were fine-mapped by locating the SNP with the highest posterior probability of being included in the model. Various QTL known from the literature were rediscovered, and a new SNP affecting milk production was discovered on chromosome 20 at 34.501126 Mb. Due to the high mapping precision, it was clear that many of the discovered QTL were the same across the five dairy traits.
Conclusions: Across-breed Bayes GC genomic prediction improved prediction accuracies compared to GBLUP. The combination of across-breed WGS data and Bayesian genomic prediction proved remarkably effective for the fine-mapping of QTL.
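The abstract states only that the genotype matrix is stored "in binary form" and does not describe the encoding. As an illustration of why this saves memory, the following is a minimal sketch assuming a 2-bit code per genotype (0, 1, 2 copies of an allele, 3 for missing), four genotypes per byte. The function names pack_genotypes and unpack_genotypes are hypothetical and are not taken from the paper's software. If the 4,809,520 genotypes are per individual, one byte per genotype for 35,549 individuals needs roughly 171 GB, while a 2-bit code needs roughly 43 GB.

```python
import numpy as np


def pack_genotypes(genotypes: np.ndarray) -> np.ndarray:
    """Pack genotype codes (0, 1, 2; 3 = missing) into 2 bits each.

    Assumed encoding for illustration only: four genotypes per byte,
    with the first genotype in the two lowest bits.
    """
    g = np.asarray(genotypes, dtype=np.uint8)
    if g.ndim != 1 or np.any(g > 3):
        raise ValueError("expected a 1-D array of codes in {0, 1, 2, 3}")
    # Pad with zeros so the length is a multiple of four.
    pad = (-g.size) % 4
    g = np.concatenate([g, np.zeros(pad, dtype=np.uint8)])
    quads = g.reshape(-1, 4)
    return (
        quads[:, 0]
        | (quads[:, 1] << 2)
        | (quads[:, 2] << 4)
        | (quads[:, 3] << 6)
    ).astype(np.uint8)


def unpack_genotypes(packed: np.ndarray, n: int) -> np.ndarray:
    """Recover the first n genotype codes from a packed byte array."""
    p = np.asarray(packed, dtype=np.uint8)
    shifts = np.array([0, 2, 4, 6], dtype=np.uint8)
    # Broadcast each byte against the four shifts, then mask two bits.
    codes = (p[:, None] >> shifts) & 3
    return codes.reshape(-1)[:n]


if __name__ == "__main__":
    snp = np.array([0, 1, 2, 2, 1, 0, 3], dtype=np.uint8)
    packed = pack_genotypes(snp)
    restored = unpack_genotypes(packed, snp.size)
    print(packed.nbytes, "bytes packed for", snp.size, "genotypes")
    print("round trip ok:", bool(np.array_equal(restored, snp)))
```

In a layout like this, each SNP can be packed as one contiguous row, so a sampler that visits only a subset of SNPs per iteration needs to unpack just those rows.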
Pages: 15
Related papers
  • [1] On the use of whole-genome sequence data for across-breed genomic prediction and fine-scale mapping of QTL
    Meuwissen, Theo
    van den Berg, Irene
    Goddard, Mike
    Genetics Selection Evolution, 53
  • [2] Utility of whole-genome sequence data for across-breed genomic prediction
    Raymond, Biaty
    Bouwman, Aniek C.
    Schrooten, Chris
    Houwing-Duistermaat, Jeanine
    Veerkamp, Roel F.
    Genetics Selection Evolution, 2018, 50
  • [3] Within- and across-breed genomic prediction using whole-genome sequence and single nucleotide polymorphism panels
    Iheshiulor, Oscar O. M.
    Woolliams, John A.
    Yu, Xijiang
    Wellmann, Robin
    Meuwissen, Theo H. E.
    Genetics Selection Evolution, 2016, 48
  • [4] Use of whole-genome sequence data for fine mapping and genomic prediction of sea louse resistance in Atlantic salmon
    Onabanjo, Olumide
    Meuwissen, Theo
    Aslam, Muhammad Luqman
    Schmitt, Armin Otto
    Dagnachew, Binyam
    Frontiers in Genetics, 2024, 15
  • [5] Genomic prediction in a numerically small breed population using prioritized genetic markers from whole-genome sequence data
    Moghaddar, Nasir
    Brown, Daniel J.
    Swan, Andrew A.
    Gurman, Phillip M.
    Li, Li
    van der Werf, Julius H.
    Journal of Animal Breeding and Genetics, 2022, 139 (01): 71-83
  • [6] Genomic prediction with whole-genome sequence data in intensely selected pig lines
    Ros-Freixedes, Roger
    Johnsson, Martin
    Whalen, Andrew
    Chen, Ching-Yi
    Valente, Bruno D.
    Herring, William O.
    Gorjanc, Gregor
    Hickey, John M.
    Genetics Selection Evolution, 2022, 54 (01)
  • [7] Strategies for Obtaining and Pruning Imputed Whole-Genome Sequence Data for Genomic Prediction
    Ye, Shaopan
    Gao, Ning
    Zheng, Rongrong
    Chen, Zitao
    Teng, Jinyan
    Yuan, Xiaolong
    Zhang, Hao
    Chen, Zanmou
    Zhang, Xiquan
    Li, Jiaqi
    Zhang, Zhe
    Frontiers in Genetics, 2019, 10