The effect of rare alleles on estimated genomic relationships from whole genome sequence data

Cited by: 34
Authors
Eynard, Sonia E. [1, 2, 3, 4]
Windig, Jack J. [1, 4]
Leroy, Gregoire [2, 3]
van Binsbergen, Rianne [1, 5]
Calus, Mario P. L. [1]
Affiliations
[1] Wageningen UR Livestock Res, Anim Breeding & Genom Ctr, NL-6700 AH Wageningen, Netherlands
[2] AgroParisTech, UMR Genet Anim & Biol Integrat 1313, F-75231 Paris 05, France
[3] INRA, UMR Genet Anim & Biol Integrat 1313, F-78350 Jouy En Josas, France
[4] Wageningen UR, Ctr Genet Resources Netherlands, NL-6700 AA Wageningen, Netherlands
[5] Wageningen UR, Biometris, NL-6700 AA Wageningen, Netherlands
Keywords
Whole genome sequence; Additive genetic relationship; Rare variants; Minor allele frequency; Inbreeding; PEDIGREE; CONSERVATION; INFORMATION; POPULATION; ACCURACY; COEFFICIENTS; IMPROVEMENT; CHALLENGES; PREDICTION; IMPUTATION
DOI
10.1186/s12863-015-0185-0
Chinese Library Classification (CLC) number
Q3 [Genetics]
Discipline classification codes
071007; 090102
Abstract
Background: Relationships between individuals and inbreeding coefficients are commonly used for breeding decisions, but may be affected by the type of data used for their estimation. The proportion of variants with low Minor Allele Frequency (MAF) is larger in whole genome sequence (WGS) data than in Single Nucleotide Polymorphism (SNP) chip data. Therefore, WGS data provide true relationships between individuals and may influence breeding decisions and prioritisation for conservation of genetic diversity in livestock. This study identifies differences between relationships and inbreeding coefficients estimated using pedigree, SNP or WGS data for 118 Holstein bulls from the 1000 Bull Genomes project. To determine the impact of rare alleles on the estimates, we compared three scenarios of MAF restrictions: variants with a MAF higher than 5%, variants with a MAF higher than 1%, and variants with a MAF between 1% and 5%.
Results: We observed significant differences between estimated relationships and, although less significantly, between inbreeding coefficients from pedigree, SNP or WGS data, and between MAF restriction scenarios. Correlations between pedigree and genomic estimates, computed within groups with similar relationships, ranged from negative to moderate for both relationships and inbreeding coefficients, but were high between estimates from SNP and WGS data (0.49 to 0.99). Estimated relationships from genomic information exhibited higher variation than those from pedigree. Analysis of inbreeding coefficients showed that more complete pedigree records led to a higher correlation between inbreeding coefficients from pedigree and genomic data. Finally, estimated relationships and the correlations between additive genetic (A) and genomic (G) relationship matrices were lower, and variances of the relationships were larger, when allele frequencies were accounted for than when they were not.
Conclusions: Relationship and inbreeding coefficient estimates differed significantly depending on whether pedigree data or genomic information was used, and on whether variants with a MAF below 5% were included or excluded. Estimated relationships and inbreeding coefficients are the basis for selection decisions. Therefore, using WGS instead of SNP data can be expected to affect selection decisions. Inclusion of rare variants will give access to the variation they carry, which is of interest for conservation of genetic diversity.
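The MAF scenarios and the contrast between G matrices computed with and without allele frequencies can be illustrated with a minimal sketch. The abstract does not state which estimator the authors used; the code below assumes the widely used VanRaden-style formulation, G = ZZ' / (2 Σ p(1 − p)) with Z = M − 2p, and treats "without allele frequencies" as setting every p to 0.5. The genotypes are simulated, and the function names `maf`, `filter_by_maf` and `grm` are hypothetical helpers, not part of the study.

```python
import numpy as np

def maf(genotypes):
    """Minor allele frequency per variant from 0/1/2 allele counts."""
    p = genotypes.mean(axis=0) / 2.0
    return np.minimum(p, 1.0 - p)

def filter_by_maf(genotypes, low, high=0.5):
    """Keep variants with low < MAF <= high (one MAF restriction scenario)."""
    f = maf(genotypes)
    return genotypes[:, (f > low) & (f <= high)]

def grm(genotypes, use_frequencies=True):
    """Genomic relationship matrix, assuming a VanRaden-style estimator.

    With use_frequencies=False every allele frequency is set to 0.5,
    an assumed stand-in for "not accounting for allele frequencies".
    """
    n_variants = genotypes.shape[1]
    if use_frequencies:
        p = genotypes.mean(axis=0) / 2.0
    else:
        p = np.full(n_variants, 0.5)
    z = genotypes - 2.0 * p                      # centre allele counts
    return z @ z.T / (2.0 * np.sum(p * (1.0 - p)))

# Simulated data only: 50 individuals, 3000 variants, many of them rare.
rng = np.random.default_rng(0)
true_freq = rng.beta(0.5, 0.5, size=3000)
geno = rng.binomial(2, true_freq, size=(50, 3000)).astype(float)

common = filter_by_maf(geno, 0.05)           # MAF > 5%
above_1pct = filter_by_maf(geno, 0.01)       # MAF > 1%
rare = filter_by_maf(geno, 0.01, 0.05)       # 1% < MAF <= 5%

G_common = grm(common)                       # accounting for frequencies
G_common_flat = grm(common, use_frequencies=False)
G_above_1pct = grm(above_1pct)

# Compare the spread of off-diagonal relationships between scenarios.
off_diag = ~np.eye(geno.shape[0], dtype=bool)
print("var(G), MAF > 5%:", G_common[off_diag].var())
print("var(G), MAF > 1%:", G_above_1pct[off_diag].var())
print("var(G), MAF > 5%, p = 0.5:", G_common_flat[off_diag].var())
```

Because the simulated individuals are unrelated, the printed variances only show how the computation is set up; they say nothing about the values reported for the 118 Holstein bulls.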
Pages: 12