Improved Heritability Estimation from Genome-wide SNPs

Cited by: 452
Authors
Speed, Doug [1]
Hemani, Gibran [2]
Johnson, Michael R. [3]
Balding, David J. [1]
Affiliations
[1] UCL, Univ Coll London Genet Inst, London WC1E 6BT, England
[2] Univ Queensland, Diamantina Inst, Brisbane, Qld 4102, Australia
[3] Univ London Imperial Coll Sci Technol & Med, Div Brain Sci, London W6 8RF, England
Funding
UK Medical Research Council
Keywords
POPULATION-STRUCTURE; ASSOCIATION; ARCHITECTURE; SUSCEPTIBILITY; PROPORTION;
DOI
10.1016/j.ajhg.2012.10.010
Chinese Library Classification
Q3 [Genetics]
Discipline classification codes
071007; 090102
Abstract
Estimation of narrow-sense heritability, h², from genome-wide SNPs genotyped in unrelated individuals has recently attracted interest and offers several advantages over traditional pedigree-based methods. With the use of this approach, it has been estimated that over half the heritability of human height can be attributed to the ~300,000 SNPs on a genome-wide genotyping array. In comparison, only 5%-10% can be explained by SNPs reaching genome-wide significance. We investigated via simulation the validity of several key assumptions underpinning the mixed-model analysis used in SNP-based h² estimation. Although we found that the method is reasonably robust to violations of four key assumptions, it can be highly sensitive to uneven linkage disequilibrium (LD) between SNPs: contributions to h² are overestimated from causal variants in regions of high LD and are underestimated in regions of low LD. The overall direction of the bias can be up or down depending on the genetic architecture of the trait, but it can be substantial in realistic scenarios. We propose a modified kinship matrix in which SNPs are weighted according to local LD. We show that this correction greatly reduces the bias and increases the precision of h² estimates. We demonstrate the impact of our method on the first seven diseases studied by the Wellcome Trust Case Control Consortium. Our LD adjustment revises downward the h² estimate for immune-related diseases, as expected because of high LD in the major-histocompatibility region, but increases it for some nonimmune diseases. To calculate our revised kinship matrix, we developed LDAK, software for computing LD-adjusted kinships.
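The idea of a kinship matrix in which SNPs are weighted according to local LD can be illustrated with a minimal sketch. This is not LDAK's algorithm: the abstract does not say how the weights are computed, so the sketch assumes a simple stand-in rule in which each SNP's weight is the inverse of its summed squared correlation with nearby SNPs. The function names, the `window` parameter, and the toy genotypes are all hypothetical.

```python
import numpy as np

def standardize(genotypes):
    """Center and scale each SNP column to mean 0 and variance 1."""
    g = np.asarray(genotypes, dtype=float)
    std = g.std(axis=0)
    std[std == 0] = 1.0  # monomorphic SNPs become all-zero columns
    return (g - g.mean(axis=0)) / std

def local_ld_weights(x, window=50):
    """Assumed stand-in weighting (not LDAK's): inverse of the summed r^2
    between each SNP and the SNPs within `window` positions of it."""
    n, m = x.shape
    weights = np.zeros(m)
    for j in range(m):
        lo, hi = max(0, j - window), min(m, j + window + 1)
        r = x[:, lo:hi].T @ x[:, j] / n  # correlations, including with itself
        ld = np.sum(r ** 2)
        if ld > 0:  # monomorphic SNPs keep weight 0
            weights[j] = 1.0 / ld
    return weights

def weighted_kinship(x, weights):
    """K = X diag(w) X^T / sum(w); equal weights give the unweighted X X^T / m."""
    w = np.asarray(weights, dtype=float)
    return (x * w) @ x.T / w.sum()

# Toy data: one SNP copied four times (a high-LD block) plus one independent SNP.
rng = np.random.default_rng(0)
n = 200
snp_a = rng.binomial(2, 0.3, size=n)
snp_b = rng.binomial(2, 0.5, size=n)
genotypes = np.column_stack([snp_a, snp_a, snp_a, snp_a, snp_b])

x = standardize(genotypes)
weights = local_ld_weights(x)
kinship_unweighted = weighted_kinship(x, np.ones(x.shape[1]))
kinship_weighted = weighted_kinship(x, weights)
```

In the unweighted matrix the duplicated SNP contributes four of the five columns, so the signal it tags is counted four times. Under the assumed weighting, each copy receives roughly a quarter of the weight of the independent SNP, so the high-LD block and the independent SNP contribute about equally.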
Pages: 1011-1021
Number of pages: 11