Underestimated Effect Sizes in GWAS: Fundamental Limitations of Single SNP Analysis for Dichotomous Phenotypes

被引:44
作者
Stringer, Sven [1 ]
Wray, Naomi R. [2 ,3 ]
Kahn, Rene S. [1 ]
Derks, Eske M. [1 ]
机构
[1] Univ Med Ctr Utrecht, Rudolf Magnus Inst Neurosci, Dept Psychiat, Utrecht, Netherlands
[2] Queensland Inst Med Res, Psychiat Genet Lab, Brisbane, Qld 4006, Australia
[3] Univ Queensland, Queensland Brain Inst, Brisbane, Qld, Australia
来源
PLOS ONE | 2011年 / 6卷 / 11期
关键词
GENOME-WIDE ASSOCIATION; INDIVIDUAL GENETIC RISK; LOGISTIC-REGRESSION; STATISTICAL-METHODS; COMPLEX DISEASES; COLLAPSIBILITY; SCHIZOPHRENIA; HERITABILITY; METAANALYSIS; PREDICTION;
D O I
10.1371/journal.pone.0027964
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Complex diseases are often highly heritable. However, for many complex traits only a small proportion of the heritability can be explained by observed genetic variants in traditional genome-wide association (GWA) studies. Moreover, for some of those traits few significant SNPs have been identified. Single SNP association methods test for association at a single SNP, ignoring the effect of other SNPs. We show using a simple multi-locus odds model of complex disease that moderate to large effect sizes of causal variants may be estimated as relatively small effect sizes in single SNP association testing. This underestimation effect is most severe for diseases influenced by numerous risk variants. We relate the underestimation effect to the concept of non-collapsibility found in the statistics literature. As described, continuous phenotypes generated with linear genetic models are not affected by this underestimation effect. Since many GWA studies apply single SNP analysis to dichotomous phenotypes, previously reported results potentially underestimate true effect sizes, thereby impeding identification of true effect SNPs. Therefore, when a multi-locus model of disease risk is assumed, a multi SNP analysis may be more appropriate.
引用
收藏
页数:7
相关论文
共 38 条
  • [31] A genome-wide association study for celiac disease identifies risk variants in the region harboring IL2 and IL21
    van Heel, David A.
    Franke, Lude
    Hunt, Karen A.
    Gwilliam, Rhian
    Zhernakova, Alexandra
    Inouye, Mike
    Wapenaar, Martin C.
    Barnardo, Martin C. N. M.
    Bethel, Graeme
    Holmes, Geoffrey K. T.
    Feighery, Con
    Jewell, Derek
    Kelleher, Dermot
    Kumar, Parveen
    Travis, Simon
    Walters, Julian R. F.
    Sanders, David S.
    Howdle, Peter
    Swift, Jill
    Playford, Raymond J.
    McLaren, William M.
    Mearin, M. Luisa
    Mulder, Chris J.
    McManus, Ross
    McGinnis, Ralph
    Cardon, Lon R.
    Deloukas, Panos
    Wijmenga, Cisca
    [J]. NATURE GENETICS, 2007, 39 (07) : 827 - 829
  • [32] Visscher PM, 2011, MOL PSYCHIATR, P1
  • [33] A simple Bayesian mixture model with a hybrid procedure for genome-wide association studies
    Wei, Yu-Chung
    Wen, Shu-Hui
    Chen, Pei-Chun
    Wang, Chih-Hao
    Hsiao, Chuhsing K.
    [J]. EUROPEAN JOURNAL OF HUMAN GENETICS, 2010, 18 (08) : 942 - 947
  • [34] Prediction of individual genetic risk of complex disease
    Wray, Naomi R.
    Goddard, Michael E.
    Visscher, Peter M.
    [J]. CURRENT OPINION IN GENETICS & DEVELOPMENT, 2008, 18 (03) : 257 - 263
  • [35] Prediction of individual genetic risk to disease from genome-wide association studies
    Wray, Naomi R.
    Goddard, Michael E.
    Visscher, Peter M.
    [J]. GENOME RESEARCH, 2007, 17 (10) : 1520 - 1528
  • [36] Wray NR, 2010, MOL PSYCHIATR, V37, P1
  • [37] Common SNPs explain a large proportion of the heritability for human height
    Yang, Jian
    Benyamin, Beben
    McEvoy, Brian P.
    Gordon, Scott
    Henders, Anjali K.
    Nyholt, Dale R.
    Madden, Pamela A.
    Heath, Andrew C.
    Martin, Nicholas G.
    Montgomery, Grant W.
    Goddard, Michael E.
    Visscher, Peter M.
    [J]. NATURE GENETICS, 2010, 42 (07) : 565 - U131
  • [38] The complex interplay among factors that influence allelic association
    Zondervan, KT
    Cardon, LR
    [J]. NATURE REVIEWS GENETICS, 2004, 5 (02) : 89 - U14