Fast and Accurate Approximation to Significance Tests in Genome-Wide Association Studies

Cited by: 10
Authors
Zhang, Yu [1]
Liu, Jun S. [2]
Affiliations
[1] Penn State Univ, Dept Stat, University Pk, PA 16803 USA
[2] Harvard Univ, Dept Stat, Cambridge, MA 02138 USA
Funding
Wellcome Trust (UK)
Keywords
Genome-wide association study; Multiple comparison; Poisson approximation; MULTIPLE; RECOMBINATION; MAP
DOI
10.1198/jasa.2011.ap10657
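Chinese Library Classification (CLC) codes
O21 [Probability theory and mathematical statistics]; C8 [Statistics]
Discipline classification codes
020208; 070103; 0714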
Abstract
Genome-wide association studies commonly involve simultaneous tests of millions of single nucleotide polymorphisms (SNPs) for disease association. The SNPs in nearby genomic regions, however, are often highly correlated due to linkage disequilibrium (LD, a genetic term for correlation). Simple Bonferroni correction for multiple comparisons is therefore too conservative. Permutation tests, which are often employed in practice, are both computationally expensive for genome-wide studies and limited in their scope. We present an accurate and computationally efficient method, based on Poisson de-clumping heuristics, for approximating genome-wide significance of SNP associations. Compared with permutation tests and other multiple comparison adjustment approaches, our method computes the most accurate and robust p-value adjustments for millions of correlated comparisons within seconds. We demonstrate analytically that the accuracy and the efficiency of our method are nearly independent of the sample size, the number of SNPs, and the scale of p-values to be adjusted. In addition, our method can be easily adapted to estimate the false discovery rate. When applied to genome-wide SNP datasets, we observed highly variable p-value adjustment results evaluated from different genomic regions. The variation in adjustments along the genome, however, is well conserved between the European and the African populations. The p-value adjustments are significantly correlated with LD among SNPs, recombination rates, and SNP densities. Given the large variability of sequence features in the genome, we further discuss a novel approach of using SNP-specific (local) thresholds to detect genome-wide significant associations. This article has supplementary material online.
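Illustrative sketch (not part of the original record): the abstract's point that Bonferroni correction is too conservative for correlated tests can be checked with a small simulation. The code below is not the authors' Poisson de-clumping method. It assumes an AR(1) correlation between neighbouring z-scores as a crude stand-in for LD, and every function name and parameter value is hypothetical. It simulates the null distribution of the largest |z| across m tests, then compares the family-wise error rate at the Bonferroni threshold with and without correlation.

```python
import random
from statistics import NormalDist


def simulate_max_abs_z(m, rho, n_sim, seed=0):
    """Null distribution of max |z| over m tests with AR(1) correlation rho.

    Assumption: AR(1) dependence is only a toy model of LD between SNPs.
    """
    rng = random.Random(seed)
    scale = (1.0 - rho * rho) ** 0.5  # keeps each z marginally N(0, 1)
    maxima = []
    for _ in range(n_sim):
        z = rng.gauss(0.0, 1.0)
        best = abs(z)
        for _ in range(m - 1):
            z = rho * z + scale * rng.gauss(0.0, 1.0)
            best = max(best, abs(z))
        maxima.append(best)
    return maxima


def bonferroni_z_threshold(m, alpha):
    """Two-sided |z| cutoff corresponding to a per-test level of alpha / m."""
    return NormalDist().inv_cdf(1.0 - alpha / (2.0 * m))


def familywise_error(maxima, threshold):
    """Fraction of simulated null data sets with at least one |z| above threshold."""
    return sum(x > threshold for x in maxima) / len(maxima)


def empirical_threshold(maxima, alpha):
    """Simulation-based |z| cutoff: the (1 - alpha) quantile of max |z|."""
    ordered = sorted(maxima)
    index = min(int((1.0 - alpha) * len(ordered)), len(ordered) - 1)
    return ordered[index]


if __name__ == "__main__":
    m, alpha = 200, 0.05
    cutoff = bonferroni_z_threshold(m, alpha)
    for rho in (0.0, 0.98):
        maxima = simulate_max_abs_z(m, rho, n_sim=3000, seed=1)
        print(
            f"rho={rho}: FWER at Bonferroni cutoff = "
            f"{familywise_error(maxima, cutoff):.3f}, "
            f"simulated cutoff = {empirical_threshold(maxima, alpha):.2f} "
            f"(Bonferroni cutoff = {cutoff:.2f})"
        )
```

Under strong correlation the error rate at the Bonferroni cutoff falls well below the nominal level, so a less stringent cutoff would still control the family-wise error rate. Simulating the null in this way plays the role that permutation plays in practice, and its cost grows with the number of tests, which is the burden the abstract says the proposed approximation avoids.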
Pages: 846-857
Page count: 12