Fast and Accurate Approximation to Significance Tests in Genome-Wide Association Studies

Cited by: 10
Authors
Zhang, Yu [1]
Liu, Jun S. [2]
Affiliations
[1] Penn State Univ, Dept Stat, University Pk, PA 16803 USA
[2] Harvard Univ, Dept Stat, Cambridge, MA 02138 USA
Funding
Wellcome Trust (UK);
Keywords
Genome-wide association study; Multiple comparison; Poisson approximation; MULTIPLE; RECOMBINATION; MAP;
DOI
10.1198/jasa.2011.ap10657
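Chinese Library Classification (CLC) number
O21 [Probability theory and mathematical statistics]; C8 [Statistics];
Discipline classification code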
020208 ; 070103 ; 0714 ;
Abstract
Genome-wide association studies commonly involve simultaneous tests of millions of single nucleotide polymorphisms (SNPs) for disease association. The SNPs in nearby genomic regions, however, are often highly correlated due to linkage disequilibrium (LD, a genetic term for correlation). Simple Bonferroni correction for multiple comparisons is therefore too conservative. Permutation tests, which are often employed in practice, are both computationally expensive for genome-wide studies and limited in scope. We present an accurate and computationally efficient method, based on Poisson de-clumping heuristics, for approximating genome-wide significance of SNP associations. Compared with permutation tests and other multiple comparison adjustment approaches, our method computes the most accurate and robust p-value adjustments for millions of correlated comparisons within seconds. We demonstrate analytically that the accuracy and the efficiency of our method are nearly independent of the sample size, the number of SNPs, and the scale of p-values to be adjusted. In addition, our method can be easily adapted to estimate the false discovery rate. When applied to genome-wide SNP datasets, we observed highly variable p-value adjustment results evaluated from different genomic regions. The variation in adjustments along the genome, however, is well conserved between the European and the African populations. The p-value adjustments are significantly correlated with LD among SNPs, recombination rates, and SNP densities. Given the large variability of sequence features in the genome, we further discuss a novel approach of using SNP-specific (local) thresholds to detect genome-wide significant associations. This article has supplementary material online.
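The abstract's contrast between Bonferroni correction and a Poisson-based adjustment can be illustrated with a minimal sketch. This is not the authors' algorithm: it only shows the generic Poisson clumping idea, in which significant tests arrive in clumps and the genome-wide p-value is approximated by 1 - exp(-expected number of clumps). The function names and the `mean_clump_size` parameter are hypothetical; in the paper the de-clumping quantity is estimated from the data, whereas here it is simply supplied by the caller.

```python
import math


def bonferroni_adjust(p, n_tests):
    """Bonferroni bound on the genome-wide p-value: min(1, n_tests * p)."""
    return min(1.0, n_tests * p)


def poisson_clumping_adjust(p, n_tests, mean_clump_size=1.0):
    """Poisson clumping approximation to the genome-wide p-value.

    mean_clump_size is a hypothetical input (assumption, not from the
    source): the average number of correlated tests that reach
    significance together. A value of 1.0 corresponds to independent tests.
    """
    if mean_clump_size < 1.0:
        raise ValueError("mean_clump_size must be at least 1")
    expected_clumps = n_tests * p / mean_clump_size
    # 1 - exp(-x), computed stably for small x
    return -math.expm1(-expected_clumps)


if __name__ == "__main__":
    p, m = 1e-7, 1_000_000
    print("Bonferroni:          ", bonferroni_adjust(p, m))
    print("Poisson, independent:", poisson_clumping_adjust(p, m))
    print("Poisson, clump = 2:  ", poisson_clumping_adjust(p, m, 2.0))
```

With a mean clump size above 1, the approximated genome-wide p-value falls below the Bonferroni bound, which is the sense in which Bonferroni is too conservative for SNPs in LD.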
Pages: 846-857
Number of pages: 12