Multiple comparisons in genetic association studies: a hierarchical modeling approach

被引:8
作者
Yi, Nengjun [1 ]
Xu, Shizhong [2 ]
Lou, Xiang-Yang [1 ]
Mallick, Himel [1 ]
机构
[1] Univ Alabama Birmingham, Dept Biostat, Sect Stat Genet, Birmingham, AL 35294 USA
[2] Univ Calif Riverside, Dept Bot & Plant Sci, Riverside, CA 92521 USA
关键词
Bayesian inference; effective number of parameters; effective number of hypothesis tests; generalized linear models; genetic association studies; hierarchical modeling; hierarchical Bonferroni correction; multiple comparisons; FALSE DISCOVERY RATE; BAYESIAN-ANALYSIS; RARE VARIANTS; REGRESSION; COMPLEXITY;
D O I
10.1515/sagmb-2012-0040
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Multiple comparisons or multiple testing has been viewed as a thorny issue in genetic association studies aiming to detect disease-associated genetic variants from a large number of genotyped variants. We alleviate the problem of multiple comparisons by proposing a hierarchical modeling approach that is fundamentally different from the existing methods. The proposed hierarchical models simultaneously fit as many variables as possible and shrink unimportant effects towards zero. Thus, the hierarchical models yield more efficient estimates of parameters than the traditional methods that analyze genetic variants separately, and also coherently address the multiple comparisons problem due to largely reducing the effective number of genetic effects and the number of statistically "significant" effects. We develop a method for computing the effective number of genetic effects in hierarchical generalized linear models, and propose a new adjustment for multiple comparisons, the hierarchical Bonferroni correction, based on the effective number of genetic effects. Our approach not only increases the power to detect disease-associated variants but also controls the Type I error. We illustrate and evaluate our method with real and simulated data sets from genetic association studies. The method has been implemented in our freely available R package BhGLM (http://www.ssg.uab.edu/bhglm/).
引用
收藏
页码:35 / 48
页数:14
相关论文
共 43 条
  • [1] [Anonymous], 2003, Bayesian Data Analysis
  • [2] [Anonymous], 2011, Data analysis using regression and multilevel/hierarchical models
  • [3] [Anonymous], 1996, MULTIPLE COMP THEORY, DOI DOI 10.1201/B15074
  • [4] [Anonymous], 1983, Generalized Linear Models
  • [5] Armagan A., 2010, Biometrika
  • [6] A tutorial on statistical methods for population association studies
    Balding, David J.
    [J]. NATURE REVIEWS GENETICS, 2006, 7 (10) : 781 - 791
  • [7] Benjamini Y, 2001, ANN STAT, V29, P1165
  • [8] Quantitative trait loci analysis using the false discovery rate
    Benjamini, Y
    Yekutieli, D
    [J]. GENETICS, 2005, 171 (02) : 783 - 789
  • [9] A New Measure of the Effective Number of Tests, a Practical Tool for Comparing Families of Non-Independent Significance Tests
    Galwey, Nicholas W.
    [J]. GENETIC EPIDEMIOLOGY, 2009, 33 (07) : 559 - 568
  • [10] A multiple testing correction method for genetic association studies using correlated single nucleotide polymorphisms
    Gao, Xiaoyi
    Stamier, Joshua
    Martin, Eden R.
    [J]. GENETIC EPIDEMIOLOGY, 2008, 32 (04) : 361 - 369