On Sample Size and Power Calculation for Variant Set-Based Association Tests

被引:5
|
作者
Wu, Baolin [1 ]
Pankow, James S. [2 ]
机构
[1] Univ Minnesota, Sch Publ Hlth, Div Biostat, A460 Mayo Bldg,MMC 303, Minneapolis, MN 55455 USA
[2] Univ Minnesota, Sch Publ Hlth, Div Epidemiol & Community Hlth, Minneapolis, MN 55455 USA
基金
美国国家卫生研究院;
关键词
Sample size; sequencing study; sequence kernel association test; FASTING GLUCOSE; COMMON DISEASES; KERNEL METHODS; RARE VARIANTS; HUMAN GENOME; SIMULATION; IMPACT; MODEL;
D O I
10.1111/ahg.12147
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Sample size and power calculations are an important part of designing new sequence-based association studies. The recently developed SEQPower and SPS programs adopted computationally intensive Monte Carlo simulations to empirically estimate power for a series of variant set association (VSA) test methods including the sequence kernel association test (SKAT). It is desirable to develop methods that can quickly and accurately compute power without intensive Monte Carlo simulations. We will show that the computed power for SKAT based on the existing analytical approach could be inflated especially for small significance levels, which are often of primary interest for large-scale whole genome and exome sequencing projects. We propose a new (2)-approximation-based approach to accurately and efficiently compute sample size and power. In addition, we propose and implement a more accurate exact method to compute power, which is more efficient than the Monte Carlo approach though generally involves more computations than the (2) approximation method. The exact approach could produce very accurate results and be used to verify alternative approximation approaches. We implement the proposed methods in publicly available R programs that can be readily adapted when planning sequencing projects.
引用
收藏
页码:136 / 143
页数:8
相关论文
共 50 条
  • [1] SPS: A Simulation Tool for Calculating Power of Set-Based Genetic Association Tests
    Li, Jiang
    Sham, Pak Chung
    Song, Youqiang
    Li, Miaoxin
    GENETIC EPIDEMIOLOGY, 2015, 39 (05) : 395 - 397
  • [2] An Exponential Combination Procedure for Set-Based Association Tests in Sequencing Studies
    Chen, Lin S.
    Hsu, Li
    Gamazon, Eric R.
    Cox, Nancy J.
    Nicolae, Dan L.
    AMERICAN JOURNAL OF HUMAN GENETICS, 2012, 91 (06) : 977 - 986
  • [3] Sample size, power concepts and sample size calculation
    Kilic, Selim
    JOURNAL OF MOOD DISORDERS, 2012, 2 (03) : 140 - 142
  • [4] Exemplary data set sample size calculation for Wilcoxon-Mann-Whitney tests
    Divine, George
    Kapke, Alissa
    Havstad, Suzanne
    Joseph, Christine L. M.
    STATISTICS IN MEDICINE, 2010, 29 (01) : 108 - 115
  • [5] Sample size calculation based on efficient unconditional tests for clinical trials with historical controls
    Shan, Guogen
    Moonie, Sheniz
    Shen, Jay
    JOURNAL OF BIOPHARMACEUTICAL STATISTICS, 2016, 26 (02) : 240 - 249
  • [6] Exact calculation of power and sample size in bioequivalence studies using two one-sided tests
    Shen, Meiyu
    Russek-Cohen, Estelle
    Slud, Eric V.
    PHARMACEUTICAL STATISTICS, 2015, 14 (02) : 95 - 101
  • [7] Power and sample size calculation for the additive hazard model
    Su, Pei-Fang
    JOURNAL OF BIOPHARMACEUTICAL STATISTICS, 2017, 27 (04) : 571 - 583
  • [8] Real world scenarios in rare variant association analysis: the impact of imbalance and sample size on the power in silico
    Zhang, Xinyuan
    Basile, Anna O.
    Pendergrass, Sarah A.
    Ritchie, Marylyn D.
    BMC BIOINFORMATICS, 2019, 20 (1)
  • [9] Sample Size Calculation in Genetic Association Studies: A Practical Approach
    Politi, Cristina
    Roumeliotis, Stefanos
    Tripepi, Giovanni
    Spoto, Belinda
    LIFE-BASEL, 2023, 13 (01):
  • [10] Power analysis and sample size estimation for sequence-based association studies
    Wang, Gao T.
    Li, Biao
    Santos-Cortez, Regie P. Lyn
    Peng, Bo
    Leal, Suzanne M.
    BIOINFORMATICS, 2014, 30 (16) : 2377 - 2378