On Sample Size and Power Calculation for Variant Set-Based Association Tests

被引:5
|
作者
Wu, Baolin [1 ]
Pankow, James S. [2 ]
机构
[1] Univ Minnesota, Sch Publ Hlth, Div Biostat, A460 Mayo Bldg,MMC 303, Minneapolis, MN 55455 USA
[2] Univ Minnesota, Sch Publ Hlth, Div Epidemiol & Community Hlth, Minneapolis, MN 55455 USA
基金
美国国家卫生研究院;
关键词
Sample size; sequencing study; sequence kernel association test; FASTING GLUCOSE; COMMON DISEASES; KERNEL METHODS; RARE VARIANTS; HUMAN GENOME; SIMULATION; IMPACT; MODEL;
D O I
10.1111/ahg.12147
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Sample size and power calculations are an important part of designing new sequence-based association studies. The recently developed SEQPower and SPS programs adopted computationally intensive Monte Carlo simulations to empirically estimate power for a series of variant set association (VSA) test methods including the sequence kernel association test (SKAT). It is desirable to develop methods that can quickly and accurately compute power without intensive Monte Carlo simulations. We will show that the computed power for SKAT based on the existing analytical approach could be inflated especially for small significance levels, which are often of primary interest for large-scale whole genome and exome sequencing projects. We propose a new (2)-approximation-based approach to accurately and efficiently compute sample size and power. In addition, we propose and implement a more accurate exact method to compute power, which is more efficient than the Monte Carlo approach though generally involves more computations than the (2) approximation method. The exact approach could produce very accurate results and be used to verify alternative approximation approaches. We implement the proposed methods in publicly available R programs that can be readily adapted when planning sequencing projects.
引用
收藏
页码:136 / 143
页数:8
相关论文
共 50 条
  • [41] SEAGLE: A Scalable Exact Algorithm for Large-Scale Set-Based Gene-Environment Interaction Tests in Biobank Data
    Chi, Jocelyn T.
    Ipsen, Ilse C. F.
    Hsiao, Tzu-Hung
    Lin, Ching-Heng
    Wang, Li-San
    Lee, Wan-Ping
    Lu, Tzu-Pin
    Tzeng, Jung-Ying
    FRONTIERS IN GENETICS, 2021, 12
  • [42] Nomogram for sample size calculation in assessing validity of a new method based on a regression line
    Hong, Hyunsook
    Hahn, Seokyoung
    Kim, Ho
    Choi, Yunhee
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2023, 52 (16) : 5900 - 5909
  • [43] The power comparison of the haplotype-based collapsing tests and the variant-based collapsing tests for detecting rare variants in pedigrees
    Wei Guo
    Yin Yao Shugart
    BMC Genomics, 15
  • [44] The power comparison of the haplotype-based collapsing tests and the variant-based collapsing tests for detecting rare variants in pedigrees
    Guo, Wei
    Shugart, Yin Yao
    BMC GENOMICS, 2014, 15
  • [45] Sample size calculation for clinical trials with correlated count measurements based on the negative binomial distribution
    Li, Dateng
    Zhang, Song
    Cao, Jing
    STATISTICS IN MEDICINE, 2019, 38 (28) : 5413 - 5427
  • [46] Sample size calculation based on precision for pilot sequential multiple assignment randomized trial (SMART)
    Yan, Xiaoxi
    Ghosh, Palash
    Chakraborty, Bibhas
    BIOMETRICAL JOURNAL, 2021, 63 (02) : 247 - 271
  • [47] A new approach for sample size calculation in cost-effectiveness studies based on value of information
    Bader, Clement
    Cossin, Sebastien
    Maillard, Aline
    Benard, Antoine
    BMC MEDICAL RESEARCH METHODOLOGY, 2018, 18
  • [48] A new approach for sample size calculation in cost-effectiveness studies based on value of information
    Clément Bader
    Sébastien Cossin
    Aline Maillard
    Antoine Bénard
    BMC Medical Research Methodology, 18
  • [49] Power and sample size calculation for the win odds test: application to an ordinal endpoint in COVID-19 trials
    Gasparyan, Samvel B.
    Kowalewski, Elaine K.
    Folkvaljon, Folke
    Bengtsson, Olof
    Buenconsejo, Joan
    Adler, John
    Koch, Gary G.
    JOURNAL OF BIOPHARMACEUTICAL STATISTICS, 2021, 31 (06) : 765 - 787
  • [50] A Comparison of Sample Size and Power in Case-Only Association Studies of Gene-Environment Interaction
    Clarke, Geraldine M.
    Morris, Andrew P.
    AMERICAN JOURNAL OF EPIDEMIOLOGY, 2010, 171 (04) : 498 - 505