On Sample Size and Power Calculation for Variant Set-Based Association Tests

被引:5
|
作者
Wu, Baolin [1 ]
Pankow, James S. [2 ]
机构
[1] Univ Minnesota, Sch Publ Hlth, Div Biostat, A460 Mayo Bldg,MMC 303, Minneapolis, MN 55455 USA
[2] Univ Minnesota, Sch Publ Hlth, Div Epidemiol & Community Hlth, Minneapolis, MN 55455 USA
基金
美国国家卫生研究院;
关键词
Sample size; sequencing study; sequence kernel association test; FASTING GLUCOSE; COMMON DISEASES; KERNEL METHODS; RARE VARIANTS; HUMAN GENOME; SIMULATION; IMPACT; MODEL;
D O I
10.1111/ahg.12147
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Sample size and power calculations are an important part of designing new sequence-based association studies. The recently developed SEQPower and SPS programs adopted computationally intensive Monte Carlo simulations to empirically estimate power for a series of variant set association (VSA) test methods including the sequence kernel association test (SKAT). It is desirable to develop methods that can quickly and accurately compute power without intensive Monte Carlo simulations. We will show that the computed power for SKAT based on the existing analytical approach could be inflated especially for small significance levels, which are often of primary interest for large-scale whole genome and exome sequencing projects. We propose a new (2)-approximation-based approach to accurately and efficiently compute sample size and power. In addition, we propose and implement a more accurate exact method to compute power, which is more efficient than the Monte Carlo approach though generally involves more computations than the (2) approximation method. The exact approach could produce very accurate results and be used to verify alternative approximation approaches. We implement the proposed methods in publicly available R programs that can be readily adapted when planning sequencing projects.
引用
收藏
页码:136 / 143
页数:8
相关论文
共 50 条
  • [31] Set-Based Group Search Optimizer for Stochastic Many-Objective Optimal Power Flow
    Zheng, Jiehui
    Tao, Mingming
    Li, Zhigang
    Wu, Qinghua
    APPLIED SCIENCES-BASEL, 2023, 13 (18):
  • [32] A multistep approach to single nucleotide polymorphism–set analysis: an evaluation of power and type I error of gene-based tests of association after pathway-based association tests
    Alessandra Valcarcel
    Kelsey Grinde
    Kaitlyn Cook
    Alden Green
    Nathan Tintle
    BMC Proceedings, 10 (Suppl 7)
  • [33] Set-Based Rare Variant Expression Quantitative Trait Loci in Blood and Brain from Alzheimer Disease Study Participants
    Patel, Devanshi
    Zhang, Xiaoling
    Farrell, John J.
    Lunetta, Kathryn L.
    Farrer, Lindsay A.
    GENES, 2021, 12 (03)
  • [34] Power calculation for causal inference in social science: sample size and minimum detectable effect determination
    Djimeu, Eric W.
    Houndolo, Deo-Gracias
    JOURNAL OF DEVELOPMENT EFFECTIVENESS, 2016, 8 (04) : 508 - 527
  • [35] Efficient Variant Set Mixed Model Association Tests for Continuous and Binary Traits in Large-Scale Whole-Genome Sequencing Studies
    Chen, Han
    Huffman, Jennifer E.
    Brody, Jennifer A.
    Wang, Chaolong
    Lee, Seunggeun
    Li, Zilin
    Gogarten, Stephanie M.
    Sofer, Tamar
    Bielak, Lawrence F.
    Bis, Joshua C.
    Blangero, John
    Bowler, Russell P.
    Cade, Brian E.
    Cho, Michael H.
    Correa, Adolfo
    Curran, Joanne E.
    de Vries, Paul S.
    Glahn, David C.
    Guo, Xiuqing
    Johnson, Andrew D.
    Kardia, Sharon
    Kooperberg, Charles
    Lewis, Joshua P.
    Liu, Xiaoming
    Mathias, Rasika A.
    Mitchell, Braxton D.
    O'Connell, Jeffrey R.
    Peyser, Patricia A.
    Post, Wendy S.
    Reiner, Alex P.
    Rich, Stephen S.
    Rotter, Jerome I.
    Silverman, Edwin K.
    Smith, Jennifer A.
    Vasan, Ramachandran S.
    Wilson, James G.
    Yanek, Lisa R.
    Redline, Susan
    Smith, Nicholas L.
    Boerwinkle, Eric
    Borecki, Ingrid B.
    Cupples, L. Adrienne
    Laurie, Cathy C.
    Morrison, Alanna C.
    Rice, Kenneth M.
    Lin, Xihong
    AMERICAN JOURNAL OF HUMAN GENETICS, 2019, 104 (02) : 260 - 274
  • [36] Power and sample size calculation for neuroimaging studies by non-central random field theory
    Hayasaka, Satoru
    Peiffer, Ann M.
    Hugenschmidt, Christina E.
    Laurienti, Paul J.
    NEUROIMAGE, 2007, 37 (03) : 721 - 730
  • [37] Tutorial on Biostatistics: Sample Size and Power Calculation for Ophthalmic Studies With Correlated Binary Eye Outcomes
    Ying, Gui-Shuang
    Glynn, Robert J.
    Rosner, Bernard
    INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE, 2024, 65 (08)
  • [38] STATISTICAL TESTS OPTIMALLY MEETING CERTAIN FUZZY REQUIREMENTS ON THE POWER FUNCTION AND ON THE SAMPLE-SIZE
    ARNOLD, BF
    FUZZY SETS AND SYSTEMS, 1995, 75 (03) : 365 - 372
  • [39] Extending power and sample size approaches developed for McNemar's procedure to general sign tests
    Hoover, DR
    INTERNATIONAL STATISTICAL REVIEW, 2005, 73 (01) : 103 - 110
  • [40] Trend tests for case-control studies of genetic markers: Power, sample size and robustness
    Freidlin, B
    Zheng, G
    Li, ZH
    Gastwirth, JL
    HUMAN HEREDITY, 2002, 53 (03) : 146 - 152