On set-based association tests: Insights from a regression using summary statistics

被引:2
作者
Zhao, Yanyan [1 ]
Sun, Lei [1 ,2 ]
机构
[1] Univ Toronto, Dept Stat Sci, Toronto, ON M5S 3G3, Canada
[2] Univ Toronto, Dalla Lana Sch Publ Hlth, Div Biostat, Toronto, ON M5T 3M7, Canada
来源
CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE | 2021年 / 49卷 / 03期
基金
加拿大自然科学与工程研究理事会; 加拿大健康研究院;
关键词
Correlation; regression; set‐ based tests; sparse alternatives; summary statistics; RARE-VARIANT ASSOCIATION; HIGHER CRITICISM; 2-SAMPLE TEST; METAANALYSIS; POWER; SNPS;
D O I
10.1002/cjs.11584
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Motivated by, but not limited to, association analyses of multiple genetic variants, we propose here a summary statistics-based regression framework. The proposed method requires only variant-specific summary statistics, and it unifies earlier methods based on individual-level data as special cases. The resulting score test statistic, derived from a linear mixed-effect regression model, inherently transforms the variant-specific statistics using the precision matrix to improve power for detecting sparse alternatives. Furthermore, the proposed method can incorporate additional variant-specific information with ease, facilitating omic-data integration. We study the asymptotic properties of the proposed tests under the null and alternatives, and we investigate efficient P-value calculation in finite samples. Finally, we provide supporting empirical evidence from extensive simulation studies and two applications.
引用
收藏
页码:754 / 770
页数:17
相关论文
共 32 条
  • [1] Genetic Analysis Workshop 17 mini-exome simulation
    Laura Almasy
    Thomas D Dyer
    Juan Manuel Peralta
    Jack W Kent
    Jac C Charlesworth
    Joanne E Curran
    John Blangero
    [J]. BMC Proceedings, 5 (Suppl 9)
  • [2] A map of human genome variation from population-scale sequencing
    Altshuler, David
    Durbin, Richard M.
    Abecasis, Goncalo R.
    Bentley, David R.
    Chakravarti, Aravinda
    Clark, Andrew G.
    Collins, Francis S.
    De la Vega, Francisco M.
    Donnelly, Peter
    Egholm, Michael
    Flicek, Paul
    Gabriel, Stacey B.
    Gibbs, Richard A.
    Knoppers, Bartha M.
    Lander, Eric S.
    Lehrach, Hans
    Mardis, Elaine R.
    McVean, Gil A.
    Nickerson, DebbieA.
    Peltonen, Leena
    Schafer, Alan J.
    Sherry, Stephen T.
    Wang, Jun
    Wilson, Richard K.
    Gibbs, Richard A.
    Deiros, David
    Metzker, Mike
    Muzny, Donna
    Reid, Jeff
    Wheeler, David
    Wang, Jun
    Li, Jingxiang
    Jian, Min
    Li, Guoqing
    Li, Ruiqiang
    Liang, Huiqing
    Tian, Geng
    Wang, Bo
    Wang, Jian
    Wang, Wei
    Yang, Huanming
    Zhang, Xiuqing
    Zheng, Huisong
    Lander, Eric S.
    Altshuler, David L.
    Ambrogio, Lauren
    Bloom, Toby
    Cibulskis, Kristian
    Fennell, Tim J.
    Gabriel, Stacey B.
    [J]. NATURE, 2010, 467 (7319) : 1061 - 1073
  • [3] [Anonymous], 2016, Econometrics Journal
  • [4] The Generalized Higher Criticism for Testing SNP-Set Effects in Genetic Association Studies
    Barnett, Ian
    Mukherjee, Rajarshi
    Lin, Xihong
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2017, 112 (517) : 64 - 76
  • [5] A comparison of statistical methods for meta-analysis
    Brockwell, SE
    Gordon, IR
    [J]. STATISTICS IN MEDICINE, 2001, 20 (06) : 825 - 840
  • [6] Two-sample test of high dimensional means under dependence
    Cai, T. Tony
    Liu, Weidong
    Xia, Yin
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2014, 76 (02) : 349 - 372
  • [7] MR-LDP: a two-sample Mendelian randomization for GWAS summary statistics accounting for linkage disequilibrium and horizontal pleiotropy
    Cheng, Qing
    Yang, Yi
    Shi, Xingjie
    Yeung, Kar-Fu
    Yang, Can
    Peng, Heng
    Liu, Jin
    [J]. NAR GENOMICS AND BIOINFORMATICS, 2020, 2 (02)
  • [8] Pooled Association Tests for Rare Genetic Variants: A Review and Some New Results
    Derkach, Andriy
    Lawless, Jerry F.
    Sun, Lei
    [J]. STATISTICAL SCIENCE, 2014, 29 (02) : 302 - 321
  • [9] Robust and Powerful Tests for Rare Variants Using Fisher's Method to Combine Evidence of Association From Two or More Complementary Tests
    Derkach, Andriy
    Lawless, Jerry F.
    Sun, Lei
    [J]. GENETIC EPIDEMIOLOGY, 2013, 37 (01) : 110 - 121
  • [10] Higher criticism for detecting sparse heterogeneous mixtures
    Donoho, D
    Jin, JS
    [J]. ANNALS OF STATISTICS, 2004, 32 (03) : 962 - 994