Robust and Powerful Tests for Rare Variants Using Fisher's Method to Combine Evidence of Association From Two or More Complementary Tests

Cited by: 66
Authors
Derkach, Andriy [2]
Lawless, Jerry F. [1, 3]
Sun, Lei [1, 2]
Affiliations
[1] Univ Toronto, Dalla Lana Sch Publ Hlth, Div Biostat, Toronto, ON M5T 3M7, Canada
[2] Univ Toronto, Dept Stat, Toronto, ON M5T 3M7, Canada
[3] Univ Waterloo, Dept Stat & Actuarial Sci, Waterloo, ON N2L 3G1, Canada
Funding
Natural Sciences and Engineering Research Council of Canada; Canadian Institutes of Health Research
Keywords
robust methods; Fisher's method; rare variants; complex traits; next-generation sequencing; 1000 genome project; DISEASE ASSOCIATION; COMMON DISEASES; SEQUENCING DATA;
DOI
10.1002/gepi.21689
Chinese Library Classification number
Q3 [Genetics]
Subject classification codes
071007; 090102
Abstract
Many association tests have been proposed for rare variants, but the choice of a powerful test is uncertain when there is limited information on the underlying genetic model. Proposed methods use either linear statistics, which are powerful when most variants are causal and have the same direction of effect, or quadratic statistics, which are more powerful in other scenarios. To achieve robustness, it is natural to combine the evidence of association from two or more complementary tests. To this end, we consider the minimum-p and Fisher's methods of combining P-values from linear and quadratic statistics. Extensive simulation studies show that both methods are robust across models with varying proportions of causal, deleterious, and protective rare variants, allele frequencies, and effect sizes. When the majority (>75%) of the causal effects are in the same direction (deleterious or protective), Fisher's method consistently outperforms the minimum-p and the individual linear and quadratic tests, as well as the optimal sequence kernel association test, SKAT-O. When the individual test has moderate power, Fisher's test has improved power for 90% of the 5000 models considered, with >20% relative efficiency gain for 40% of the models. The maximum absolute power loss is 8% for the remaining 10% of the models. An application to the GAW17 quantitative trait Q2 data based on sequence data of the 1000 Genomes Project shows that, compared with linear and quadratic tests, Fisher's test has comparable power for all 13 functional genes and provides the best power for more than half of them.
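The two combination rules named in the abstract can be illustrated with a minimal sketch. It assumes the P-values being combined are independent under the null hypothesis, which is the textbook setting for both rules and may not match how the authors calibrate their tests. The function names `fisher_combine` and `min_p_combine` are hypothetical and chosen only for this illustration. Fisher's statistic is -2 times the sum of the log P-values, referred to a chi-square distribution with 2k degrees of freedom; the minimum-p rule adjusts the smallest P-value for having taken the minimum of k values.

```python
import math


def fisher_combine(p_values):
    """Fisher's method for k P-values, ASSUMING they are independent.

    The statistic -2 * sum(log p_i) is chi-square with 2k degrees of
    freedom under the null. For even degrees of freedom the upper tail
    has a closed form, so no external library is needed.
    """
    k = len(p_values)
    if k == 0 or any(not (0.0 < p <= 1.0) for p in p_values):
        raise ValueError("need at least one P-value, each in (0, 1]")
    half_stat = -sum(math.log(p) for p in p_values)  # statistic / 2
    # Upper tail: exp(-x/2) * sum_{i=0}^{k-1} (x/2)^i / i!
    term = 1.0
    total = 1.0
    for i in range(1, k):
        term *= half_stat / i
        total += term
    return min(1.0, math.exp(-half_stat) * total)


def min_p_combine(p_values):
    """Minimum-p rule for k P-values, ASSUMING they are independent.

    Under the null, the minimum of k independent uniform P-values m
    satisfies P(min <= m) = 1 - (1 - m)^k.
    """
    k = len(p_values)
    if k == 0 or any(not (0.0 <= p <= 1.0) for p in p_values):
        raise ValueError("need at least one P-value, each in [0, 1]")
    return 1.0 - (1.0 - min(p_values)) ** k


if __name__ == "__main__":
    # Hypothetical P-values from a linear and a quadratic test.
    p_linear, p_quadratic = 0.04, 0.03
    print(fisher_combine([p_linear, p_quadratic]))
    print(min_p_combine([p_linear, p_quadratic]))
```

With two P-values, Fisher's combined P-value reduces to p1 * p2 * (1 - ln(p1 * p2)), so moderate evidence from both tests reinforces each other, whereas the minimum-p rule uses only the stronger of the two signals. If the component tests are correlated, neither formula applies as written and a permutation or other calibration would be needed.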
Pages: 110-121
Number of pages: 12