Comparison of Methods for Competitive Tests of Pathway Analysis

被引:33
作者
Evangelou, Marina [1 ]
Rendon, Augusto [1 ,2 ,3 ]
Ouwehand, Willem H. [2 ,3 ,4 ]
Wernisch, Lorenz [1 ]
Dudbridge, Frank [5 ]
机构
[1] Inst Publ Hlth, MRC, Biostat Unit, Cambridge, England
[2] Univ Cambridge, Dept Haematol, Cambridge, England
[3] Natl Hlth Serv Blood & Transplant, Cambridge, England
[4] Wellcome Trust Sanger Inst, Cambridge, England
[5] London Sch Hyg & Trop Med, Fac Epidemiol & Populat Hlth, London WC1, England
来源
PLOS ONE | 2012年 / 7卷 / 07期
基金
英国医学研究理事会;
关键词
GENOMEWIDE ASSOCIATION; BIOLOGICAL PATHWAYS; MULTIPLE SNPS; GENE;
D O I
10.1371/journal.pone.0041018
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
It has been suggested that pathway analysis can complement single-SNP analysis in exploring genomewide association data. Pathway analysis incorporates the available biological knowledge of genes and SNPs and is expected to improve the chances of revealing the underlying genetic architecture of complex traits. Methods for pathway analysis can be classified as competitive (enrichment) or self-contained (association) according to the hypothesis tested. Although association tests are statistically more powerful than enrichment tests they can be difficult to calibrate because biases in analysis accumulate across multiple SNPs or genes. Furthermore, enrichment tests can be more scientifically relevant than association tests, as they detect pathways with relatively more evidence for association than the remaining genes. Here we show how some well known association tests can be simply adapted to test for enrichment, and compare their performance to some established enrichment tests. We propose versions of the Adaptive Rank Truncated Product (ARTP), Tail Strength Measure and Fisher's combination of p-values for testing the enrichment null hypothesis. We compare the behaviour of these proposed methods with the established Hypergeometric Test and Gene-Set Enrichment Analysis (GSEA). The results of the simulation study show that the modified version of the ARTP method has generally the best performance across the situations considered. The methods were also applied for finding enriched pathways for body mass index (BMI) and platelet function phenotypes. The pathway analysis of BMI identified the Vasoactive Intestinal Peptide pathway as significantly associated with BMI. This pathway has been previously reported as associated with BMI and the risk of obesity. The ARTP method was the method that identified the largest number of enriched pathways across all tested pathway databases and phenotypes. The simulation and data application results are in agreement with previous work on association tests and suggests that the ARTP should be preferred for both enrichment and association testing.
引用
收藏
页数:10
相关论文
共 29 条
  • [1] Analysis of multiple SNPs in a candidate gene or region
    Chapman, Juliet
    Whittaker, John
    [J]. GENETIC EPIDEMIOLOGY, 2008, 32 (06) : 560 - 566
  • [2] On the Utility of Gene Set Methods in Genomewide Association Studies of Quantitative Traits
    Chasman, Daniel I.
    [J]. GENETIC EPIDEMIOLOGY, 2008, 32 (07) : 658 - 668
  • [3] Prioritizing risk pathways: a novel association approach to searching for disease pathways fusing SNPs and pathways
    Chen, Lina
    Zhang, Liangcai
    Zhao, Yan
    Xu, Liangde
    Shang, Yukui
    Wang, Qian
    Li, Wan
    Wang, Hong
    Li, Xia
    [J]. BIOINFORMATICS, 2009, 25 (02) : 237 - 242
  • [4] Reactome: a database of reactions, pathways and biological processes
    Croft, David
    O'Kelly, Gavin
    Wu, Guanming
    Haw, Robin
    Gillespie, Marc
    Matthews, Lisa
    Caudy, Michael
    Garapati, Phani
    Gopinath, Gopal
    Jassal, Bijay
    Jupe, Steven
    Kalatskaya, Irina
    Mahajan, Shahana
    May, Bruce
    Ndegwa, Nelson
    Schmidt, Esther
    Shamovsky, Veronica
    Yung, Christina
    Birney, Ewan
    Hermjakob, Henning
    D'Eustachio, Peter
    Stein, Lincoln
    [J]. NUCLEIC ACIDS RESEARCH, 2011, 39 : D691 - D697
  • [5] A Critical Evaluation of Genomic Control Methods for Genetic Association Studies
    Dadd, Tony
    Weale, Michael E.
    Lewis, Cathryn M.
    [J]. GENETIC EPIDEMIOLOGY, 2009, 33 (04) : 290 - 298
  • [6] Day N, 1999, BRIT J CANCER, V80, P95
  • [7] Rank truncated product of P-values, with application to genomewide association scans
    Dudbridge, F
    Koeleman, BPC
    [J]. GENETIC EPIDEMIOLOGY, 2003, 25 (04) : 360 - 366
  • [8] Using Genome-Wide Pathway Analysis to Unravel the Etiology of Complex Diseases
    Elbers, Clara C.
    van Eijk, Kristel R.
    Franke, Lude
    Mulder, Flip
    van der Schouw, Yvonne T.
    Wijmenga, Cisca
    Onland-Moret, N. Charlotte
    [J]. GENETIC EPIDEMIOLOGY, 2009, 33 (05) : 419 - 431
  • [9] Self-Contained Gene-Set Analysis of Expression Data: An Evaluation of Existing and Novel Methods
    Fridley, Brooke L.
    Jenkins, Gregory D.
    Biernacka, Joanna M.
    [J]. PLOS ONE, 2010, 5 (09): : 1 - 9
  • [10] Resampling-based multiple testing for microarray data analysis
    Ge, YC
    Dudoit, S
    Speed, TP
    [J]. TEST, 2003, 12 (01) : 1 - 77