Gene set analysis of genome-wide association studies: Methodological issues and perspectives

被引:143
作者
Wang, Lily [1 ]
Jia, Peilin [2 ,3 ]
Wolfinger, Russell D. [4 ]
Chen, Xi [1 ]
Zhao, Zhongming [2 ,3 ,5 ]
机构
[1] Vanderbilt Univ, Dept Biostat, Sch Med, Div Canc Biostat, Nashville, TN 37232 USA
[2] Vanderbilt Univ, Dept Biomed Informat, Sch Med, Nashville, TN 37232 USA
[3] Vanderbilt Univ, Dept Psychiat, Sch Med, Nashville, TN 37232 USA
[4] SAS Inst Inc, Cary, NC 27513 USA
[5] Vanderbilt Univ, Dept Canc Biol, Sch Med, Nashville, TN 37232 USA
关键词
Genome-wide association study; Gene set; Pathway; Gene-set enrichment analysis; Statistical significance; Complex disease; PATHWAY ANALYSIS; ENRICHMENT ANALYSIS; STATISTICAL-METHODS; TRUNCATED PRODUCT; FALSE DISCOVERY; DISEASE; SNP; KNOWLEDGE; COMMON; POLYMORPHISMS;
D O I
10.1016/j.ygeno.2011.04.006
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Recent studies have demonstrated that gene set analysis, which tests disease association with genetic variants in a group of functionally related genes, is a promising approach for analyzing and interpreting genome-wide association studies (GWAS) data. These approaches aim to increase power by combining association signals from multiple genes in the same gene set. I In addition, gene set analysis can also shed more light on the biological processes underlying complex diseases. However, current approaches for gene set analysis are still in an early stage of development in that analysis results are often prone to sources of bias, including gene set size and gene length, linkage disequilibrium patterns and the presence of overlapping genes. In this paper, we provide an in-depth review of the gene set analysis procedures, along with parameter choices and the particular methodology challenges at each stage. In addition to providing a survey of recently developed tools, we also classify the analysis methods into larger categories and discuss their strengths and limitations. In the last section, we outline several important areas for improving the analytical strategies in gene set analysis. (C) 2011 Elsevier Inc. All rights reserved.
引用
收藏
页码:1 / 8
页数:8
相关论文
共 94 条
  • [71] SNPtoGO:: characterizing SNPs by enriched GO terms
    Schwarz, Daniel F.
    Haedicke, Oliver
    Erdmann, Jeanette
    Ziegler, Andreas
    Bayer, Daniel
    Moeller, Steffen
    [J]. BIOINFORMATICS, 2008, 24 (01) : 146 - 148
  • [72] Common Inherited Variation in Mitochondrial Genes Is Not Enriched for Associations with Type 2 Diabetes or Related Glycemic Traits
    Segre, Ayellet V.
    Groop, Leif
    Mootha, Vamsi K.
    Daly, Mark J.
    Altshuler, David
    [J]. PLOS GENETICS, 2010, 6 (08):
  • [73] Imputation-based analysis of association studies: Candidate regions and quantitative traits
    Servin, Bertrand
    Stephens, Matthew
    [J]. PLOS GENETICS, 2007, 3 (07): : 1296 - 1308
  • [75] AN IMPROVED BONFERRONI PROCEDURE FOR MULTIPLE TESTS OF SIGNIFICANCE
    SIMES, RJ
    [J]. BIOMETRIKA, 1986, 73 (03) : 751 - 754
  • [76] Bayesian statistical methods for genetic association studies
    Stephens, Matthew
    Balding, David J.
    [J]. NATURE REVIEWS GENETICS, 2009, 10 (10) : 681 - 690
  • [77] Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles
    Subramanian, A
    Tamayo, P
    Mootha, VK
    Mukherjee, S
    Ebert, BL
    Gillette, MA
    Paulovich, A
    Pomeroy, SL
    Golub, TR
    Lander, ES
    Mesirov, JP
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2005, 102 (43) : 15545 - 15550
  • [78] Gene-environment-wide association studies: emerging approaches
    Thomas, Duncan
    [J]. NATURE REVIEWS GENETICS, 2010, 11 (04) : 259 - 272
  • [79] Discovering statistically significant pathways in expression profiling studies
    Tian, L
    Greenberg, SA
    Kong, SW
    Altschuler, J
    Kohane, IS
    Park, PJ
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2005, 102 (38) : 13544 - 13549
  • [80] Tintle Nathan L, 2009, BMC Proc, V3 Suppl 7, pS96