A Comparison of Gene Set Analysis Methods in Terms of Sensitivity, Prioritization and Specificity

Cited by: 136
Authors
Tarca, Adi L. [1, 2]
Bhatti, Gaurav [2]
Romero, Roberto [2, 3, 4]
Affiliations
[1] Wayne State Univ, Dept Comp Sci, Detroit, MI 48202 USA
[2] NICHHD, Perinatol Res Branch, NIH, Rockville, MD USA
[3] Univ Michigan, Dept Obstet & Gynecol, Ann Arbor, MI 48109 USA
[4] Michigan State Univ, Dept Epidemiol & Biostat, E Lansing, MI 48824 USA
Source
PLOS ONE | 2013 / Vol. 8 / Issue 11
Funding
National Institutes of Health (USA);
Keywords
EXPRESSION; ENRICHMENT; PATHWAYS; BIOLOGY;
DOI
10.1371/journal.pone.0079217
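Chinese Library Classification number
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences];
Discipline classification code
07; 0710; 09;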
Abstract
Identification of functional sets of genes associated with conditions of interest from omics data was first reported in 1999, and since then a plethora of enrichment methods have been published for the systematic analysis of gene set collections, including Gene Ontology and biological pathways. Despite the widespread use of these methods to reduce the complexity of omics experiment results, their performance is poorly understood. Leveraging the existence of disease-specific gene sets in the KEGG and Metacore® databases, we compared the performance of sixteen methods under relaxed assumptions using 42 real datasets (over 1,400 samples). Most of the methods ranked the gene sets designed for specific diseases highly whenever samples from affected individuals were compared against controls via microarrays. The top methods for gene set prioritization differed from the top methods in terms of sensitivity, and four of the sixteen methods had large false positive rates, as assessed by permuting the phenotype of the samples. The best overall methods among those that generated reasonably low false positive rates when permuting phenotypes were PLAGE, GLOBALTEST, and PADOG. The best method in the category that generated higher than expected false positives was MRGSE.
Pages: 10