A Comparison of Gene Set Analysis Methods in Terms of Sensitivity, Prioritization and Specificity

被引:136
|
作者
Tarca, Adi L. [1 ,2 ]
Bhatti, Gaurav [2 ]
Romero, Roberto [2 ,3 ,4 ]
机构
[1] Wayne State Univ, Dept Comp Sci, Detroit, MI 48202 USA
[2] NICHHD, Perinatol Res Branch, NIH, Rockville, MD USA
[3] Univ Michigan, Dept Obstet & Gynecol, Ann Arbor, MI 48109 USA
[4] Michigan State Univ, Dept Epidemiol & Biostat, E Lansing, MI 48824 USA
来源
PLOS ONE | 2013年 / 8卷 / 11期
基金
美国国家卫生研究院;
关键词
EXPRESSION; ENRICHMENT; PATHWAYS; BIOLOGY;
D O I
10.1371/journal.pone.0079217
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Identification of functional sets of genes associated with conditions of interest from omics data was first reported in 1999, and since, a plethora of enrichment methods were published for systematic analysis of gene sets collections including Gene Ontology and biological pathways. Despite their widespread usage in reducing the complexity of omics experiment results, their performance is poorly understood. Leveraging the existence of disease specific gene sets in KEGG and Metacore (R) databases, we compared the performance of sixteen methods under relaxed assumptions while using 42 real datasets (over 1,400 samples). Most of the methods ranked high the gene sets designed for specific diseases whenever samples from affected individuals were compared against controls via microarrays. The top methods for gene set prioritization were different from the top ones in terms of sensitivity, and four of the sixteen methods had large false positives rates assessed by permuting the phenotype of the samples. The best overall methods among those that generated reasonably low false positive rates, when permuting phenotypes, were PLAGE, GLOBALTEST, and PADOG. The best method in the category that generated higher than expected false positives was MRGSE.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Gene set analysis methods: a systematic comparison
    Mathur, Ravi
    Rotroff, Daniel
    Ma, Jun
    Shojaie, Ali
    Motsinger-Reif, Alison
    BIODATA MINING, 2018, 11
  • [2] Size matters: how sample size affects the reproducibility and specificity of gene set analysis
    Maleki, Farhad
    Ovens, Katie
    McQuillan, Ian
    Kusalik, Anthony J.
    HUMAN GENOMICS, 2019, 13 (Suppl 1) : 42
  • [3] Sensitivity analysis of gene ranking methods in phenotype prediction
    deAndres-Galiana, Enrique J.
    Fernandez-Martinez, Juan L.
    Sonis, Stephen T.
    JOURNAL OF BIOMEDICAL INFORMATICS, 2016, 64 : 255 - 264
  • [4] Assessment of gene set analysis methods based on microarray data
    Alavi-Majd, Hamid
    Khodakarim, Soheila
    Zayeri, Farid
    Rezaei-Tavirani, Mostafa
    Tabatabaei, Seyyed Mohammad
    Heydarpour-Meymeh, Maryam
    GENE, 2014, 534 (02) : 383 - 389
  • [5] Statistical and Biological Evaluation of Different Gene Set Analysis Methods
    Cao, Wenjun
    Li, Yunming
    Liu, Danhong
    Chen, Changsheng
    Xu, Yongyong
    2011 INTERNATIONAL CONFERENCE ON ENVIRONMENT SCIENCE AND BIOTECHNOLOGY (ICESB 2011), 2011, 8 : 693 - 699
  • [6] Gene set analysis methods: statistical models and methodological differences
    Maciejewski, Henryk
    BRIEFINGS IN BIOINFORMATICS, 2014, 15 (04) : 504 - 518
  • [7] Characterization of Radiotherapy Sensitivity Genes by Comparative Gene Set Enrichment Analysis
    Zhu, Min
    Li, Xiaolai
    Wang, Shujie
    Guo, Wei
    Li, Xueling
    INTELLIGENT COMPUTING THEORIES AND APPLICATION, PT II, 2018, 10955 : 205 - 216
  • [8] Gene Set Overlap: An Impediment to Achieving High Specificity in Over-representation Analysis
    Maleki, Farhad
    Kusalik, Anthony J.
    PROCEEDINGS OF THE 12TH INTERNATIONAL JOINT CONFERENCE ON BIOMEDICAL ENGINEERING SYSTEMS AND TECHNOLOGIES, VOL 3 (BIOINFORMATICS), 2019, : 182 - 193
  • [9] Effect of the absolute statistic on gene-sampling gene-set analysis methods
    Nam, Dougu
    STATISTICAL METHODS IN MEDICAL RESEARCH, 2017, 26 (03) : 1248 - 1260
  • [10] Comparison of Univariate and Multivariate Gene Set Analysis in Acute Lymphoblastic Leukemia
    Soheila, Khodakarim
    Hamid, AlaviMajd
    Farid, Zayeri
    Mostafa, Rezaei-Tavirani
    Nasrin, Dehghan-Nayeri
    Syyed-Mohammad, Tabatabaee
    Vahide, Tajalli
    ASIAN PACIFIC JOURNAL OF CANCER PREVENTION, 2013, 14 (03) : 1629 - 1633