A Comparison of Gene Set Analysis Methods in Terms of Sensitivity, Prioritization and Specificity

被引：136

作者：

Tarca, Adi L. ^{[1
,2
]}

Bhatti, Gaurav ^{[2
]}

Romero, Roberto ^{[2
,3
,4
]}

机构：

[1] Wayne State Univ, Dept Comp Sci, Detroit, MI 48202 USA

[2] NICHHD, Perinatol Res Branch, NIH, Rockville, MD USA

[3] Univ Michigan, Dept Obstet & Gynecol, Ann Arbor, MI 48109 USA

[4] Michigan State Univ, Dept Epidemiol & Biostat, E Lansing, MI 48824 USA

来源：

PLOS ONE | 2013年 / 8卷 / 11期

基金：

美国国家卫生研究院;

关键词：

EXPRESSION; ENRICHMENT; PATHWAYS; BIOLOGY;

D O I：

10.1371/journal.pone.0079217

中图分类号：

O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];

学科分类号：

07 ; 0710 ; 09 ;

摘要：

Identification of functional sets of genes associated with conditions of interest from omics data was first reported in 1999, and since, a plethora of enrichment methods were published for systematic analysis of gene sets collections including Gene Ontology and biological pathways. Despite their widespread usage in reducing the complexity of omics experiment results, their performance is poorly understood. Leveraging the existence of disease specific gene sets in KEGG and Metacore (R) databases, we compared the performance of sixteen methods under relaxed assumptions while using 42 real datasets (over 1,400 samples). Most of the methods ranked high the gene sets designed for specific diseases whenever samples from affected individuals were compared against controls via microarrays. The top methods for gene set prioritization were different from the top ones in terms of sensitivity, and four of the sixteen methods had large false positives rates assessed by permuting the phenotype of the samples. The best overall methods among those that generated reasonably low false positive rates, when permuting phenotypes, were PLAGE, GLOBALTEST, and PADOG. The best method in the category that generated higher than expected false positives was MRGSE.

引用

页数：10

共 50 条

[21] Prioritization of Susceptibility Genes for Ectopic Pregnancy by Gene Network Analysis
Liu, Ji-Long
Zhao, Miao
INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2016, 17 (02)
[22] CGPS: A machine learning-based approach integrating multiple gene set analysis tools for better prioritization of biologically relevant pathways
Ai, Chen
Kong, Lei
JOURNAL OF GENETICS AND GENOMICS, 2018, 45 (09) : 489 - 504
[23] Gene-set distance analysis (GSDA): a powerful tool for gene-set association analysis
Cao, Xueyuan
Pounds, Stan
BMC BIOINFORMATICS, 2021, 22 (01)
[24] Popularity and performance of bioinformatics software: the case of gene set analysis
Xie, Chengshu
Jauhari, Shaurya
Mora, Antonio
BMC BIOINFORMATICS, 2021, 22 (01)
[25] Gene set analysis of purine and pyrimidine antimetabolites cancer therapies
Fridley, Brooke L.
Batzler, Anthony
Li, Liang
Li, Fang
Matimba, Alice
Jenkins, Gregory D.
Ji, Yuan
Wang, Liewei
Weinshilboum, Richard M.
PHARMACOGENETICS AND GENOMICS, 2011, 21 (11) : 701 - 712
[26] Functional-Network-Based Gene Set Analysis Using Gene-Ontology
Chang, Billy
Kustra, Rafal
Tian, Weidong
PLOS ONE, 2013, 8 (02):
[27] Gene-set analysis and reduction
Dinu, Irina
Potter, John D.
Mueller, Thomas
Liu, Qi
Adewale, Adeniyi J.
Jhangri, Gian S.
Einecke, Gunilla
Famulski, Konrad S.
Halloran, Philip
Yasui, Yutaka
BRIEFINGS IN BIOINFORMATICS, 2009, 10 (01) : 24 - 34
[28] Disease and phenotype gene set analysis of disease-based gene expression in mouse and human
De, Supriyo
Zhang, Yongqing
Garner, John R.
Wang, S. Alex
Becker, Kevin G.
PHYSIOLOGICAL GENOMICS, 2010, 42A (02) : 162 - 167
[29] Predicted Arabidopsis Interactome Resource and Gene Set Linkage Analysis: A Transcriptomic Analysis Resource
Yao, Heng
Wang, Xiaoxuan
Chen, Pengcheng
Hai, Ling
Jin, Kang
Yao, Lixia
Mao, Chuanzao
Chen, Xin
PLANT PHYSIOLOGY, 2018, 177 (01) : 422 - 433
[30] GScluster: network-weighted gene-set clustering analysis
Yoon, Sora
Kim, Jinhwan
Kim, Seon-Kyu
Baik, Bukyung
Chi, Sang-Mun
Kim, Seon-Young
Nam, Dougu
BMC GENOMICS, 2019, 20 (1)

← 1 2 3 4 5 →