modEnrichr: a suite of gene set enrichment analysis tools for model organisms

被引:59
作者
Kuleshov, Maxim V. [1 ]
Diaz, Jennifer E. L. [2 ]
Flamholz, Zachary N. [1 ]
Keenan, Alexandra B. [1 ]
Lachmann, Alexander [1 ]
Wojciechowicz, Megan L. [1 ]
Cagan, Ross L. [2 ]
Ma'ayan, Avi [1 ]
机构
[1] Icahn Sch Med Mt Sinai, Mt Sinai Ctr Bioinformat, Dept Pharmacol Sci, One Gustave L Levy Pl,Box 1215, New York, NY 10029 USA
[2] Icahn Sch Med Mt Sinai, Dept Cell Dev & Regenerat Biol, One Gustave L Levy Pl,Box 1020, New York, NY 10029 USA
关键词
WEB SERVER; PREDICTION; ONTOLOGY; ANNOTATION; DATABASE; PROTEIN;
D O I
10.1093/nar/gkz347
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
High-throughput experiments produce increasingly large datasets that are difficult to analyze and integrate. While most data integration approaches focus on aligning metadata, data integration can be achieved by abstracting experimental results into gene sets. Such gene sets can be made available for reuse through gene set enrichment analysis tools such as Enrichr. Enrichr currently only supports gene sets compiled from human and mouse, limiting accessibility for investigators that study other model organisms. modEnrichr is an expansion of Enrichr for four model organisms: fish, fly, worm and yeast. The gene set libraries within FishEnrichr, FlyEnrichr, WormEnrichr and YeastEnrichr are created from the Gene Ontology, mRNA expression profiles, GeneRIF, pathway databases, protein domain databases and other organism-specific resources. Additionally, libraries were created by predicting gene function from RNA-seq co-expression data processed uniformly from the gene expression omnibus for each organism. The modEnrichr suite of tools provides the ability to convert gene lists across species using an ortholog conversion tool that automatically detects the species. For complex analyses, modEnrichr provides API access that enables submitting batch queries. In summary, modEnrichr leverages existing model organism databases and other resources to facilitate comprehensive hypothesis generation through data integration.
引用
收藏
页码:W183 / W190
页数:8
相关论文
共 51 条
[1]   BABELOMICS:: a systems biology perspective in the functional annotation of genome-scale experiments [J].
Al-Shahrour, Fatima ;
Minguez, Pablo ;
Tarraga, Joaquin ;
Montaner, David ;
Alloza, Eva ;
Vaquerizas, Juan M. ;
Conde, Lucia ;
Blaschke, Christian ;
Vera, Javier ;
Dopazo, Joaquin .
NUCLEIC ACIDS RESEARCH, 2006, 34 :W472-W476
[2]   Gene Ontology: tool for the unification of biology [J].
Ashburner, M ;
Ball, CA ;
Blake, JA ;
Botstein, D ;
Butler, H ;
Cherry, JM ;
Davis, AP ;
Dolinski, K ;
Dwight, SS ;
Eppig, JT ;
Harris, MA ;
Hill, DP ;
Issel-Tarver, L ;
Kasarskis, A ;
Lewis, S ;
Matese, JC ;
Richardson, JE ;
Ringwald, M ;
Rubin, GM ;
Sherlock, G .
NATURE GENETICS, 2000, 25 (01) :25-29
[3]   Hierarchical multi-label prediction of gene function [J].
Barutcuoglu, Z ;
Schapire, RE ;
Troyanskaya, OG .
BIOINFORMATICS, 2006, 22 (07) :830-836
[4]   Near-optimal probabilistic RNA-seq quantification (vol 34, pg 525, 2016) [J].
Bray, Nicolas L. ;
Pimentel, Harold ;
Melsted, Pall ;
Pachter, Lior .
NATURE BIOTECHNOLOGY, 2016, 34 (08) :888-888
[5]   AmiGO: online access to ontology and annotation data [J].
Carbon, Seth ;
Ireland, Amelia ;
Mungall, Christopher J. ;
Shu, ShengQiang ;
Marshall, Brad ;
Lewis, Suzanna .
BIOINFORMATICS, 2009, 25 (02) :288-289
[6]   Enrichr: interactive and collaborative HTML']HTML5 gene list enrichment analysis tool [J].
Chen, Edward Y. ;
Tan, Christopher M. ;
Kou, Yan ;
Duan, Qiaonan ;
Wang, Zichen ;
Meirelles, Gabriela Vaz ;
Clark, Neil R. ;
Ma'ayan, Avi .
BMC BIOINFORMATICS, 2013, 14
[7]   ToppGene Suite for gene list enrichment analysis and candidate gene prioritization [J].
Chen, Jing ;
Bardes, Eric E. ;
Aronow, Bruce J. ;
Jegga, Anil G. .
NUCLEIC ACIDS RESEARCH, 2009, 37 :W305-W311
[8]   SGD:: Saccharomyces Genome Database [J].
Cherry, JM ;
Adler, C ;
Ball, C ;
Chervitz, SA ;
Dwight, SS ;
Hester, ET ;
Jia, YK ;
Juvik, G ;
Roe, T ;
Schroeder, M ;
Weng, SA ;
Botstein, D .
NUCLEIC ACIDS RESEARCH, 1998, 26 (01) :73-79
[9]   Biopython']python: freely available Python']Python tools for computational molecular biology and bioinformatics [J].
Cock, Peter J. A. ;
Antao, Tiago ;
Chang, Jeffrey T. ;
Chapman, Brad A. ;
Cox, Cymon J. ;
Dalke, Andrew ;
Friedberg, Iddo ;
Hamelryck, Thomas ;
Kauff, Frank ;
Wilczynski, Bartek ;
de Hoon, Michiel J. L. .
BIOINFORMATICS, 2009, 25 (11) :1422-1423
[10]   The Pfam protein families database in 2019 [J].
El-Gebali, Sara ;
Mistry, Jaina ;
Bateman, Alex ;
Eddy, Sean R. ;
Luciani, Aurelien ;
Potter, Simon C. ;
Qureshi, Matloob ;
Richardson, Lorna J. ;
Salazar, Gustavo A. ;
Smart, Alfredo ;
Sonnhammer, Erik L. L. ;
Hirsh, Layla ;
Paladin, Lisanna ;
Piovesan, Damiano ;
Tosatto, Silvio C. E. ;
Finn, Robert D. .
NUCLEIC ACIDS RESEARCH, 2019, 47 (D1) :D427-D432