eggNOG-mapper v2: Functional Annotation, Orthology Assignments, and Domain Prediction at the Metagenomic Scale

被引:2324
作者
Cantalapiedra, Carlos P. [1 ]
Hernandez-Plaza, Ana [1 ]
Letunic, Ivica [2 ]
Bork, Peer [3 ,4 ,5 ]
Huerta-Cepas, Jaime [1 ]
机构
[1] Univ Politecn Madrid UPM, Ctr Biotecnol & Genom Plantas, Inst Nacl Invest & Tecnol Agr & Alimentaria INIA, Campus Montegancedo UPM, Madrid, Spain
[2] Biobyte Solut GmbH, Heidelberg, Germany
[3] European Mol Biol Lab, Struct & Computat Biol Unit, Heidelberg, Germany
[4] Univ Wurzburg, Bioctr, Dept Bioinformat, Wurzburg, Germany
[5] Yonsei Univ, Yonsei Frontier Lab YFL, Seoul, South Korea
基金
欧洲研究理事会;
关键词
metagenomics; functional annotation; computational genomics; bioinformatics; DATABASE;
D O I
10.1093/molbev/msab293
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Even though automated functional annotation of genes represents a fundamental step in most genomic and metagenomic workflows, it remains challenging at large scales. Here, we describe a major upgrade to eggNOG-mapper, a tool for functional annotation based on precomputed orthology assignments, now optimized for vast (meta)genomic data sets. Improvements in version 2 include a full update of both the genomes and functional databases to those from eggNOG v5, as well as several efficiency enhancements and new features. Most notably, eggNOG-mapper v2 now allows for: 1) de novo gene prediction from raw contigs, 2) built-in pairwise orthology prediction, 3) fast protein domain discovery, and 4) automated GFF decoration. eggNOG-mapper v2 is available as a standalone tool or as an online service at http://eggnogmapper.embl.de.
引用
收藏
页码:5825 / 5829
页数:5
相关论文
共 23 条
[1]   A unified catalog of 204,938 reference genomes from the human gut microbiome [J].
Almeida, Alexandre ;
Nayfach, Stephen ;
Boland, Miguel ;
Strozzi, Francesco ;
Beracochea, Martin ;
Shi, Zhou Jason ;
Pollard, Katherine S. ;
Sakharova, Ekaterina ;
Parks, Donovan H. ;
Hugenholtz, Philip ;
Segata, Nicola ;
Kyrpides, Nikos C. ;
Finn, Robert D. .
NATURE BIOTECHNOLOGY, 2021, 39 (01) :105-114
[2]   UniProt: the universal protein knowledgebase in 2021 [J].
Bateman, Alex ;
Martin, Maria-Jesus ;
Orchard, Sandra ;
Magrane, Michele ;
Agivetova, Rahat ;
Ahmad, Shadab ;
Alpi, Emanuele ;
Bowler-Barnett, Emily H. ;
Britto, Ramona ;
Bursteinas, Borisas ;
Bye-A-Jee, Hema ;
Coetzee, Ray ;
Cukura, Austra ;
Da Silva, Alan ;
Denny, Paul ;
Dogan, Tunca ;
Ebenezer, ThankGod ;
Fan, Jun ;
Castro, Leyla Garcia ;
Garmiri, Penelope ;
Georghiou, George ;
Gonzales, Leonardo ;
Hatton-Ellis, Emma ;
Hussein, Abdulrahman ;
Ignatchenko, Alexandr ;
Insana, Giuseppe ;
Ishtiaq, Rizwan ;
Jokinen, Petteri ;
Joshi, Vishal ;
Jyothi, Dushyanth ;
Lock, Antonia ;
Lopez, Rodrigo ;
Luciani, Aurelien ;
Luo, Jie ;
Lussi, Yvonne ;
Mac-Dougall, Alistair ;
Madeira, Fabio ;
Mahmoudy, Mahdi ;
Menchi, Manuela ;
Mishra, Alok ;
Moulang, Katie ;
Nightingale, Andrew ;
Oliveira, Carla Susana ;
Pundir, Sangya ;
Qi, Guoying ;
Raj, Shriya ;
Rice, Daniel ;
Lopez, Milagros Rodriguez ;
Saidi, Rabie ;
Sampson, Joseph .
NUCLEIC ACIDS RESEARCH, 2021, 49 (D1) :D480-D489
[3]   The InterPro protein families and domains database: 20 years on [J].
Blum, Matthias ;
Chang, Hsin-Yu ;
Chuguransky, Sara ;
Grego, Tiago ;
Kandasaamy, Swaathi ;
Mitchell, Alex ;
Nuka, Gift ;
Paysan-Lafosse, Typhaine ;
Qureshi, Matloob ;
Raj, Shriya ;
Richardson, Lorna ;
Salazar, Gustavo A. ;
Williams, Lowri ;
Bork, Peer ;
Bridge, Alan ;
Gough, Julian ;
Haft, Daniel H. ;
Letunic, Ivica ;
Marchler-Bauer, Aron ;
Mi, Huaiyu ;
Natale, Darren A. ;
Necci, Marco ;
Orengo, Christine A. ;
Pandurangan, Arun P. ;
Rivoire, Catherine ;
Sigrist, Christian J. A. ;
Sillitoe, Ian ;
Thanki, Narmada ;
Thomas, Paul D. ;
Tosatto, Silvio C. E. ;
Wu, Cathy H. ;
Bateman, Alex ;
Finn, Robert D. .
NUCLEIC ACIDS RESEARCH, 2021, 49 (D1) :D344-D354
[4]   The Gene Ontology Resource: 20 years and still GOing strong [J].
Carbon, S. ;
Douglass, E. ;
Dunn, N. ;
Good, B. ;
Harris, N. L. ;
Lewis, S. E. ;
Mungall, C. J. ;
Basu, S. ;
Chisholm, R. L. ;
Dodson, R. J. ;
Hartline, E. ;
Fey, P. ;
Thomas, P. D. ;
Albou, L. P. ;
Ebert, D. ;
Kesling, M. J. ;
Mi, H. ;
Muruganujian, A. ;
Huang, X. ;
Poudel, S. ;
Mushayahama, T. ;
Hu, J. C. ;
LaBonte, S. A. ;
Siegele, D. A. ;
Antonazzo, G. ;
Attrill, H. ;
Brown, N. H. ;
Fexova, S. ;
Garapati, P. ;
Jones, T. E. M. ;
Marygold, S. J. ;
Millburn, G. H. ;
Rey, A. J. ;
Trovisco, V. ;
dos Santos, G. ;
Emmert, D. B. ;
Falls, K. ;
Zhou, P. ;
Goodman, J. L. ;
Strelets, V. B. ;
Thurmond, J. ;
Courtot, M. ;
Osumi-Sutherland, D. ;
Parkinson, H. ;
Roncaglia, P. ;
Acencio, M. L. ;
Kuiper, M. ;
Laegreid, A. ;
Logie, C. ;
Lovering, R. C. .
NUCLEIC ACIDS RESEARCH, 2019, 47 (D1) :D330-D338
[5]   Functional and evolutionary implications of gene orthology [J].
Gabaldon, Toni ;
Koonin, Eugene V. .
NATURE REVIEWS GENETICS, 2013, 14 (05) :360-366
[6]   Advances and Applications in the Quest for Orthologs [J].
Glover, Natasha ;
Dessimoz, Christophe ;
Ebersberger, Ingo ;
Forslund, Sofia K. ;
Gabaldon, Toni ;
Huerta-Cepas, Jaime ;
Martin, Maria-Jesus ;
Muffato, Matthieu ;
Patricio, Mateus ;
Pereira, Cecile ;
da Silva, Alan Sousa ;
Wang, Yan ;
Sonnhammer, Erik ;
Thomas, Paul D. ;
Altenhoff, Adrian ;
Blake, Judith A. ;
Capella-Gutierrez, Salvador ;
Chiba, Hirokazu ;
Dessimoz, Christophe ;
Durand, Dannie ;
Ebersberger, Ingo ;
Fernandez-Breis, Jesualdo Tomas ;
Forslund, Sofia ;
Gabaldon, Toni ;
Glover, Natasha ;
Huerta-Cepas, Jaime ;
Lecompte, Odile ;
Lewis, Suzanna ;
Linard, Benjamin ;
Houben, Marina Marcet ;
Marcotte, Edward M. ;
Martin, Maria-Jesus ;
McWhite, Claire ;
de Farias, Tarcisio Mendes ;
Muffato, Matthieu ;
Nevers, Yannis ;
Patricio, Mateus ;
Pereira, Cecile ;
Pryszcz, Leszek ;
Saha, Surya ;
Schiffer, Philipp ;
Sonnhammer, Erik ;
da Silva, Alan Sousa ;
Tang, Haiming ;
Thomas, Paul D. ;
Uchiyama, Ikuo ;
Wang, Yan .
MOLECULAR BIOLOGY AND EVOLUTION, 2019, 36 (10) :2157-2164
[7]   High-throughput functional annotation and data mining with the Blast2GO suite [J].
Gotz, Stefan ;
Garcia-Gomez, Juan Miguel ;
Terol, Javier ;
Williams, Tim D. ;
Nagaraj, Shivashankar H. ;
Nueda, Maria Jose ;
Robles, Montserrat ;
Talon, Manuel ;
Dopazo, Joaquin ;
Conesa, Ana .
NUCLEIC ACIDS RESEARCH, 2008, 36 (10) :3420-3435
[8]   eggNOG 5.0: a hierarchical, functionally and phylogenetically annotated orthology resource based on 5090 organisms and 2502 viruses [J].
Huerta-Cepas, Jaime ;
Szklarczyk, Damian ;
Heller, Davide ;
Hernandez-Plaza, Ana ;
Forslund, Sofia K. ;
Cook, Helen ;
Mende, Daniel R. ;
Letunic, Ivica ;
Rattei, Thomas ;
Jensen, Lars J. ;
von Mering, Christian ;
Bork, Peer .
NUCLEIC ACIDS RESEARCH, 2019, 47 (D1) :D309-D314
[9]   Fast Genome-Wide Functional Annotation through Orthology Assignment by eggNOG-Mapper [J].
Huerta-Cepas, Jaime ;
Forslund, Kristoffer ;
Coelho, Luis Pedro ;
Szklarczyk, Damian ;
Jensen, Lars Juhl ;
von Mering, Christian ;
Bork, Peer .
MOLECULAR BIOLOGY AND EVOLUTION, 2017, 34 (08) :2115-2122
[10]   Prodigal: prokaryotic gene recognition and translation initiation site identification [J].
Hyatt, Doug ;
Chen, Gwo-Liang ;
LoCascio, Philip F. ;
Land, Miriam L. ;
Larimer, Frank W. ;
Hauser, Loren J. .
BMC BIOINFORMATICS, 2010, 11