1,003 reference genomes of bacterial and archaeal isolates expand coverage of the tree of life

Cited by: 179
Authors
Mukherjee, Supratim [1, 10]
Seshadri, Rekha [1, 10]
Varghese, Neha J. [1]
Eloe-Fadrosh, Emiley A. [1]
Meier-Kolthoff, Jan P. [2]
Goeker, Markus [2]
Coates, R. Cameron [1, 9]
Hadjithomas, Michalis [1]
Pavlopoulos, Georgios A. [1]
Paez-Espino, David [1]
Yoshikuni, Yasuo [1]
Visel, Axel [1]
Whitman, William B. [3]
Garrity, George M. [4, 5]
Eisen, Jonathan A. [6]
Hugenholtz, Philip [7]
Pati, Amrita [1, 9]
Ivanova, Natalia N. [1]
Woyke, Tanja [1]
Klenk, Hans-Peter [8]
Kyrpides, Nikos C. [1]
Affiliations
[1] US DOE, Joint Genome Inst, Walnut Creek, CA 94598 USA
[2] DSMZ German Collect Microorganisms & Cell Culture, Leibniz Inst, Braunschweig, Germany
[3] Univ Georgia, Dept Microbiol, Athens, GA 30602 USA
[4] Michigan State Univ, Dept Microbiol & Mol Genet, E Lansing, MI 48824 USA
[5] NamesforLife LLC, E Lansing, MI USA
[6] Univ Calif Davis, Genome Ctr, Davis, CA 95616 USA
[7] Univ Queensland, Australian Ctr Ecogen, Brisbane, Qld, Australia
[8] Newcastle Univ, Sch Biol, Newcastle Upon Tyne, Tyne & Wear, England
[9] Zymergen Inc, Emeryville, CA USA
[10] Roche Mol Syst Inc, Pleasanton, CA USA
Keywords
STANDARD OPERATING PROCEDURE; SP-NOV; GENE; SEQUENCE; ANNOTATION; PHYLOGENY; PIPELINE; SOIL; CLASSIFICATION; MYCOBACTERIA;
DOI
10.1038/nbt.3886
Chinese Library Classification (CLC) number
Q81 [Bioengineering (Biotechnology)]; Q93 [Microbiology];
Discipline classification code
071005; 0836; 090102; 100705;
Abstract
We present 1,003 reference genomes that were sequenced as part of the Genomic Encyclopedia of Bacteria and Archaea (GEBA) initiative, selected to maximize sequence coverage of phylogenetic space. These genomes double the number of existing type strains and expand their overall phylogenetic diversity by 25%. Comparative analyses with previously available finished and draft genomes reveal a 10.5% increase in novel protein families as a function of phylogenetic diversity. The GEBA genomes recruit 25 million previously unassigned metagenomic proteins from 4,650 samples, improving their phylogenetic and functional interpretation. We identify numerous biosynthetic clusters and experimentally validate a divergent phenazine cluster with potential new chemical structure and antimicrobial activity. This Resource is the largest single release of reference genomes to date. Bacterial and archaeal isolate sequence space is still far from saturated, and future endeavors in this direction will continue to be a valuable resource for scientific discovery.
Pages: 676 / +
Page count: 10