A unified catalog of 204,938 reference genomes from the human gut microbiome

被引:715
作者
Almeida, Alexandre [1 ,2 ]
Nayfach, Stephen [3 ,4 ]
Boland, Miguel [1 ]
Strozzi, Francesco [5 ]
Beracochea, Martin [1 ]
Shi, Zhou Jason [6 ,7 ]
Pollard, Katherine S. [6 ,7 ,8 ,9 ,10 ,11 ]
Sakharova, Ekaterina [1 ]
Parks, Donovan H. [12 ]
Hugenholtz, Philip [12 ]
Segata, Nicola [13 ]
Kyrpides, Nikos C. [3 ,4 ]
Finn, Robert D. [1 ]
机构
[1] European Bioinformat Inst EMBL EBI, Wellcome Genome Campus, Hinxton, England
[2] Wellcome Sanger Inst, Wellcome Genome Campus, Hinxton, England
[3] US DOE, Joint Genome Inst, Walnut Creek, CA USA
[4] Lawrence Berkeley Natl Lab, Environm Genom & Syst Biol Div, Berkeley, CA USA
[5] Enterome Biosci, Paris, France
[6] Gladstone Inst, San Francisco, CA USA
[7] Chan Zuckerberg Biohub, San Francisco, CA USA
[8] Univ Calif San Francisco, Inst Human Genet, San Francisco, CA 94143 USA
[9] Univ Calif San Francisco, Inst Computat Hlth Sci, San Francisco, CA 94143 USA
[10] Univ Calif San Francisco, Quantitat Biol, San Francisco, CA 94143 USA
[11] Univ Calif San Francisco, Dept Epidemiol & Biostat, San Francisco, CA USA
[12] Univ Queensland, Sch Chem & Mol Biosci, Australian Ctr Ecogen, Brisbane, Qld, Australia
[13] Univ Trento, CIBIO Dept, Trento, Italy
基金
欧洲研究理事会; 英国生物技术与生命科学研究理事会;
关键词
READ ALIGNMENT; ASSEMBLED GENOMES; ANALYSIS RESOURCE; ANNOTATION; BACTERIAL; GENES; VERSATILE; COVERAGE; DATABASE; CULTURE;
D O I
10.1038/s41587-020-0603-3
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Comprehensive, high-quality reference genomes are required for functional characterization and taxonomic assignment of the human gut microbiota. We present the Unified Human Gastrointestinal Genome (UHGG) collection, comprising 204,938 nonredundant genomes from 4,644 gut prokaryotes. These genomes encode >170 million protein sequences, which we collated in the Unified Human Gastrointestinal Protein (UHGP) catalog. The UHGP more than doubles the number of gut proteins in comparison to those present in the Integrated Gene Catalog. More than 70% of the UHGG species lack cultured representatives, and 40% of the UHGP lack functional annotations. Intraspecies genomic variation analyses revealed a large reservoir of accessory genes and single-nucleotide variants, many of which are specific to individual human populations. The UHGG and UHGP collections will enable studies linking genotypes to phenotypes in the human gut microbiome.
引用
收藏
页码:105 / 114
页数:10
相关论文
共 75 条
[1]   A new genomic blueprint of the human gut microbiota [J].
Almeida, Alexandre ;
Mitchell, Alex L. ;
Boland, Miguel ;
Forster, Samuel C. ;
Gloor, Gregory B. ;
Tarkowska, Aleksandra ;
Lawley, Trevor D. ;
Finn, Robert D. .
NATURE, 2019, 568 (7753) :499-+
[2]  
Alneberg J, 2014, NAT METHODS, V11, P1144, DOI [10.1038/NMETH.3103, 10.1038/nmeth.3103]
[3]   The European Nucleotide Archive in 2019 [J].
Amid, Clara ;
Alako, Blaise T. F. ;
Kadhirvelu, Vishnukumar Balavenkataraman ;
Burdett, Tony ;
Burgin, Josephine ;
Fan, Jun ;
Harrison, Peter W. ;
Holt, Sam ;
Hussein, Abdulrahman ;
Ivanov, Eugene ;
Jayathilaka, Suran ;
Kay, Simon ;
Keane, Thomas ;
Leinonen, Rasko ;
Liu, Xin ;
Martinez-Villacorta, Josue ;
Milano, Annalisa ;
Pakseresht, Amir ;
Rahman, Nadim ;
Rajan, Jeena ;
Reddy, Kethi ;
Richards, Edward ;
Smirnov, Dmitriy ;
Sokolov, Alexey ;
Vijayaraja, Senthilnathan ;
Cochrane, Guy .
NUCLEIC ACIDS RESEARCH, 2020, 48 (D1) :D70-D76
[4]   Thousands of microbial genomes shed light on interconnected biogeochemical processes in an aquifer system [J].
Anantharaman, Karthik ;
Brown, Christopher T. ;
Hug, Laura A. ;
Sharon, Itai ;
Castelle, Cindy J. ;
Probst, Alexander J. ;
Thomas, Brian C. ;
Singh, Andrea ;
Wilkins, Michael J. ;
Karaoz, Ulas ;
Brodie, Eoin L. ;
Williams, Kenneth H. ;
Hubbard, Susan S. ;
Banfield, Jillian F. .
NATURE COMMUNICATIONS, 2016, 7
[5]   A Metagenomic Meta-analysis Reveals Functional Signatures of Health and Disease in the Human Gut Microbiome [J].
Armour, Courtney R. ;
Nayfach, Stephen ;
Pollard, Katherine S. ;
Sharpton, Thomas J. .
MSYSTEMS, 2019, 4 (04)
[6]   Minimum information about a single amplified genome (MISAG) and a metagenome-assembled genome (MIMAG) of bacteria and archaea [J].
Bowers, Robert M. ;
Kyrpides, Nikos C. ;
Stepanauskas, Ramunas ;
Harmon-Smith, Miranda ;
Doud, Devin ;
Reddy, T. B. K. ;
Schulz, Frederik ;
Jarett, Jessica ;
Rivers, Adam R. ;
Eloe-Fadrosh, Emiley A. ;
Tringe, Susannah G. ;
Ivanova, Natalia N. ;
Copeland, Alex ;
Clum, Alicia ;
Becraft, Eric D. ;
Malmstrom, Rex R. ;
Birren, Bruce ;
Podar, Mircea ;
Bork, Peer ;
Weinstock, George M. ;
Garrity, George M. ;
Dodsworth, Jeremy A. ;
Yooseph, Shibu ;
Sutton, Granger ;
Gloeckner, Frank O. ;
Gilbert, Jack A. ;
Nelson, William C. ;
Hallam, Steven J. ;
Jungbluth, Sean P. ;
Ettema, Thijs J. G. ;
Tighe, Scott ;
Konstantinidis, Konstantinos T. ;
Liu, Wen-Tso ;
Baker, Brett J. ;
Rattei, Thomas ;
Eisen, Jonathan A. ;
Hedlund, Brian ;
McMahon, Katherine D. ;
Fierer, Noah ;
Knight, Rob ;
Finn, Rob ;
Cochrane, Guy ;
Karsch-Mizrachi, Ilene ;
Tyson, Gene W. ;
Rinke, Christian ;
Lapidus, Alla ;
Meyer, Folker ;
Yilmaz, Pelin ;
Parks, Donovan H. ;
Eren, A. M. .
NATURE BIOTECHNOLOGY, 2017, 35 (08) :725-731
[7]   Ultrafast search of all deposited bacterial and viral genomic data [J].
Bradley, Phelim ;
den Bakker, Henk C. ;
Rocha, Eduardo P. C. ;
McVean, Gil ;
Iqbal, Zamin .
NATURE BIOTECHNOLOGY, 2019, 37 (02) :152-+
[8]   Culturing of 'unculturable' human microbiota reveals novel taxa and extensive sporulation [J].
Browne, Hilary P. ;
Forster, Samuel C. ;
Anonye, Blessing O. ;
Kumar, Nitin ;
Neville, B. Anne ;
Stares, Mark D. ;
Goulding, David ;
Lawley, Trevor D. .
NATURE, 2016, 533 (7604) :543-+
[9]   Fast and sensitive protein alignment using DIAMOND [J].
Buchfink, Benjamin ;
Xie, Chao ;
Huson, Daniel H. .
NATURE METHODS, 2015, 12 (01) :59-60
[10]   Improved protein-ligand binding affinity prediction by using a curvature-dependent surface-area model [J].
Cao, Yang ;
Li, Lei .
BIOINFORMATICS, 2014, 30 (12) :1674-1680