A unified catalog of 204,938 reference genomes from the human gut microbiome

Cited by: 654
Authors
Almeida, Alexandre [1, 2]
Nayfach, Stephen [3, 4]
Boland, Miguel [1]
Strozzi, Francesco [5]
Beracochea, Martin [1]
Shi, Zhou Jason [6, 7]
Pollard, Katherine S. [6, 7, 8, 9, 10, 11]
Sakharova, Ekaterina [1]
Parks, Donovan H. [12]
Hugenholtz, Philip [12]
Segata, Nicola [13]
Kyrpides, Nikos C. [3, 4]
Finn, Robert D. [1]
Affiliations
[1] European Bioinformat Inst EMBL EBI, Wellcome Genome Campus, Hinxton, England
[2] Wellcome Sanger Inst, Wellcome Genome Campus, Hinxton, England
[3] US DOE, Joint Genome Inst, Walnut Creek, CA USA
[4] Lawrence Berkeley Natl Lab, Environm Genom & Syst Biol Div, Berkeley, CA USA
[5] Enterome Biosci, Paris, France
[6] Gladstone Inst, San Francisco, CA USA
[7] Chan Zuckerberg Biohub, San Francisco, CA USA
[8] Univ Calif San Francisco, Inst Human Genet, San Francisco, CA 94143 USA
[9] Univ Calif San Francisco, Inst Computat Hlth Sci, San Francisco, CA 94143 USA
[10] Univ Calif San Francisco, Quantitat Biol, San Francisco, CA 94143 USA
[11] Univ Calif San Francisco, Dept Epidemiol & Biostat, San Francisco, CA USA
[12] Univ Queensland, Sch Chem & Mol Biosci, Australian Ctr Ecogen, Brisbane, Qld, Australia
[13] Univ Trento, CIBIO Dept, Trento, Italy
Funding
European Research Council; UK Biotechnology and Biological Sciences Research Council
Keywords
READ ALIGNMENT; ASSEMBLED GENOMES; ANALYSIS RESOURCE; ANNOTATION; BACTERIAL; GENES; VERSATILE; COVERAGE; DATABASE; CULTURE
DOI
10.1038/s41587-020-0603-3
Chinese Library Classification (CLC) number
Q81 [Bioengineering (Biotechnology)]; Q93 [Microbiology]
Discipline classification code
071005; 0836; 090102; 100705
Abstract
Comprehensive, high-quality reference genomes are required for functional characterization and taxonomic assignment of the human gut microbiota. We present the Unified Human Gastrointestinal Genome (UHGG) collection, comprising 204,938 nonredundant genomes from 4,644 gut prokaryotes. These genomes encode >170 million protein sequences, which we collated in the Unified Human Gastrointestinal Protein (UHGP) catalog. The UHGP more than doubles the number of gut proteins in comparison to those present in the Integrated Gene Catalog. More than 70% of the UHGG species lack cultured representatives, and 40% of the UHGP lack functional annotations. Intraspecies genomic variation analyses revealed a large reservoir of accessory genes and single-nucleotide variants, many of which are specific to individual human populations. The UHGG and UHGP collections will enable studies linking genotypes to phenotypes in the human gut microbiome.
Pages: 105-114
Page count: 10