dRep: a tool for fast and accurate genomic comparisons that enables improved genome recovery from metagenomes through de-replication

Cited by: 1316
Authors
Olm, Matthew R. [1]
Brown, Christopher T. [1]
Brooks, Brandon [1]
Banfield, Jillian F. [2, 3]
Affiliations
[1] Univ Calif Berkeley, Dept Plant & Microbial Biol, Berkeley, CA 94720 USA
[2] Univ Calif Berkeley, Dept Environm Sci Policy & Management, 369 McCone Hall, Berkeley, CA 94720 USA
[3] Univ Calif Berkeley, Dept Earth & Planetary Sci, Berkeley, CA 94720 USA
Funding
U.S. National Institutes of Health; U.S. National Science Foundation
Keywords
INFANT GUT; MICROBIAL GENOMES; COLONIZATION; STRAINS
DOI
10.1038/ismej.2017.126
Chinese Library Classification number
Q14 [Ecology (Bioecology)]
Discipline classification code
071012; 0713
Abstract
The number of microbial genomes sequenced each year is expanding rapidly, in part due to genome-resolved metagenomic studies that routinely recover hundreds of draft-quality genomes. Rapid algorithms have been developed to comprehensively compare large genome sets, but they are not accurate with draft-quality genomes. Here we present dRep, a program that reduces the computational time for pairwise genome comparisons by sequentially applying a fast, inaccurate estimation of genome distance and a slow, accurate measure of average nucleotide identity. dRep achieves a 28× increase in speed with perfect recall and precision when benchmarked against previously developed algorithms. We demonstrate the use of dRep for genome recovery from time-series datasets. Each metagenome was assembled separately, and dRep was used to identify groups of essentially identical genomes and select the best genome from each replicate set. This resulted in recovery of significantly more and higher-quality genomes compared to the set recovered using co-assembly.
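The two-stage idea in the abstract (a cheap, approximate comparison to form coarse groups, then an expensive, accurate comparison only within each group, then picking the best genome per replicate set) can be illustrated with a minimal Python sketch. This is not dRep's implementation: `fast_similarity` (k-mer Jaccard), `slow_identity` (positional match fraction on pre-aligned toy strings), the cutoffs, the sample names, and the quality scores are all hypothetical stand-ins chosen for illustration.

```python
from itertools import combinations


def kmers(seq, k=4):
    """Return the set of all k-length substrings of seq."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def fast_similarity(a, b, k=4):
    """Cheap stand-in for a fast distance estimate: Jaccard similarity of k-mer sets."""
    ka, kb = kmers(a, k), kmers(b, k)
    union = ka | kb
    return len(ka & kb) / len(union) if union else 0.0


def slow_identity(a, b):
    """Toy stand-in for an accurate identity measure: fraction of matching
    positions, assuming the two sequences are already aligned."""
    longest = max(len(a), len(b))
    if longest == 0:
        return 0.0
    return sum(x == y for x, y in zip(a, b)) / longest


def cluster(names, similar):
    """Single-linkage clustering with union-find; similar(a, b) returns a bool."""
    parent = {n: n for n in names}

    def find(n):
        while parent[n] != n:
            parent[n] = parent[parent[n]]  # path compression
            n = parent[n]
        return n

    for a, b in combinations(names, 2):
        if similar(a, b):
            parent[find(a)] = find(b)

    groups = {}
    for n in names:
        groups.setdefault(find(n), []).append(n)
    return list(groups.values())


def dereplicate(genomes, quality, fast_cutoff=0.5, slow_cutoff=0.95):
    """Return (sorted winners, number of slow comparisons performed)."""
    slow_calls = 0

    def fast_similar(a, b):
        return fast_similarity(genomes[a], genomes[b]) >= fast_cutoff

    def slow_similar(a, b):
        nonlocal slow_calls
        slow_calls += 1
        return slow_identity(genomes[a], genomes[b]) >= slow_cutoff

    winners = []
    # Stage 1: coarse groups from the cheap measure over all pairs.
    for group in cluster(sorted(genomes), fast_similar):
        # Stage 2: the expensive measure runs only within each coarse group.
        for replicate_set in cluster(group, slow_similar):
            winners.append(max(replicate_set, key=lambda n: quality[n]))
    return sorted(winners), slow_calls


# Hypothetical toy data: the same genome recovered from several samples.
GENOME_A = "ACGTACGGTCAATGCCGTAGCTAGGATCCA"
GENOME_B = "TTTTGGGGCCCCAAAATTTTGGGGCCCCAA"
genomes = {
    "sample1_A": GENOME_A,
    "sample2_A": GENOME_A,
    "sample3_A_variant": GENOME_A[:-3] + "GGT",  # three substitutions
    "sample1_B": GENOME_B,
    "sample2_B": GENOME_B,
}
quality = {"sample1_A": 90, "sample2_A": 95, "sample3_A_variant": 80,
           "sample1_B": 70, "sample2_B": 60}

winners, slow_calls = dereplicate(genomes, quality)
total_pairs = len(genomes) * (len(genomes) - 1) // 2
print(winners, f"slow comparisons: {slow_calls} of {total_pairs} possible pairs")
```

The saving in this sketch comes from skipping the expensive comparison for pairs that the cheap measure already places in different coarse groups; how large that saving is on real data depends on the genome set and the cutoffs.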
Pages: 2864-2868
Page count: 5