Amplicon Sequence Variants Artificially Split Bacterial Genomes into Separate Clusters

Cited by: 121
Author
Schloss, Patrick D. [1]
Affiliation
[1] Univ Michigan, Dept Microbiol & Immunol, Ann Arbor, MI 48109 USA
Keywords
16S rRNA gene; ASV; OTU; bioinformatics; microbial communities; microbial ecology; microbiome; RIBOSOMAL-RNA GENES; ARCHAEA
DOI
10.1128/mSphere.00191-21
Chinese Library Classification number
Q93 [Microbiology]
Discipline classification code
071005; 100705
Abstract
Amplicon sequence variants (ASVs) have been proposed as an alternative to operational taxonomic units (OTUs) for analyzing microbial communities. ASVs have grown in popularity, in part because of a desire to reflect a more refined level of taxonomy since they do not cluster sequences based on a distance-based threshold. However, ASVs and the use of overly narrow thresholds to identify OTUs increase the risk of splitting a single genome into separate clusters. To assess this risk, I analyzed the intragenomic variation of 16S rRNA genes from the bacterial genomes represented in an rrn copy number database, which contained 20,427 genomes from 5,972 species. As the number of copies of the 16S rRNA gene increased in a genome, the number of ASVs also increased. There was an average of 0.58 ASVs per copy of the 16S rRNA gene for full-length 16S rRNA genes. It was necessary to use a distance threshold of 5.25% to cluster full-length ASVs from the same genome into a single OTU with 95% confidence for genomes with 7 copies of the 16S rRNA gene, such as Escherichia coli. This research highlights the risk of splitting a single bacterial genome into separate clusters when ASVs are used to analyze 16S rRNA gene sequence data. Although there is also a risk of clustering ASVs from different species into the same OTU when using broad distance thresholds, these risks are of less concern than artificially splitting a genome into separate ASVs and OTUs.
IMPORTANCE 16S rRNA gene sequencing has engendered significant interest in studying microbial communities. There has been tension between trying to classify 16S rRNA gene sequences to increasingly lower taxonomic levels and the reality that those levels were defined using more sequence and physiological information than is available from a fragment of the 16S rRNA gene. Furthermore, the naming of bacterial taxa reflects the biases of those who name them. One motivation for the recent push to adopt ASVs in place of OTUs in microbial community analyses is to allow researchers to perform their analyses at the finest possible level that reflects species-level taxonomy. The current research is significant because it quantifies the risk of artificially splitting bacterial genomes into separate clusters. Far from providing a better representation of bacterial taxonomy and biology, the ASV approach can lead to conflicting inferences about the ecology of different ASVs from the same genome.
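The two quantities reported in the abstract, ASVs per gene copy and the distance threshold needed to merge a genome's ASVs into one OTU, can be illustrated with a minimal Python sketch. This is not the author's pipeline. The sequences are hypothetical toy data, the copies are assumed to be pre-aligned to equal length, and the threshold is assumed to be the largest pairwise distance among a genome's distinct copies (a furthest-neighbor-style criterion, which may differ from the clustering method used in the paper).

```python
from itertools import combinations


def count_asvs(copies):
    """Number of distinct sequences among a genome's 16S rRNA gene copies."""
    return len(set(copies))


def pairwise_distance(a, b):
    """Fraction of mismatched positions between two aligned sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    return sum(x != y for x, y in zip(a, b)) / len(a)


def min_threshold_single_otu(copies):
    """Smallest threshold at which every pair of ASVs from one genome is
    within the threshold (assumption: furthest-neighbor-style criterion)."""
    asvs = sorted(set(copies))
    if len(asvs) < 2:
        return 0.0
    return max(pairwise_distance(a, b) for a, b in combinations(asvs, 2))


# Hypothetical genome with 7 copies of a 20-nt toy "gene" (not real data).
toy_copies = (
    ["ACGTACGTACGTACGTACGT"] * 4
    + ["ACGTACGTACGTACGTACGA"] * 2
    + ["ACGTTCGTACGTACGTACGA"]
)

n_asvs = count_asvs(toy_copies)
asvs_per_copy = n_asvs / len(toy_copies)
threshold = min_threshold_single_otu(toy_copies)

print(f"ASVs: {n_asvs}, ASVs per copy: {asvs_per_copy:.2f}, "
      f"threshold to merge: {threshold:.2%}")
```

In this toy genome, three distinct variants arise from seven copies, so an ASV-level analysis would report one organism as three units unless the clustering threshold is at least as large as the most divergent pair of copies.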
Pages: 1 / 6
Page count: 6