Generation of Comprehensive Ecosystem-Specific Reference Databases with Species-Level Resolution by High-Throughput Full-Length 16S rRNA Gene Sequencing and Automated Taxonomy Assignment (AutoTax)

被引:85
作者
Dueholm, Morten Simonsen [1 ]
Andersen, Kasper Skytte [1 ]
McIlroy, Simon Jon [1 ,2 ]
Kristensen, Jannie Munk [1 ]
Yashiro, Erika [1 ]
Karst, Soren Michael [1 ]
Albertsen, Mads [1 ]
Nielsen, Per Halkjaer [1 ]
机构
[1] Aalborg Univ, Ctr Microbial Communities, Dept Chem & Biosci, Aalborg, Denmark
[2] Queensland Univ Technol QUT, Ctr Microbiome Res, Sch Biomed Sci, Translat Res Inst, Woolloongabba, Qld, Australia
关键词
16S RNA; gene sequencing; microbial communities; microbial ecology; taxonomy; wastewater treatment; BACTERIA; ARCHAEA; DIVERSITY; SILVA;
D O I
10.1128/mBio.01557-20
中图分类号
Q93 [微生物学];
学科分类号
071005 ; 100705 ;
摘要
High-throughput 16S rRNA gene amplicon sequencing is an essential method for studying the diversity and dynamics of microbial communities. However, this method is presently hampered by the lack of high-identity reference sequences for many environmental microbes in the public 16S rRNA gene reference databases and by the absence of a systematic and comprehensive taxonomy for the uncultured majority. Here, we demonstrate how high-throughput synthetic long-read sequencing can be applied to create ecosystem-specific full-length 16S rRNA gene amplicon sequence variant (FL-ASV) resolved reference databases that include high-identity references (similar to 98.7% identity) for nearly all abundant bacteria (similar to 0.01% relative abundance) using Danish wastewater treatment systems and anaerobic digesters as an example. In addition, we introduce a novel sequence identity-based approach for automated taxonomy assignment (AutoTax) that provides a complete seven-rank taxonomy for all reference sequences, using the SILVA taxonomy as a backbone, with stable placeholder names for unclassified taxa. The FL-ASVs are perfectly suited for the evaluation of taxonomic resolution and bias associated with primers commonly used for amplicon sequencing, allowing researchers to choose those that are ideal for their ecosystem. Reference databases processed with AutoTax greatly improves the classification of short-read 16S rRNA ASVs at the genus- and species-level, compared with the commonly used universal reference databases. Importantly, the placeholder names provide a way to explore the unclassified environmental taxa at different taxonomic ranks, which in combination with in situ analyses can be used to uncover their ecological roles.
引用
收藏
页码:1 / 14
页数:14
相关论文
共 69 条
[1]  
Albertsen M., 2018, bioRxiv, DOI DOI 10.1101/299537, Patent No. 299537
[2]   Back to Basics - The Influence of DNA Extraction and Primer Choice on Phylogenetic Analysis of Activated Sludge Communities [J].
Albertsen, Mads ;
Karst, Soren M. ;
Ziegler, Anja S. ;
Kirkegaard, Rasmus H. ;
Nielsen, Per H. .
PLOS ONE, 2015, 10 (07)
[3]  
Amir A., 2017, MSYSTEMS, V2, DOI DOI 10.1128/MSYSTEMS.00191-16
[4]   Minor revision to V4 region SSU rRNA 806R gene primer greatly increases detection of SAR11 bacterioplankton [J].
Apprill, Amy ;
McNally, Sean ;
Parsons, Rachel ;
Weber, Laura .
AQUATIC MICROBIAL ECOLOGY, 2015, 75 (02) :129-137
[5]   Newly designed 16S rRNA metabarcoding primers amplify diverse and novel archaeal taxa from the environment [J].
Bahram, Mohammad ;
Anslan, Sten ;
Hildebrand, Falk ;
Bork, Peer ;
Tedersoo, Leho .
ENVIRONMENTAL MICROBIOLOGY REPORTS, 2019, 11 (04) :487-494
[6]   Bacterial Communities Associated with the Leaves and the Roots of Arabidopsis thaliana [J].
Bodenhausen, Natacha ;
Horton, Matthew W. ;
Bergelson, Joy .
PLOS ONE, 2013, 8 (02)
[7]   Optimizing taxonomic classification of marker-gene amplicon sequences with QIIME 2′s q2-feature-classifier plugin [J].
Bokulich, Nicholas A. ;
Kaehler, Benjamin D. ;
Rideout, Jai Ram ;
Dillon, Matthew ;
Bolyen, Evan ;
Knight, Rob ;
Huttley, Gavin A. ;
Caporaso, J. Gregory .
MICROBIOME, 2018, 6
[8]   Minimum information about a single amplified genome (MISAG) and a metagenome-assembled genome (MIMAG) of bacteria and archaea [J].
Bowers, Robert M. ;
Kyrpides, Nikos C. ;
Stepanauskas, Ramunas ;
Harmon-Smith, Miranda ;
Doud, Devin ;
Reddy, T. B. K. ;
Schulz, Frederik ;
Jarett, Jessica ;
Rivers, Adam R. ;
Eloe-Fadrosh, Emiley A. ;
Tringe, Susannah G. ;
Ivanova, Natalia N. ;
Copeland, Alex ;
Clum, Alicia ;
Becraft, Eric D. ;
Malmstrom, Rex R. ;
Birren, Bruce ;
Podar, Mircea ;
Bork, Peer ;
Weinstock, George M. ;
Garrity, George M. ;
Dodsworth, Jeremy A. ;
Yooseph, Shibu ;
Sutton, Granger ;
Gloeckner, Frank O. ;
Gilbert, Jack A. ;
Nelson, William C. ;
Hallam, Steven J. ;
Jungbluth, Sean P. ;
Ettema, Thijs J. G. ;
Tighe, Scott ;
Konstantinidis, Konstantinos T. ;
Liu, Wen-Tso ;
Baker, Brett J. ;
Rattei, Thomas ;
Eisen, Jonathan A. ;
Hedlund, Brian ;
McMahon, Katherine D. ;
Fierer, Noah ;
Knight, Rob ;
Finn, Rob ;
Cochrane, Guy ;
Karsch-Mizrachi, Ilene ;
Tyson, Gene W. ;
Rinke, Christian ;
Lapidus, Alla ;
Meyer, Folker ;
Yilmaz, Pelin ;
Parks, Donovan H. ;
Eren, A. M. .
NATURE BIOTECHNOLOGY, 2017, 35 (08) :725-731
[9]   A method for high precision sequencing of near full-length 16S rRNA genes on an Illumina MiSeq [J].
Burke, Catherine M. ;
Darling, Aaron E. .
PEERJ, 2016, 4
[10]   High-throughput amplicon sequencing of the full-length 16S rRNA gene with single-nucleotide resolution [J].
Callahan, Benjamin J. ;
Wong, Joan ;
Heiner, Cheryl ;
Oh, Steve ;
Theriot, Casey M. ;
Gulati, Ajay S. ;
McGill, Sarah K. ;
Dougherty, Michael K. .
NUCLEIC ACIDS RESEARCH, 2019, 47 (18)