BEExact: a Metataxonomic Database Tool for High-Resolution Inference of Bee-Associated Microbial Communities

被引:30
作者
Daisley, Brendan A. [1 ,2 ]
Reid, Gregor [1 ,2 ,3 ]
机构
[1] Univ Western Ontario, Dept Microbiol & Immunol, London, ON, Canada
[2] Canadian Ctr Human Microbiome & Probiot Res, London, ON, Canada
[3] Schulich Sch Med, Dept Surg, London, ON, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
microbiota; bees; 16S rRNA gene sequencing; microbial ecology; bioinformatics; host-microbe interactions; polymicrobial communities; microbial; phylogenetics; endosymbionts; environmental microbiology; invertebrate-microbe; interactions; microbial communities; taxonomy; metataxonomics; GUT MICROBIOTA; INFECTION; CLASSIFICATION; BACTERIA; APIS;
D O I
10.1128/mSystems.00082-21
中图分类号
Q93 [微生物学];
学科分类号
071005 ; 100705 ;
摘要
High-throughput 16S rRNA gene sequencing technologies have robust potential to improve our understanding of bee (Hymenoptera: Apoidea)-associated microbial communities and their impact on hive health and disease. Despite recent computation algorithms now permitting exact inferencing of high-resolution exact amplicon sequence variants (ASVs), the taxonomic classification of these ASVs remains a challenge due to inadequate reference databases. To address this, we assemble a comprehensive data set of all publicly available bee-associated 16S rRNA gene sequences, systematically annotate poorly resolved identities via inclusion of 618 placeholder labels for uncultivated microbial dark matter, and correct for phylogenetic inconsistencies using a complementary set of distance-based and maximum likelihood correction strategies. To benchmark the resultant database (BEExact), we compare performance against all existing reference databases in silico using a variety of classifier algorithms to produce probabilistic confidence scores. We also validate realistic classification rates on an independent set of similar to 234 million short-read sequences derived from 32 studies encompassing 50 different bee types (36 eusocial and 14 solitary). Species-level classification rates on short-read ASVs range from 80 to 90% using BEExact (with similar to 20% due to "bxid" placeholder names), whereas only similar to 30% at best can be resolved with current universal databases. A series of data-driven recommendations are developed for future studies. We conclude that BEExact (https://github.com/bdaisley/BEExact) enables accurate and standardized microbiota profiling across a broad range of bee species-two factors of key importance to reproducibility and meaningful knowledge exchange within the scientific community that together, can enhance the overall utility and ecological relevance of routine 16S rRNA gene-based sequencing endeavors. IMPORTANCE The failure of current universal taxonomic databases to support the rapidly expanding field of bee microbiota research has led to many investigators relying on "in-house" reference sets or manual classification of sequence reads (usually based on BLAST searches), often with vague identity thresholds and subjective taxonomy choices. This time-consuming, error-and bias-prone process lacks standardization, cripples the potential for comparative cross-study analysis, and in many cases is likely to incorrectly sway study conclusions. BEExact is structured on and leverages several complementary bioinformatic techniques to enable refined inference of bee host-associated microbial communities without any other methodological modifications necessary. It also bridges the gap between current practical outcomes (i.e., phylotype-to-genus level constraints with 97% operational taxonomic units [OTUs]) and the theoretical resolution (i.e., species-to-strain level classification with 100% ASVs) attainable in future microbiota investigations. Other niche habitats could also likely benefit from customized database curation via implementation of the novel approaches introduced in this study.
引用
收藏
页数:28
相关论文
共 123 条
[1]  
AMIR A, 2017, MSYSTEMS, V2, DOI DOI 10.1128/MSYSTEMS.00191-16
[2]   Microbial Ecology of the Hive and Pollination Landscape: Bacterial Associates from Floral Nectar, the Alimentary Tract and Stored Food of Honey Bees (Apis mellifera) [J].
Anderson, Kirk E. ;
Sheehan, Timothy H. ;
Mott, Brendon M. ;
Maes, Patrick ;
Snyder, Lucy ;
Schwan, Melissa R. ;
Walton, Alexander ;
Jones, Beryl M. ;
Corby-Harris, Vanessa .
PLOS ONE, 2013, 8 (12)
[3]  
[Anonymous], 2019, MICROB ECOL, DOI DOI 10.1007/s00248-018-1300-3
[4]   Discovery of Paenibacillus larvae ERIC V: Phenotypic and genomic comparison to genotypes ERIC I-IV reveal different inventories of virulence factors which correlate with epidemiological prevalences of American Foulbrood [J].
Beims, Hannes ;
Bunk, Boyke ;
Erler, Silvio ;
Mohr, Kathrin, I ;
Sproeer, Cathrin ;
Pradella, Silke ;
Guenther, Gabi ;
Rohde, Manfred ;
von der Ohe, Werner ;
Steinert, Michael .
INTERNATIONAL JOURNAL OF MEDICAL MICROBIOLOGY, 2020, 310 (02)
[5]   Optimizing taxonomic classification of marker-gene amplicon sequences with QIIME 2′s q2-feature-classifier plugin [J].
Bokulich, Nicholas A. ;
Kaehler, Benjamin D. ;
Rideout, Jai Ram ;
Dillon, Matthew ;
Bolyen, Evan ;
Knight, Rob ;
Huttley, Gavin A. ;
Caporaso, J. Gregory .
MICROBIOME, 2018, 6
[6]   Reproducible, interactive, scalable and extensible microbiome data science using QIIME 2 [J].
Bolyen, Evan ;
Rideout, Jai Ram ;
Dillon, Matthew R. ;
Bokulich, NicholasA. ;
Abnet, Christian C. ;
Al-Ghalith, Gabriel A. ;
Alexander, Harriet ;
Alm, Eric J. ;
Arumugam, Manimozhiyan ;
Asnicar, Francesco ;
Bai, Yang ;
Bisanz, Jordan E. ;
Bittinger, Kyle ;
Brejnrod, Asker ;
Brislawn, Colin J. ;
Brown, C. Titus ;
Callahan, Benjamin J. ;
Caraballo-Rodriguez, Andres Mauricio ;
Chase, John ;
Cope, Emily K. ;
Da Silva, Ricardo ;
Diener, Christian ;
Dorrestein, Pieter C. ;
Douglas, Gavin M. ;
Durall, Daniel M. ;
Duvallet, Claire ;
Edwardson, Christian F. ;
Ernst, Madeleine ;
Estaki, Mehrbod ;
Fouquier, Jennifer ;
Gauglitz, Julia M. ;
Gibbons, Sean M. ;
Gibson, Deanna L. ;
Gonzalez, Antonio ;
Gorlick, Kestrel ;
Guo, Jiarong ;
Hillmann, Benjamin ;
Holmes, Susan ;
Holste, Hannes ;
Huttenhower, Curtis ;
Huttley, Gavin A. ;
Janssen, Stefan ;
Jarmusch, Alan K. ;
Jiang, Lingjing ;
Kaehler, Benjamin D. ;
Bin Kang, Kyo ;
Keefe, Christopher R. ;
Keim, Paul ;
Kelley, Scott T. ;
Knights, Dan .
NATURE BIOTECHNOLOGY, 2019, 37 (08) :852-857
[7]  
Bonilla-Rosso, 2019, ACETOBACTERACEAE HON, DOI [10.1101/861260, DOI 10.1101/861260]
[8]   Behavior and gut bacteria of Partamona helleri under sublethal exposure to a bioinsecticide and a leaf fertilizer [J].
Botina, L. L. ;
Velez, M. ;
Barbosa, W. F. ;
Mendonca, A. C. ;
Pylro, V. S. ;
Totola, M. R. ;
Martins, C. F. .
CHEMOSPHERE, 2019, 234 :187-195
[9]  
Callahan Ben J, 2016, F1000Res, V5, P1492
[10]   Exact sequence variants should replace operational taxonomic units in marker-gene data analysis [J].
Callahan, Benjamin J. ;
McMurdie, Paul J. ;
Holmes, Susan P. .
ISME JOURNAL, 2017, 11 (12) :2639-2643