Short Read Lengths Recover Ecological Patterns in 16S rRNA Gene Amplicon Data

Authors
Jurburg, Stephanie D. [1, 2]
Affiliations
[1] Helmholtz Centre for Environmental Research (UFZ), Department of Environmental Microbiology, Leipzig, Germany
[2] German Centre for Integrative Biodiversity Research (iDiv), Leipzig, Germany
Keywords
bacteria; bioinformatics; data reuse; metabarcoding; microbiome; bacterioplankton communities; sequence analysis; diversity; identification
DOI
10.1111/1755-0998.14102
Chinese Library Classification
Q5 [Biochemistry]; Q7 [Molecular Biology]
Subject classification codes
071010; 081704
Abstract
16S rRNA gene metabarcoding, the study of amplicon sequences of the 16S rRNA gene from mixed environmental samples, is an increasingly popular and accessible method for assessing bacterial communities across a wide range of environments. As metabarcoding sequence data archives continue to grow, data reuse will likely become an important source of novel insights into the ecology of microbes. While recent work has demonstrated the benefits of longer read lengths for the study of microbial communities from 16S rRNA gene segments, no studies have explored the use of shorter (< 200 bp) read lengths in the context of data reuse. Nevertheless, this information is essential to improve the reuse and comparability of metabarcoding data across existing datasets. This study reanalyzed nine 16S rRNA datasets targeting aquatic, animal-associated and soil microbiomes, and evaluated how processing the sequence data across a range of read lengths affected the resulting taxonomic assignments, biodiversity metrics and differential (i.e., before-after treatment) analyses. Short read lengths successfully recovered ecological patterns and allowed for the use of more sequences. Limited increases in resolution were observed beyond 150 bp reads across environments. Furthermore, abundance-weighted diversity metrics (e.g., Inverse Simpson index, Morisita-Horn dissimilarities or weighted UniFrac distances) were more robust to variation in read lengths. Read lengths alone contributed to consistent increases in the total number of ASVs detected, highlighting the need to consider metabarcoding-derived diversity estimates within the context of the bioinformatics parameters selected. This study provides evidence-based guidelines for the processing of short reads.
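The contrast the abstract draws between ASV counts and abundance-weighted metrics can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the sequences and counts in `TOY_COUNTS` are invented 12-base strings rather than real 16S reads, and `collapse_to_length` is a hypothetical helper that merges variants sharing a prefix. It is not the study's pipeline, and real ASV inference relies on denoising rather than simple prefix collapsing. The index formulas are the standard definitions of richness, the Inverse Simpson index and the Morisita-Horn dissimilarity.

```python
from collections import Counter

# Hypothetical toy data: invented 12-base "reads" with invented counts.
# The two rare pairs differ only in their last four bases.
TOY_COUNTS = {
    "ACGTACGTAAAA": 500,
    "TTGCATGCCCCC": 300,
    "GGATCCGATTTT": 150,
    "CATGCATGAAGG": 20,
    "CATGCATGAACC": 20,
    "AGCTAGCTTTAA": 5,
    "AGCTAGCTTTCC": 5,
}


def collapse_to_length(seq_counts, length):
    """Truncate each sequence to `length` bases and merge identical prefixes."""
    merged = Counter()
    for seq, n in seq_counts.items():
        merged[seq[:length]] += n
    return merged


def richness(counts):
    """Number of variants with a non-zero count."""
    return sum(1 for c in counts if c > 0)


def inverse_simpson(counts):
    """Inverse Simpson index: 1 / sum(p_i^2), with p_i the relative abundance."""
    counts = list(counts)
    total = sum(counts)
    return 1.0 / sum((c / total) ** 2 for c in counts if c > 0)


def morisita_horn(x, y):
    """Morisita-Horn dissimilarity between two aligned count vectors."""
    nx, ny = sum(x), sum(y)
    dx = sum(c * c for c in x) / nx ** 2
    dy = sum(c * c for c in y) / ny ** 2
    cross = sum(a * b for a, b in zip(x, y))
    return 1.0 - 2.0 * cross / ((dx + dy) * nx * ny)


if __name__ == "__main__":
    short = collapse_to_length(TOY_COUNTS, 8)
    print("richness, full vs truncated:",
          richness(TOY_COUNTS.values()), richness(short.values()))
    print("inverse Simpson, full vs truncated:",
          inverse_simpson(TOY_COUNTS.values()), inverse_simpson(short.values()))
```

In this toy case, truncation merges the rare variants, so richness changes noticeably while the Inverse Simpson index, which is dominated by the abundant variants, barely moves. That is the kind of behaviour the abstract describes, shown here only on invented numbers.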
Pages: 11