Ultra-accurate microbial amplicon sequencing with synthetic long reads

Cited by: 64
Authors
Callahan, Benjamin J. [1, 2]
Grinevich, Dmitry [1]
Thakur, Siddhartha [1]
Balamotis, Michael A. [3]
Ben Yehezkel, Tuval [3]
Affiliations
[1] North Carolina State Univ, Dept Populat Hlth & Pathobiol, Coll Vet Med, Raleigh, NC 27695 USA
[2] North Carolina State Univ, Bioinformat Res Ctr, Raleigh, NC 27695 USA
[3] Loop Genom, San Jose, CA USA
Keywords
Synthetic long reads; Amplicon sequencing; Metagenomics; Long-read sequencing; ALLELE DISCOVERY; GENOME; RESOLUTION; 16S; SEQ;
DOI
10.1186/s40168-021-01072-3
Chinese Library Classification
Q93 [Microbiology]
Subject classification codes
071005; 100705
Abstract
Background: Of the many pathogenic bacterial species that are known, only a fraction are readily identifiable directly from a complex microbial community using standard next-generation DNA sequencing. Long-read sequencing offers the potential to identify a wider range of species and to differentiate between strains within a species, but attaining sufficient accuracy in complex metagenomes remains a challenge.
Methods: Here, we describe and analytically validate LoopSeq, a commercially available synthetic long-read (SLR) sequencing technology that generates highly accurate long reads from standard short reads.
Results: LoopSeq reads are sufficiently long and accurate to identify microbial genes and species directly from complex samples. LoopSeq perfectly recovered the full diversity of 16S rRNA genes from known strains in a synthetic microbial community. Full-length LoopSeq reads had a per-base error rate of 0.005%, an accuracy that exceeds that reported for other long-read sequencing technologies. 18S-ITS and genomic sequencing of fungal and bacterial isolates confirmed that LoopSeq sequencing maintains that accuracy for reads up to 6 kb in length. LoopSeq full-length 16S rRNA reads could accurately classify organisms down to the species level in rinsate from retail meat samples, and could differentiate strains within species identified by the CDC as potential foodborne pathogens.
Conclusions: The order-of-magnitude improvement in length and accuracy over standard Illumina amplicon sequencing achieved with LoopSeq enables accurate species-level and strain identification from complex- to low-biomass microbiome samples. The ability to generate accurate and long microbiome sequencing reads using standard short-read sequencers will accelerate the building of quality microbial sequence databases and remove a significant hurdle on the path to precision microbial genomics.
Pages: 13
Related papers
50 in total
  • [31] Fast and Accurate Classification of Meta-Genomics Long Reads With deSAMBA
    Li, Gaoyang
    Liu, Yongzhuang
    Li, Deying
    Liu, Bo
    Li, Junyi
    Hu, Yang
    Wang, Yadong
    FRONTIERS IN CELL AND DEVELOPMENTAL BIOLOGY, 2021, 9
  • [32] DuBA.flow-A Low-Cost, Long-Read Amplicon Sequencing Workflow for the Validation of Synthetic DNA Constructs
    Rojas, Adan A. Ramirez
    Brinkmann, Cedric K.
    Koebel, Tania S.
    Schindler, Daniel
    ACS SYNTHETIC BIOLOGY, 2024, 13 (02): 457 - 465
  • [33] Analyzing rare mutations in metagenomes assembled using long and accurate reads
    Fedarko, Marcus W.
    Kolmogorov, Mikhail
    Pevzner, Pavel A.
    GENOME RESEARCH, 2022, 32 (11-12) : 2119 - 2133
  • [34] Short reads from honey bee (Apis sp.) sequencing projects reflect microbial associate diversity
    Gerth, Michael
    Hurst, Gregory D. D.
    PEERJ, 2017, 5
  • [35] CASPER: context-aware scheme for paired-end reads from high-throughput amplicon sequencing
    Sunyoung Kwon
    Byunghan Lee
    Sungroh Yoon
    BMC Bioinformatics, 15
  • [36] Effects of marine sediment as agricultural substrate on soil microbial diversity: an amplicon sequencing study
    Nunez-Gomez, Damaris
    Melgarejo, Pablo
    Martinez-Nicolas, Juan Jose
    Hernandez, Francisca
    Martinez-Font, Rafael
    Lidon, Vicente
    Legua, Pilar
    ENVIRONMENTAL MICROBIOME, 2023, 18 (01)
  • [37] Analysing Microbial Community Composition through Amplicon Sequencing: From Sampling to Hypothesis Testing
    Hugerth, Luisa W.
    Andersson, Anders F.
    FRONTIERS IN MICROBIOLOGY, 2017, 8
  • [38] cloudSPAdes: assembly of synthetic long reads using de Bruijn graphs
    Tolstoganov, Ivan
    Bankevich, Anton
    Chen, Zhoutao
    Pevzner, Pavel A.
    BIOINFORMATICS, 2019, 35 (14) : I61 - I70
  • [39] Multi-locus and long amplicon sequencing approach to study microbial diversity at species level using the MinION™ portable nanopore sequencer
    Benitez-Paez, Alfonso
    Sanz, Yolanda
    GIGASCIENCE, 2017, 6 (07)
  • [40] Amplicon sequencing with internal standards yields accurate picocyanobacteria cell abundances as validated with flow cytometry
    Jones-Kellett, Alexandra E.
    McNichol, Jesse C.
    Raut, Yubin
    Cain, Kelsy R.
    Ribalet, Francois
    Armbrust, E. Virginia
    Follows, Michael J.
    Fuhrman, Jed A.
    ISME COMMUNICATIONS, 2024, 4 (01)