Ultra-accurate microbial amplicon sequencing with synthetic long reads

Cited by: 63
Authors
Callahan, Benjamin J. [1 ,2 ]
Grinevich, Dmitry [1 ]
Thakur, Siddhartha [1 ]
Balamotis, Michael A. [3 ]
Ben Yehezkel, Tuval [3 ]
Affiliations
[1] North Carolina State Univ, Dept Populat Hlth & Pathobiol, Coll Vet Med, Raleigh, NC 27695 USA
[2] North Carolina State Univ, Bioinformat Res Ctr, Raleigh, NC 27695 USA
[3] Loop Genom, San Jose, CA USA
Keywords
Synthetic long reads; Amplicon sequencing; Metagenomics; Long-read sequencing; ALLELE DISCOVERY; GENOME; RESOLUTION; 16S; SEQ;
DOI
10.1186/s40168-021-01072-3
Chinese Library Classification
Q93 [Microbiology]
Discipline classification codes
071005; 100705
Abstract
Background: Of the many known pathogenic bacterial species, only a fraction are readily identifiable directly from a complex microbial community using standard next-generation DNA sequencing. Long-read sequencing offers the potential to identify a wider range of species and to differentiate between strains within a species, but attaining sufficient accuracy in complex metagenomes remains a challenge.
Methods: Here, we describe and analytically validate LoopSeq, a commercially available synthetic long-read (SLR) sequencing technology that generates highly accurate long reads from standard short reads.
Results: LoopSeq reads are sufficiently long and accurate to identify microbial genes and species directly from complex samples. LoopSeq perfectly recovered the full diversity of 16S rRNA genes from known strains in a synthetic microbial community. Full-length LoopSeq reads had a per-base error rate of 0.005%, an accuracy exceeding that reported for other long-read sequencing technologies. 18S-ITS and genomic sequencing of fungal and bacterial isolates confirmed that LoopSeq maintains that accuracy for reads up to 6 kb in length. LoopSeq full-length 16S rRNA reads could accurately classify organisms down to the species level in rinsate from retail meat samples, and could differentiate strains within species identified by the CDC as potential foodborne pathogens.
Conclusions: The order-of-magnitude improvement in length and accuracy over standard Illumina amplicon sequencing achieved with LoopSeq enables accurate species-level and strain-level identification from complex to low-biomass microbiome samples. The ability to generate accurate, long microbiome sequencing reads using standard short-read sequencers will accelerate the building of quality microbial sequence databases and remove a significant hurdle on the path to precision microbial genomics.
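To put the reported per-base error rate of 0.005% in perspective, the following minimal sketch converts it to a Phred-scaled quality value and estimates how many errors a single read would be expected to carry. The read length of 1500 bp is an assumption (an approximate full-length 16S rRNA gene, not a figure from the abstract), and the error-free fraction assumes errors are independent and uniform across positions, which the abstract does not state. The function names are illustrative only.

```python
import math

# Per-base error rate quoted in the abstract: 0.005%, expressed as a fraction.
PER_BASE_ERROR = 0.005 / 100

# Assumption (not from the abstract): approximate length of a full-length
# 16S rRNA gene read, in base pairs.
READ_LENGTH_BP = 1500


def phred_quality(error_rate: float) -> float:
    """Convert an error probability to a Phred-scaled quality, Q = -10*log10(p)."""
    return -10.0 * math.log10(error_rate)


def expected_errors(error_rate: float, length_bp: int) -> float:
    """Expected number of erroneous bases in a read of the given length."""
    return error_rate * length_bp


def error_free_fraction(error_rate: float, length_bp: int) -> float:
    """Fraction of reads with zero errors, assuming independent per-base errors."""
    return (1.0 - error_rate) ** length_bp


if __name__ == "__main__":
    print(f"Phred quality: Q{phred_quality(PER_BASE_ERROR):.1f}")
    print(f"Expected errors per read: {expected_errors(PER_BASE_ERROR, READ_LENGTH_BP):.3f}")
    print(f"Error-free reads: {error_free_fraction(PER_BASE_ERROR, READ_LENGTH_BP):.1%}")
```

Under these assumptions, the rate corresponds to roughly Q43, with well under one expected error per read, so most full-length reads would be error-free.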
Pages: 13