Ultra-accurate microbial amplicon sequencing with synthetic long reads

被引:72
作者
Callahan, Benjamin J. [1 ,2 ]
Grinevich, Dmitry [1 ]
Thakur, Siddhartha [1 ]
Balamotis, Michael A. [3 ]
Ben Yehezkel, Tuval [3 ]
机构
[1] North Carolina State Univ, Dept Populat Hlth & Pathobiol, Coll Vet Med, Raleigh, NC 27695 USA
[2] North Carolina State Univ, Bioinformat Res Ctr, Raleigh, NC 27695 USA
[3] Loop Genom, San Jose, CA USA
关键词
Synthetic long reads; Amplicon sequencing; Metagenomics; Long-read sequencing; ALLELE DISCOVERY; GENOME; RESOLUTION; 16S; SEQ;
D O I
10.1186/s40168-021-01072-3
中图分类号
Q93 [微生物学];
学科分类号
071005 ; 100705 ;
摘要
Background: Out of the many pathogenic bacterial species that are known, only a fraction are readily identifiable directly from a complex microbial community using standard next generation DNA sequencing. Long-read sequencing offers the potential to identify a wider range of species and to differentiate between strains within a species, but attaining sufficient accuracy in complex metagenomes remains a challenge. Methods: Here, we describe and analytically validate LoopSeq, a commercially available synthetic long-read (SLR) sequencing technology that generates highly accurate long reads from standard short reads. Results: LoopSeq reads are sufficiently long and accurate to identify microbial genes and species directly from complex samples. LoopSeq perfectly recovered the full diversity of 16S rRNA genes from known strains in a synthetic microbial community. Full-length LoopSeq reads had a per-base error rate of 0.005%, which exceeds the accuracy reported for other long-read sequencing technologies. 18S-ITS and genomic sequencing of fungal and bacterial isolates confirmed that LoopSeq sequencing maintains that accuracy for reads up to 6 kb in length. LoopSeq full-length 16S rRNA reads could accurately classify organisms down to the species level in rinsate from retail meat samples, and could differentiate strains within species identified by the CDC as potential foodborne pathogens. Conclusions: The order-of-magnitude improvement in length and accuracy over standard Illumina amplicon sequencing achieved with LoopSeq enables accurate species-level and strain identification from complex- to low-biomass microbiome samples. The ability to generate accurate and long microbiome sequencing reads using standard short read sequencers will accelerate the building of quality microbial sequence databases and removes a significant hurdle on the path to precision microbial genomics.
引用
收藏
页数:13
相关论文
共 39 条
[1]   Lack of Evidence for Plague or Anthrax on the New York City Subway [J].
Ackelsberg, Joel ;
Rakeman, Jennifer ;
Hughes, Scott ;
Petersen, Jeannine ;
Mead, Paul ;
Schriefer, Martin ;
Kingry, Luke ;
Hoffmaster, Alex ;
Gee, Jay E. .
CELL SYSTEMS, 2015, 1 (01) :4-5
[2]   Modern Methods for Delineating Metagenomic Complexity [J].
Afshinnekoo, Ebrahim ;
Meydan, Cem ;
Chowdhury, Shanin ;
Jaroudi, Dyala ;
Boyer, Collin ;
Bernstein, Nick ;
Maritz, Julia M. ;
Reeves, Darryl ;
Gandara, Jorge ;
Chhangawala, Sagar ;
Ahsanuddin, Sofia ;
Simmons, Amber ;
Nessel, Timothy ;
Sundaresh, Bharathi ;
Pereira, Elizabeth ;
Jorgensen, Ellen ;
Kolokotronis, Sergios-Orestis ;
Kirchberger, Nell ;
Garcia, Isaac ;
Gandara, David ;
Dhanraj, Sean ;
Nawrin, Tanzina ;
Saletore, Yogesh ;
Alexander, Noah ;
Vijay, Priyanka ;
Henaff, Elizabeth M. ;
Zumbo, Paul ;
Walsh, Michael ;
O'Mullan, Gregory D. ;
Tighe, Scott ;
Dudley, Joel T. ;
Dunaif, Anya ;
Ennis, Sean ;
O'Halloran, Eoghan ;
Magalhaes, Tiago R. ;
Boone, Braden ;
Jones, Angela L. ;
Muth, Theodore R. ;
Paolantonio, Katie Schneider ;
Alter, Elizabeth ;
Schadt, Eric E. ;
Garbarino, Jeanne ;
Prill, Robert J. ;
Carlton, Jane M. ;
Levy, Shawn ;
Mason, Christopher E. .
CELL SYSTEMS, 2015, 1 (01) :6-7
[3]   Geospatial Resolution of Human and Bacterial Diversity with City-Scale Metagenomics (vol 1, pg 72, 2015) [J].
Afshinnekoo, Ebrahim ;
Meydan, Cem ;
Chowdhury, Shanin ;
Jaroudi, Dyala ;
Boyer, Collin ;
Bernstein, Nick ;
Maritz, Julia M. ;
Reeves, Darryl ;
Gandara, Jorge ;
Chhangawala, Sagar ;
Ahsanuddin, Sofia ;
Simmons, Amber ;
Nessel, Timothy ;
Sundaresh, Bharathi ;
Pereira, Elizabeth ;
Jorgensen, Ellen ;
Kolokotronis, Sergios-Orestis ;
Kirchberger, Nell ;
Garcia, Isaac ;
Gandara, David ;
Dhanraj, Sean ;
Nawrin, Tanzina ;
Saletore, Yogesh ;
Alexander, Noah ;
Vijay, Priyanka ;
Henaff, Elizabeth M. ;
Zumbo, Paul ;
Walsh, Michael ;
O'Mullan, Gregory D. ;
Tighe, Scott ;
Dudley, Joel T. ;
Dunaif, Anya ;
Ennis, Sean ;
O'Halloran, Eoghan ;
Magalhaes, Tiago R. ;
Boone, Braden ;
Jones, Angela L. ;
Muth, Theodore R. ;
Paolantonio, Katie Schneider ;
Alter, Elizabeth ;
Schadt, Eric E. ;
Garbarino, Jeanne ;
Prill, Robert J. ;
Carlton, Jane M. ;
Levy, Shawn ;
Mason, Christopher E. .
CELL SYSTEMS, 2015, 1 (01) :97-+
[4]   SPAdes: A New Genome Assembly Algorithm and Its Applications to Single-Cell Sequencing [J].
Bankevich, Anton ;
Nurk, Sergey ;
Antipov, Dmitry ;
Gurevich, Alexey A. ;
Dvorkin, Mikhail ;
Kulikov, Alexander S. ;
Lesin, Valery M. ;
Nikolenko, Sergey I. ;
Son Pham ;
Prjibelski, Andrey D. ;
Pyshkin, Alexey V. ;
Sirotkin, Alexander V. ;
Vyahhi, Nikolay ;
Tesler, Glenn ;
Alekseyev, Max A. ;
Pevzner, Pavel A. .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2012, 19 (05) :455-477
[5]   Improved annotation of the domestic pig genome through integration of Iso-Seq and RNA-seq data [J].
Beiki, H. ;
Liu, H. ;
Huang, J. ;
Manchanda, N. ;
Nonneman, D. ;
Smith, T. P. L. ;
Reecy, J. M. ;
Tuggle, C. K. .
BMC GENOMICS, 2019, 20 (1)
[6]   Investigation of a COVID-19 outbreak in Germany resulting from a single travel-associated primary case: a case series [J].
Boehmer, Merle M. ;
Buchholz, Udo ;
Corman, Victor M. ;
Hoch, Martin ;
Katz, Katharina ;
Marosevic, Durdica, V ;
Boehm, Stefanie ;
Woudenberg, Tom ;
Ackermann, Nikolaus ;
Konrad, Regina ;
Eberle, Ute ;
Treis, Bianca ;
Dangel, Alexandra ;
Bengs, Katja ;
Fingerle, Volker ;
Berger, Anja ;
Hoermansdorfer, Stefan ;
Ippisch, Siegfried ;
Wicklein, Bernd ;
Grahl, Andreas ;
Poertner, Kirsten ;
Muller, Nadine ;
Zeitlmann, Nadine ;
Boender, T. Sonia ;
Cai, Wei ;
Reich, Andreas ;
an der Heiden, Maria ;
Rexroth, Ute ;
Hamouda, Osamah ;
Schneider, Julia ;
Veith, Talitha ;
Muehlemann, Barbara ;
Woelfel, Roman ;
Antwerpen, Markus ;
Walter, Mathias ;
Protzer, Ulrike ;
Liebl, Bernhard ;
Haas, Walter ;
Sing, Andreas ;
Drosten, Christian ;
Zapf, Andreas .
LANCET INFECTIOUS DISEASES, 2020, 20 (08) :920-928
[7]   Trimmomatic: a flexible trimmer for Illumina sequence data [J].
Bolger, Anthony M. ;
Lohse, Marc ;
Usadel, Bjoern .
BIOINFORMATICS, 2014, 30 (15) :2114-2120
[8]   Systematic Profiling of Full-Length Ig and TCR Repertoire Diversity in Rhesus Macaque through Long Read Transcriptome Sequencing [J].
Brochu, Hayden N. ;
Tseng, Elizabeth ;
Smith, Elise ;
Thomas, Matthew J. ;
Jones, Aiden M. ;
Diveley, Kayleigh R. ;
Law, Lynn ;
Hansen, Scott G. ;
Picker, Louis J. ;
Gale, Michael, Jr. ;
Peng, Xinxia .
JOURNAL OF IMMUNOLOGY, 2020, 204 (12) :3434-3444
[9]   A method for high precision sequencing of near full-length 16S rRNA genes on an Illumina MiSeq [J].
Burke, Catherine M. ;
Darling, Aaron E. .
PEERJ, 2016, 4
[10]   High-throughput amplicon sequencing of the full-length 16S rRNA gene with single-nucleotide resolution [J].
Callahan, Benjamin J. ;
Wong, Joan ;
Heiner, Cheryl ;
Oh, Steve ;
Theriot, Casey M. ;
Gulati, Ajay S. ;
McGill, Sarah K. ;
Dougherty, Michael K. .
NUCLEIC ACIDS RESEARCH, 2019, 47 (18)