NanoAmpli-Seq: a workflow for amplicon sequencing for mixed microbial communities on the nanopore sequencing platform

被引:63
作者
Calus, Szymon T. [1 ]
Ijaz, Umer Z. [1 ]
Pinto, Ameet J. [2 ]
机构
[1] Univ Glasgow, Sch Engn, Glasgow G12 8QQ, Lanark, Scotland
[2] Northeastern Univ, Dept Civil & Environm Engn, 417 SN,360 Huntington Ave, Boston, MA 02115 USA
基金
英国自然环境研究理事会; 英国工程与自然科学研究理事会;
关键词
amplicon sequencing; nanopore sequencing; de novo analyses; sequencing accuracy; 16S; ALIGNMENT; RESOLUTION; MAFFT;
D O I
10.1093/gigascience/giy140
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background: Amplicon sequencing on Illumina sequencing platforms leverages their deep sequencing and multiplexing capacity but is limited in genetic resolution due to short read lengths. While Oxford Nanopore or Pacific Biosciences sequencing platforms overcome this limitation, their application has been limited due to higher error rates or lower data output. Results: In this study, we introduce an amplicon sequencing workflow, i.e., NanoAmpli-Seq, that builds on the intramolecular-ligated nanopore consensus sequencing (INC-Seq) approach and demonstrate its application for full-length 16S rRNA gene sequencing. NanoAmpli-Seq includes vital improvements to the INC-Seq protocol that reduces sample processing time while significantly improving sequence accuracy. The developed protocol includes chopSeq software for fragmentation and read orientation correction of INC-Seq consensus reads while nanoClust algorithm was designed for read partitioning-based de novo clustering and within cluster consensus calling to obtain accurate full-length 16S rRNA gene sequences. Conclusions: NanoAmpli-Seq accurately estimates the diversity of tested mock communities with average consensus sequence accuracy of 99.5% for 2D and 1D(2) sequencing on the nanopore sequencing platform. Nearly all residual errors in NanoAmpli-Seq sequences originate from deletions in homopolymer regions, indicating that homopolymer aware base calling or error correction may allow for sequencing accuracy comparable to short-read sequencing platforms.
引用
收藏
页数:16
相关论文
共 47 条
[1]  
[Anonymous], COWPLOT STREAMLINED
[2]  
[Anonymous], 2017, Miscellaneous functions for Grid Graphics R package gridExtra version 2.3
[3]   Multi-locus and long amplicon sequencing approach to study microbial diversity at species level using the MinION™ portable nanopore sequencer [J].
Benitez-Paez, Alfonso ;
Sanz, Yolanda .
GIGASCIENCE, 2017, 6 (07)
[4]   Species-level resolution of 16S rRNA gene amplicons sequenced through the MinION™ portable nanopore sequencer [J].
Benitez-Paez, Alfonso ;
Portune, Kevin J. ;
Sanz, Yolanda .
GIGASCIENCE, 2016, 5
[5]   A method for high precision sequencing of near full-length 16S rRNA genes on an Illumina MiSeq [J].
Burke, Catherine M. ;
Darling, Aaron E. .
PEERJ, 2016, 4
[6]  
Calus ST, 2018, GIGASCIENCE DATABASE, DOI [10.5524/100, DOI 10.5524/100]
[7]   Ultra-high-throughput microbial community analysis on the Illumina HiSeq and MiSeq platforms [J].
Caporaso, J. Gregory ;
Lauber, Christian L. ;
Walters, William A. ;
Berg-Lyons, Donna ;
Huntley, James ;
Fierer, Noah ;
Owens, Sarah M. ;
Betley, Jason ;
Fraser, Louise ;
Bauer, Markus ;
Gormley, Niall ;
Gilbert, Jack A. ;
Smith, Geoff ;
Knight, Rob .
ISME JOURNAL, 2012, 6 (08) :1621-1624
[8]   Global patterns of 16S rRNA diversity at a depth of millions of sequences per sample [J].
Caporaso, J. Gregory ;
Lauber, Christian L. ;
Walters, William A. ;
Berg-Lyons, Donna ;
Lozupone, Catherine A. ;
Turnbaugh, Peter J. ;
Fierer, Noah ;
Knight, Rob .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2011, 108 :4516-4522
[9]  
Chin CS, 2016, NAT METHODS, V13, P1050, DOI [10.1038/NMETH.4035, 10.1038/nmeth.4035]
[10]  
Cusco A., 2017, BIORXIV, P167015, DOI [10.1101/167015, DOI 10.1101/167015]