A novel ultra high-throughput 16S rRNA gene amplicon sequencing library preparation method for the Illumina HiSeq platform

Cited by: 95
Authors
de Muinck, Eric J. [1]
Trosvik, Pal [1]
Gilfillan, Gregor D. [2, 3]
Hov, Johannes R. [3, 4, 5]
Sundaram, Arvind Y. M. [2, 3]
Affiliations
[1] Univ Oslo, Dept Biosci, Ctr Ecol & Evolutionary Synth, Oslo, Norway
[2] Oslo Univ Hosp, Dept Med Genet, Oslo, Norway
[3] Univ Oslo, Oslo, Norway
[4] Oslo Univ Hosp, Rikshosp, Norwegian PSC Res Ctr, Oslo, Norway
[5] Oslo Univ Hosp, Rikshosp, Res Inst Internal Med, Oslo, Norway
Keywords
16S rRNA gene amplicon sequencing; Illumina library preparation; Indexed PCR; Mock community; Environmental sequencing; Benchmarking; PCR bias; Chimera formation; BIAS; AMPLIFICATION; PRIMERS; SEARCH;
DOI
10.1186/s40168-017-0279-1
Chinese Library Classification number
Q93 [Microbiology]
Subject classification codes
071005; 100705
Abstract
Background: Advances in sequencing technologies and bioinformatics have made the analysis of microbial communities almost routine. Nonetheless, the need remains to improve on the techniques used for gathering such data, including increasing throughput while lowering cost and benchmarking the techniques so that potential sources of bias can be better characterized. Methods: We present a triple-index amplicon sequencing strategy to sequence large numbers of samples at significantly lower cost and in a shorter timeframe compared to existing methods. The design employs a two-stage PCR protocol, incorporating three barcodes to each sample, with the possibility to add a fourth index. It also includes heterogeneity spacers to overcome low complexity issues faced when sequencing amplicons on Illumina platforms. Results: The library preparation method was extensively benchmarked through analysis of a mock community in order to assess biases introduced by sample indexing, number of PCR cycles, and template concentration. We further evaluated the method through re-sequencing of a standardized environmental sample. Finally, we evaluated our protocol on a set of fecal samples from a small cohort of healthy adults, demonstrating good performance in a realistic experimental setting. Between-sample variation was mainly related to batch effects, such as DNA extraction, while sample indexing was also a significant source of bias. PCR cycle number strongly influenced chimera formation and affected relative abundance estimates of species with high GC content. Libraries were sequenced using the Illumina HiSeq and MiSeq platforms to demonstrate that this protocol is highly scalable to sequence thousands of samples at a very low cost. Conclusions: Here, we provide the most comprehensive study of performance and bias inherent to a 16S rRNA gene amplicon sequencing method to date.
Triple-indexing greatly reduces the number of long custom DNA oligos required for library preparation, while the inclusion of variable length heterogeneity spacers minimizes the need for PhiX spike-in. This design results in a significant cost reduction of highly multiplexed amplicon sequencing. The biases we characterize highlight the need for highly standardized protocols. Reassuringly, we find that the biological signal is a far stronger structuring factor than the various sources of bias.
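
The oligo saving claimed for triple-indexing follows from simple combinatorics: if each sample is tagged with one barcode from each of several independent sets, the number of barcoded primers grows with the sum of the set sizes while the number of distinguishable samples grows with their product. The Python sketch below illustrates only that arithmetic. The set sizes, the barcode labels, and the function names `index_capacity` and `assign_indices` are hypothetical and are not taken from the paper's protocol.

```python
from itertools import islice, product


def index_capacity(set_sizes):
    """Return (primers_needed, samples_addressable) for combinatorial indexing."""
    primers = sum(set_sizes)
    samples = 1
    for size in set_sizes:
        samples *= size
    return primers, samples


def assign_indices(index_sets, n_samples):
    """Give each sample a unique tuple holding one barcode label per index set."""
    _, capacity = index_capacity([len(s) for s in index_sets])
    if n_samples > capacity:
        raise ValueError("not enough index combinations for this many samples")
    return list(islice(product(*index_sets), n_samples))


# Hypothetical set sizes, chosen only for illustration.
single = index_capacity([1536])       # one uniquely barcoded primer per sample
triple = index_capacity([16, 8, 12])  # three small barcode sets combined
print(single)  # (1536, 1536)
print(triple)  # (36, 1536)
```

Under these assumed sizes, 36 barcoded primers address the same 1536 samples that would otherwise need 1536 uniquely barcoded primers. How the three indices are actually distributed across the two PCR stages is not stated in the abstract.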
Pages: 15