A comprehensive benchmarking study of protocols and sequencing platforms for 16S rRNA community profiling

被引:279
作者
D'Amore, Rosalinda [1 ]
Ijaz, Umer Zeeshan [2 ]
Schirmer, Melanie [2 ]
Kenny, John G. [1 ]
Gregory, Richard [1 ]
Darby, Alistair C. [1 ]
Shakya, Migun [3 ]
Podar, Mircea [4 ]
Quince, Christopher [5 ]
Hall, Neil [1 ]
机构
[1] Univ Liverpool, Inst Integrat Biol, Liverpool L69 7ZB, Merseyside, England
[2] Univ Glasgow, Sch Engn, Glasgow G12 8LT, Lanark, Scotland
[3] Dartmouth Coll, Dept Biol Sci, Hanover, NH 03755 USA
[4] Oak Ridge Natl Lab, Biosci Div, Oak Ridge, TN 37831 USA
[5] Univ Warwick, Warwick Med Sch, Warwick CV4 7AL, England
来源
BMC GENOMICS | 2016年 / 17卷
基金
英国生物技术与生命科学研究理事会;
关键词
MICROBIAL DIVERSITY; GENE; AMPLIFICATION; ALIGNMENT; PRIMERS; SEARCH; BIASES; READS;
D O I
10.1186/s12864-015-2194-9
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Background: In the last 5 years, the rapid pace of innovations and improvements in sequencing technologies has completely changed the landscape of metagenomic and metagenetic experiments. Therefore, it is critical to benchmark the various methodologies for interrogating the composition of microbial communities, so that we can assess their strengths and limitations. The most common phylogenetic marker for microbial community diversity studies is the 16S ribosomal RNA gene and in the last 10 years the field has moved from sequencing a small number of amplicons and samples to more complex studies where thousands of samples and multiple different gene regions are interrogated. Results: We assembled 2 synthetic communities with an even (EM) and uneven (UM) distribution of archaeal and bacterial strains and species, as metagenomic control material, to assess performance of different experimental strategies. The 2 synthetic communities were used in this study, to highlight the limitations and the advantages of the leading sequencing platforms: MiSeq (Illumina), The Pacific Biosciences RSII, 454 GS-FLX/+ (Roche), and IonTorrent (Life Technologies). We describe an extensive survey based on synthetic communities using 3 experimental designs (fusion primers, universal tailed tag, ligated adaptors) across the 9 hypervariable 16S rDNA regions. We demonstrate that library preparation methodology can affect data interpretation due to different error and chimera rates generated during the procedure. The observed community composition was always biased, to a degree that depended on the platform, sequenced region and primer choice. However, crucially, our analysis suggests that 16S rRNA sequencing is still quantitative, in that relative changes in abundance of taxa between samples can be recovered, despite these biases. Conclusion: We have assessed a range of experimental conditions across several next generation sequencing platforms using the most up-to-date configurations. We propose that the choice of sequencing platform and experimental design needs to be taken into consideration in the early stage of a project by running a small trial consisting of several hypervariable regions to quantify the discriminatory power of each region. We also suggest that the use of a synthetic community as a positive control would be beneficial to identify the potential biases and procedural drawbacks that may lead to data misinterpretation. The results of this study will serve as a guideline for making decisions on which experimental condition and sequencing platform to consider to achieve the best microbial profiling.
引用
收藏
页数:20
相关论文
共 40 条
[1]   BASIC LOCAL ALIGNMENT SEARCH TOOL [J].
ALTSCHUL, SF ;
GISH, W ;
MILLER, W ;
MYERS, EW ;
LIPMAN, DJ .
JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) :403-410
[2]   The Use of Coded PCR Primers Enables High-Throughput Sequencing of Multiple Homolog Amplification Products by 454 Parallel Sequencing [J].
Binladen, Jonas ;
Gilbert, M. Thomas P. ;
Bollback, Jonathan P. ;
Panitz, Frank ;
Bendixen, Christian ;
Nielsen, Rasmus ;
Willerslev, Eske .
PLOS ONE, 2007, 2 (02)
[3]   Targeted Amplicon Sequencing (TAS): A Scalable Next-Gen Approach to Multilocus, Multitaxa Phylogenetics [J].
Bybee, Seth M. ;
Bracken-Grissom, Heather ;
Haynes, Benjamin D. ;
Hermansen, Russell A. ;
Byers, Robert L. ;
Clement, Mark J. ;
Udall, Joshua A. ;
Wilcox, Edward R. ;
Crandall, Keith A. .
GENOME BIOLOGY AND EVOLUTION, 2011, 3 :1312-1323
[4]   Ultra-high-throughput microbial community analysis on the Illumina HiSeq and MiSeq platforms [J].
Caporaso, J. Gregory ;
Lauber, Christian L. ;
Walters, William A. ;
Berg-Lyons, Donna ;
Huntley, James ;
Fierer, Noah ;
Owens, Sarah M. ;
Betley, Jason ;
Fraser, Louise ;
Bauer, Markus ;
Gormley, Niall ;
Gilbert, Jack A. ;
Smith, Geoff ;
Knight, Rob .
ISME JOURNAL, 2012, 6 (08) :1621-1624
[5]   Gene capture and random amplification for quantitative recovery of homologous genes [J].
Crosby, Laurel D. ;
Criddle, Craig S. .
MOLECULAR AND CELLULAR PROBES, 2007, 21 (02) :140-147
[6]   Search and clustering orders of magnitude faster than BLAST [J].
Edgar, Robert C. .
BIOINFORMATICS, 2010, 26 (19) :2460-2461
[7]   The Effect of Primer Choice and Short Read Sequences on the Outcome of 16S rRNA Gene Based Diversity Studies [J].
Ghyselinck, Jonas ;
Pfeiffer, Stefan ;
Heylen, Kim ;
Sessitsch, Angela ;
De Vos, Paul .
PLOS ONE, 2013, 8 (08)
[8]   Synthetic microbial communities [J].
Grosskopf, Tobias ;
Soyer, Orkun S. .
CURRENT OPINION IN MICROBIOLOGY, 2014, 18 :72-77
[9]   Microbiome science needs a healthy dose of scepticism [J].
Hanage, William P. .
NATURE, 2014, 512 (7514) :247-248
[10]   Considerations for the development and application of control materials to improve metagenomic microbial community profiling [J].
Huggett, Jim F. ;
Laver, Thomas ;
Tamisak, Sasithon ;
Nixon, Gavin ;
O'Sullivan, Denise M. ;
Elaswarapu, Ramnath ;
Studholme, David J. ;
Foy, Carole A. .
ACCREDITATION AND QUALITY ASSURANCE, 2013, 18 (02) :77-83