The Impact of DNA Polymerase and Number of Rounds of Amplification in PCR on 16S rRNA Gene Sequence Data

Cited by: 94
Authors
Sze, Marc A. [1]
Schloss, Patrick D. [1]
Affiliations
[1] Univ Michigan, Dept Microbiol & Immunol, Ann Arbor, MI 48109 USA
Keywords
16S rRNA gene; PCR; bias; environmental microbiology; microbial ecology; microbiome; polymerase; sequence analysis; storage conditions; diversity; libraries
DOI
10.1128/mSphere.00163-19
Chinese Library Classification (CLC) number
Q93 [Microbiology]
Subject classification codes
071005; 100705
Abstract
PCR amplification of 16S rRNA genes is a critical yet underappreciated step in the generation of sequence data to describe the taxonomic composition of microbial communities. Numerous factors in the design of PCR can impact the sequencing error rate, the abundance of chimeric sequences, and the degree to which the fragments in the product represent their abundance in the original sample (i.e., bias). We compared the performance of high-fidelity polymerases and various numbers of rounds of amplification when amplifying a mock community and human stool samples. Although it was impossible to derive specific recommendations, we did observe general trends. Namely, using a polymerase with the highest possible fidelity and minimizing the number of rounds of PCR reduced the sequencing error rate, the fraction of chimeric sequences, and bias. Evidence of bias at the sequence level was subtle and could not be ascribed to the fraction of bases in each fragment that were guanines or cytosines. When analyzing mock community data, the amount by which the community deviated from the expected composition increased with the number of rounds of PCR. This bias was inconsistent for human stool samples. Overall, the results underscore the difficulty of comparing sequence data that are generated by different PCR protocols. However, the results indicate that the variation in human stool samples is generally larger than that introduced by the choice of polymerase or number of rounds of PCR.
Importance
A steep decline in sequencing costs drove an explosion in studies characterizing microbial communities from diverse environments. Although a significant amount of effort has gone into understanding the error profiles of DNA sequencers, little has been done to understand the downstream effects of the PCR amplification protocol. We quantified the effects of the choice of polymerase and number of PCR cycles on the quality of downstream data. We found that these choices can have a profound impact on the way that a microbial community is represented in the sequence data. The effects are relatively small compared to the variation in human stool samples; however, care should be taken to use polymerases with the highest possible fidelity and to minimize the number of rounds of PCR. These results also underscore that it is not possible to directly compare sequence data generated under different PCR conditions.
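The abstract reports that a mock community's deviation from its expected composition grew with the number of PCR rounds, but it does not say how that deviation was measured. As a minimal sketch only, the Python below uses Bray-Curtis dissimilarity between observed and expected relative abundances as one plausible measure. The metric, the function names, the taxon labels, and all read counts are illustrative assumptions, not the authors' method or data.

```python
# Illustrative sketch: quantify how far an observed mock community
# deviates from its expected composition. Bray-Curtis is an assumed
# metric; the record does not state which measure the authors used.

def relative_abundance(counts):
    """Convert raw read counts per taxon into fractions that sum to 1."""
    total = sum(counts.values())
    if total == 0:
        raise ValueError("sample contains no reads")
    return {taxon: n / total for taxon, n in counts.items()}


def bray_curtis(p, q):
    """Bray-Curtis dissimilarity between two abundance profiles (0 = identical)."""
    taxa = set(p) | set(q)
    numerator = sum(abs(p.get(t, 0.0) - q.get(t, 0.0)) for t in taxa)
    denominator = sum(p.get(t, 0.0) + q.get(t, 0.0) for t in taxa)
    return numerator / denominator


# Hypothetical even mock community of four taxa.
expected = {"taxon_A": 0.25, "taxon_B": 0.25, "taxon_C": 0.25, "taxon_D": 0.25}

# Hypothetical read counts keyed by number of PCR cycles (made-up values).
observed_counts = {
    25: {"taxon_A": 260, "taxon_B": 240, "taxon_C": 250, "taxon_D": 250},
    35: {"taxon_A": 400, "taxon_B": 100, "taxon_C": 300, "taxon_D": 200},
}

deviation = {
    cycles: bray_curtis(relative_abundance(counts), expected)
    for cycles, counts in observed_counts.items()
}

for cycles in sorted(deviation):
    print(f"{cycles} cycles: deviation from expected = {deviation[cycles]:.3f}")
```

With real data, the same calculation would be repeated per polymerase and per cycle number; a larger value indicates a community profile further from the known mock composition.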
Pages: 13