ITS all right mama: investigating the formation of chimeric sequences in the ITS2 region by DNA metabarcoding analyses of fungal mock communities of different complexities

被引:43
作者
Aas, Anders Bjornsgaard [1 ]
Davey, Marie Louise [1 ]
Kauserud, Havard [1 ]
机构
[1] Univ Oslo, Dept Biosci, Sect Genet & Evolutionary Biol Evogene, POB 1066 Blindern, NO-0316 Oslo, Norway
关键词
chimeric sequences; DNA metabarcoding; microbial ecology; PCR bias; UCHIME; 16S RIBOSOMAL-RNA; PYROSEQUENCED AMPLICONS; BACTERIAL COMMUNITIES; DIVERSITY; ECOLOGY; GENES; IDENTIFICATION; AMPLIFICATION; ARTIFACTS; RICHNESS;
D O I
10.1111/1755-0998.12622
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The formation of chimeric sequences can create significant methodological bias in PCR-based DNA metabarcoding analyses. During mixed-template amplification of barcoding regions, chimera formation is frequent and well documented. However, profiling of fungal communities typically uses the more variable rDNA region ITS. Due to a larger research community, tools for chimera detection have been developed mainly for the 16S/18S markers. However, these tools are widely applied to the ITS region without verification of their performance. We examined the rate of chimera formation during amplification and 454 sequencing of the ITS2 region from fungal mock communities of different complexities. We evaluated the chimera detecting ability of two common chimera-checking algorithms: PERSEUS and UCHIME. Large proportions of the chimeras reported were false positives. No false negatives were found in the data set. Verified chimeras accounted for only 0.2% of the total ITS2 reads, which is considerably less than what is typically reported in 16S and 18S metabarcoding analyses. Verified chimeric 'parent sequences' had significantly higher per cent identity to one another than to random members of the mock communities. Community complexity increased the rate of chimera formation. GC content was higher around the verified chimeric break points, potentially facilitating chimera formation through base pair mismatching in the neighbouring regions of high similarity in the chimeric region. We conclude that the hypervariable nature of the ITS region seems to buffer the rate of chimera formation in comparison with other, less variable barcoding regions, due to shorter regions of high sequence similarity.
引用
收藏
页码:730 / 741
页数:12
相关论文
共 56 条
[1]   The UNITE database for molecular identification of fungi - recent updates and future perspectives [J].
Abarenkov, Kessy ;
Nilsson, R. Henrik ;
Larsson, Karl-Henrik ;
Alexander, Ian J. ;
Eberhardt, Ursula ;
Erland, Susanne ;
Hoiland, Klaus ;
Kjoller, Rasmus ;
Larsson, Ellen ;
Pennanen, Taina ;
Sen, Robin ;
Taylor, Andy F. S. ;
Tedersoo, Leho ;
Ursing, Bjorn M. ;
Vralstad, Trude ;
Liimatainen, Kare ;
Peintner, Ursula ;
Koljalg, Urmas .
NEW PHYTOLOGIST, 2010, 186 (02) :281-285
[2]   Effects of PCR cycle number and DNA polymerase type on the 16S rRNA gene pyrosequencing analysis of bacterial communities [J].
Ahn, Jae-Hyung ;
Kim, Byung-Yong ;
Song, Jaekyeong ;
Weon, Hang-Yeon .
JOURNAL OF MICROBIOLOGY, 2012, 50 (06) :1071-1074
[3]  
[Anonymous], 2008, R LANG ENV STAT COMP
[4]   At least 1 in 20 16S rRNA sequence records currently held in public repositories is estimated to contain substantial anomalies [J].
Ashelford, KE ;
Chuzhanova, NA ;
Fry, JC ;
Jones, AJ ;
Weightman, AJ .
APPLIED AND ENVIRONMENTAL MICROBIOLOGY, 2005, 71 (12) :7724-7736
[5]   ITS as an environmental DNA barcode for fungi: an in silico approach reveals potential PCR biases [J].
Bellemain, Eva ;
Carlsen, Tor ;
Brochmann, Christian ;
Coissac, Eric ;
Taberlet, Pierre ;
Kauserud, Havard .
BMC MICROBIOLOGY, 2010, 10
[6]   Improved software detection and extraction of ITS1 and ITS2 from ribosomal ITS sequences of fungi and other eukaryotes for analysis of environmental sequencing data [J].
Bengtsson-Palme, Johan ;
Ryberg, Martin ;
Hartmann, Martin ;
Branco, Sara ;
Wang, Zheng ;
Godhe, Anna ;
De Wit, Pierre ;
Sanchez-Garcia, Marisol ;
Ebersberger, Ingo ;
de Sousa, Filipe ;
Amend, Anthony S. ;
Jumpponen, Ari ;
Unterseher, Martin ;
Kristiansson, Erik ;
Abarenkov, Kessy ;
Bertrand, Yann J. K. ;
Sanli, Kemal ;
Eriksson, K. Martin ;
Vik, Unni ;
Veldre, Vilmar ;
Nilsson, R. Henrik .
METHODS IN ECOLOGY AND EVOLUTION, 2013, 4 (10) :914-919
[7]   ITS1 versus ITS2 as DNA metabarcodes for fungi [J].
Blaalid, R. ;
Kumar, S. ;
Nilsson, R. H. ;
Abarenkov, K. ;
Kirk, P. M. ;
Kauserud, H. .
MOLECULAR ECOLOGY RESOURCES, 2013, 13 (02) :218-224
[8]   Scraping the bottom of the barrel: are rare high throughput sequences artifacts? [J].
Brown, Shawn P. ;
Veach, Allison M. ;
Rigdon-Huss, Anne R. ;
Grond, Kirsten ;
Lickteig, Spencer K. ;
Lothamer, Kale ;
Oliver, Alena K. ;
Jumpponen, Ari .
FUNGAL ECOLOGY, 2015, 13 :221-225
[9]   MatGAT: An application that generates similarity/identity matrices using protein or DNA sequences [J].
Campanella, JJ ;
Bitincka, L ;
Smalley, J .
BMC BIOINFORMATICS, 2003, 4 (1)
[10]   QIIME allows analysis of high-throughput community sequencing data [J].
Caporaso, J. Gregory ;
Kuczynski, Justin ;
Stombaugh, Jesse ;
Bittinger, Kyle ;
Bushman, Frederic D. ;
Costello, Elizabeth K. ;
Fierer, Noah ;
Pena, Antonio Gonzalez ;
Goodrich, Julia K. ;
Gordon, Jeffrey I. ;
Huttley, Gavin A. ;
Kelley, Scott T. ;
Knights, Dan ;
Koenig, Jeremy E. ;
Ley, Ruth E. ;
Lozupone, Catherine A. ;
McDonald, Daniel ;
Muegge, Brian D. ;
Pirrung, Meg ;
Reeder, Jens ;
Sevinsky, Joel R. ;
Tumbaugh, Peter J. ;
Walters, William A. ;
Widmann, Jeremy ;
Yatsunenko, Tanya ;
Zaneveld, Jesse ;
Knight, Rob .
NATURE METHODS, 2010, 7 (05) :335-336