Great differences in performance and outcome of high-throughput sequencing data analysis platforms for fungal metabarcoding

被引:45
作者
Anslan, Sten [1 ]
Nilsson, R. Henrik [2 ]
Wurzbacher, Christian [3 ]
Baldrian, Petr [4 ]
Tedersoo, Leho [5 ]
Bahram, Mohammad [6 ,7 ,8 ]
机构
[1] Braunschweig Univ Technol, Zool Inst, Mendelssohnstr 4, D-38106 Braunschweig, Germany
[2] Univ Gothenburg, Dept Biol & Environm Sci, Gothenburg Global Biodivers Ctr, Box 461, S-40530 Gothenburg, Sweden
[3] Tech Univ Munich, Coulombwall 3, D-85748 Garching, Germany
[4] Czech Acad Sci, Inst Microbiol, Videnska 1083, Prague 14220 4, Czech Republic
[5] Tartu Univ, Nat Hist Museum, 14a Ravila, Tartu 50411, Estonia
[6] Tartu Univ, Inst Ecol & Earth Sci, 14a Ravila, EE-50411 Tartu, Estonia
[7] Uppsala Univ, Dept Organismal Biol, Evolutionary Biol Ctr, Norbyvagen 18D, Uppsala, Sweden
[8] Swedish Univ Agr Sci, Dept Ecol, Ulls Vag 16, S-75651 Uppsala, Sweden
关键词
Microbial communities; microbiome; mycobiome; fungal biodiversity; metagenomics; amplicon sequencing; INTERNAL TRANSCRIBED SPACER; BAYESIAN CLASSIFIER; IDENTIFICATION;
D O I
10.3897/mycokeys.39.28109
中图分类号
Q93 [微生物学];
学科分类号
071005 ; 100705 ;
摘要
Along with recent developments in high-throughput sequencing (HTS) technologies and thus fast accumulation of HTS data, there has been a growing need and interest for developing tools for HTS data processing and communication. In particular, a number of bioinformatics tools have been designed for analysing metabarcoding data, each with specific features, assumptions and outputs. To evaluate the potential effect of the application of different bioinformatics workflow on the results, we compared the performance of different analysis platforms on two contrasting high-throughput sequencing data sets. Our analysis revealed that the computation time, quality of error filtering and hence output of specific bioinformatics process largely depends on the platform used. Our results show that none of the bioinformatics workflows appears to perfectly filter out the accumulated errors and generate Operational Taxonomic Units, although PipeCraft, LotuS and PIPITS perform better than QIIME2 and Galaxy for the tested fungal amplicon dataset. We conclude that the output of each platform requires manual validation of the OTUs by examining the taxonomy assignment values.
引用
收藏
页码:29 / 40
页数:12
相关论文
共 35 条
[1]   PlutoF-a Web Based Workbench for Ecological and Taxonomic Research, with an Online Implementation for Fungal ITS Sequences [J].
Abarenkov, Kessy ;
Tedersoo, Leho ;
Nilsson, R. Henrik ;
Vellak, Kai ;
Saar, Irja ;
Veldre, Vilmar ;
Parmasto, Erast ;
Prous, Marko ;
Aan, Anne ;
Ots, Margus ;
Kurina, Olavi ;
Ostonen, Ivika ;
Jogeva, Janno ;
Halapuu, Siim ;
Poldmaa, Kadri ;
Toots, Maert ;
Truu, Jaak ;
Larsson, Karl-Henrik ;
Koljalg, Urmas .
EVOLUTIONARY BIOINFORMATICS, 2010, 6 :189-196
[2]   The UNITE database for molecular identification of fungi - recent updates and future perspectives [J].
Abarenkov, Kessy ;
Nilsson, R. Henrik ;
Larsson, Karl-Henrik ;
Alexander, Ian J. ;
Eberhardt, Ursula ;
Erland, Susanne ;
Hoiland, Klaus ;
Kjoller, Rasmus ;
Larsson, Ellen ;
Pennanen, Taina ;
Sen, Robin ;
Taylor, Andy F. S. ;
Tedersoo, Leho ;
Ursing, Bjorn M. ;
Vralstad, Trude ;
Liimatainen, Kare ;
Peintner, Ursula ;
Koljalg, Urmas .
NEW PHYTOLOGIST, 2010, 186 (02) :281-285
[3]   The Galaxy platform for accessible, reproducible and collaborative biomedical analyses: 2016 update [J].
Afgan, Enis ;
Baker, Dannon ;
van den Beek, Marius ;
Blankenberg, Daniel ;
Bouvier, Dave ;
Cech, Martin ;
Chilton, John ;
Clements, Dave ;
Coraor, Nate ;
Eberhard, Carl ;
Gruening, Bjoern ;
Guerler, Aysam ;
Hillman-Jackson, Jennifer ;
Von Kuster, Greg ;
Rasche, Eric ;
Soranzo, Nicola ;
Turaga, Nitesh ;
Taylor, James ;
Nekrutenko, Anton ;
Goecks, Jeremy .
NUCLEIC ACIDS RESEARCH, 2016, 44 (W1) :W3-W10
[4]   PERMANOVA, ANOSIM, and the Mantel test in the face of heterogeneous dispersions: What null hypothesis are you testing? [J].
Anderson, Marti J. ;
Walsh, Daniel C. I. .
ECOLOGICAL MONOGRAPHS, 2013, 83 (04) :557-574
[5]   Seasonal and annual variation in fungal communities associated with epigeic springtails (Collembola spp.) in boreal forests [J].
Anslan, Sten ;
Bahram, Mohammad ;
Tedersoo, Leho .
SOIL BIOLOGY & BIOCHEMISTRY, 2018, 116 :245-252
[6]   PipeCraft: Flexible open-source toolkit for bioinformatics analysis of custom high-throughput amplicon sequencing data [J].
Anslan, Sten ;
Bahram, Mohammad ;
Hiiesalu, Indrek ;
Tedersoo, Leho .
MOLECULAR ECOLOGY RESOURCES, 2017, 17 (06) :e234-e240
[7]   Improved software detection and extraction of ITS1 and ITS2 from ribosomal ITS sequences of fungi and other eukaryotes for analysis of environmental sequencing data [J].
Bengtsson-Palme, Johan ;
Ryberg, Martin ;
Hartmann, Martin ;
Branco, Sara ;
Wang, Zheng ;
Godhe, Anna ;
De Wit, Pierre ;
Sanchez-Garcia, Marisol ;
Ebersberger, Ingo ;
de Sousa, Filipe ;
Amend, Anthony S. ;
Jumpponen, Ari ;
Unterseher, Martin ;
Kristiansson, Erik ;
Abarenkov, Kessy ;
Bertrand, Yann J. K. ;
Sanli, Kemal ;
Eriksson, K. Martin ;
Vik, Unni ;
Veldre, Vilmar ;
Nilsson, R. Henrik .
METHODS IN ECOLOGY AND EVOLUTION, 2013, 4 (10) :914-919
[8]   Manipulation of FASTQ data with Galaxy [J].
Blankenberg, Daniel ;
Gordon, Assaf ;
Von Kuster, Gregory ;
Coraor, Nathan ;
Taylor, James ;
Nekrutenko, Anton .
BIOINFORMATICS, 2010, 26 (14) :1783-1785
[9]   Trimmomatic: a flexible trimmer for Illumina sequence data [J].
Bolger, Anthony M. ;
Lohse, Marc ;
Usadel, Bjoern .
BIOINFORMATICS, 2014, 30 (15) :2114-2120
[10]   Scraping the bottom of the barrel: are rare high throughput sequences artifacts? [J].
Brown, Shawn P. ;
Veach, Allison M. ;
Rigdon-Huss, Anne R. ;
Grond, Kirsten ;
Lickteig, Spencer K. ;
Lothamer, Kale ;
Oliver, Alena K. ;
Jumpponen, Ari .
FUNGAL ECOLOGY, 2015, 13 :221-225