Protax-fungi: a web-based tool for probabilistic taxonomic placement of fungal internal transcribed spacer sequences

Cited by: 59
Authors
Abarenkov, Kessy [1]
Somervuo, Panu [2]
Nilsson, R. Henrik [3, 4]
Kirk, Paul M. [5]
Huotari, Tea [6]
Abrego, Nerea [6]
Ovaskainen, Otso [2, 7]
Affiliations
[1] Univ Tartu, Nat Hist Museum, Vanemuise 46, EE-51014 Tartu, Estonia
[2] Univ Helsinki, Organismal & Evolutionary Biol Res Programme, POB 65, FI-00014 Helsinki, Finland
[3] Univ Gothenburg, Dept Biol & Environm Sci, Box 461, S-40530 Gothenburg, Sweden
[4] Gothenburg Global Biodivers Ctr, Box 461, SE-40530 Gothenburg, Sweden
[5] Royal Bot Gardens, Richmond TW9 3DS, Surrey, England
[6] Univ Helsinki, Dept Agr Sci, POB 27, FI-00014 Helsinki, Finland
[7] Norwegian Univ Sci & Technol, Dept Biol, Ctr Biodivers Dynam, N-7491 Trondheim, Norway
Funding
Academy of Finland
Keywords
annotation; data quality; environmental sequencing; fungi; identification tool; internal transcribed spacer (ITS); molecular species identification; probabilistic taxonomic assignment; IDENTIFICATION; CLASSIFICATION; DATABASE; REVEALS; TRAITS;
DOI
10.1111/nph.15301
Chinese Library Classification number
Q94 [Botany]
Discipline classification code
071001
Abstract
Incompleteness of reference sequence databases and unresolved taxonomic relationships complicate taxonomic placement of fungal sequences. We developed Protax-fungi, a general tool for taxonomic placement of fungal internal transcribed spacer (ITS) sequences, and implemented it in the PlutoF platform of the UNITE database for molecular identification of fungi. With empirical data on root- and wood-associated fungi, Protax-fungi reliably identified (with at least 90% identification probability) the majority of sequences to the order level but only around one-fifth of them to the species level, reflecting the current limited coverage of the databases. Protax-fungi outperformed the Sintax and Rdb classifiers, with higher accuracy and lower calibration error, when applied to data on mock communities representing species groups with poor sequence database coverage. We applied Protax-fungi to examine the internal consistencies of the Index Fungorum and UNITE databases. This revealed inconsistencies in the taxonomy database as well as mislabelling and sequence quality problems in the reference database. The corresponding improvements were implemented in both databases. Protax-fungi provides a robust tool for performing statistically reliable identifications of fungi in spite of the incompleteness of extant reference sequence databases and unresolved taxonomic relationships.
Pages: 517-525
Number of pages: 9
Related papers
30 in total
[1]   ITS all right mama: investigating the formation of chimeric sequences in the ITS2 region by DNA metabarcoding analyses of fungal mock communities of different complexities [J].
Aas, Anders Bjornsgaard ;
Davey, Marie Louise ;
Kauserud, Havard .
MOLECULAR ECOLOGY RESOURCES, 2017, 17 (04) :730-741
[2]   PlutoF-a Web Based Workbench for Ecological and Taxonomic Research, with an Online Implementation for Fungal ITS Sequences [J].
Abarenkov, Kessy ;
Tedersoo, Leho ;
Nilsson, R. Henrik ;
Vellak, Kai ;
Saar, Irja ;
Veldre, Vilmar ;
Parmasto, Erast ;
Prous, Marko ;
Aan, Anne ;
Ots, Margus ;
Kurina, Olavi ;
Ostonen, Ivika ;
Jogeva, Janno ;
Halapuu, Siim ;
Poldmaa, Kadri ;
Toots, Maert ;
Truu, Jaak ;
Larsson, Karl-Henrik ;
Koljalg, Urmas .
EVOLUTIONARY BIOINFORMATICS, 2010, 6 :189-196
[3]  
[Anonymous], 2016, BIORXIV, DOI 10.1101/074161
[4]   A fungal mock community control for amplicon sequencing experiments [J].
Bakker, Matthew G. .
MOLECULAR ECOLOGY RESOURCES, 2018, 18 (03) :541-556
[5]   Search and clustering orders of magnitude faster than BLAST [J].
Edgar, Robert C. .
BIOINFORMATICS, 2010, 26 (19) :2460-2461
[6]   Habitat conditions and phenological tree traits overrule the influence of tree genotype in the needle mycobiome-Picea glauca system at an arctic treeline ecotone [J].
Eusemann, Pascal ;
Schnittler, Martin ;
Nilsson, R. Henrik ;
Jumpponen, Ari ;
Dahl, Mathilde B. ;
Wuerth, David G. ;
Buras, Allan ;
Wilmking, Martin ;
Unterseher, Martin .
NEW PHYTOLOGIST, 2016, 211 (04) :1221-1231
[7]   Modeling the percolation of annotation errors in a database of protein sequences [J].
Gilks, WR ;
Audit, B ;
De Angelis, D ;
Tsoka, S ;
Ouzounis, CA .
BIOINFORMATICS, 2002, 18 (12) :1641-1649
[8]   Discovery of dark matter fungi in aquatic ecosystems demands a reappraisal of the phylogeny and ecology of zoosporic fungi [J].
Grossart, Hans-Peter ;
Wurzbacher, Christian ;
James, Timothy Y. ;
Kagami, Maiko .
FUNGAL ECOLOGY, 2016, 19 :28-38
[9]   Critical Issues in Mycobiota Analysis [J].
Halwachs, Bettina ;
Madhusudhan, Nandhitha ;
Krause, Robert ;
Nilsson, R. Henrik ;
Moissl-Eichinger, Christine ;
Hoegenauer, Christoph ;
Thallinger, Gerhard G. ;
Gorkiewicz, Gregor .
FRONTIERS IN MICROBIOLOGY, 2017, 8
[10]   Proposals to permit DNA sequence data to serve as types of names of fungi [J].
Hawksworth, David L. ;
Hibbett, David S. ;
Kirk, Paul M. ;
Luecking, Robert .
TAXON, 2016, 65 (04) :899-900