Taxonomic Reliability of DNA Sequences in Public Sequence Databases: A Fungal Perspective

Cited by: 495
Authors
Nilsson, R. Henrik [1]
Ryberg, Martin [1]
Kristiansson, Erik [2]
Abarenkov, Kessy [3]
Larsson, Karl-Henrik [1]
Koljalg, Urmas [3]
Affiliations
[1] Univ Gothenburg, Dept Plant & Environm Sci, Gothenburg, Sweden
[2] Chalmers Univ Technol, Dept Math Stat, S-41296 Gothenburg, Sweden
[3] Univ Tartu, Inst Bot & Ecol, EE-50090 Tartu, Estonia
Keywords
DIVERSITY; IDENTIFICATION; EXAMPLE
DOI
10.1371/journal.pone.0000059
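Chinese Library Classification (CLC) number
O [Mathematical Sciences and Chemistry]; P [Astronomy and Earth Sciences]; Q [Biological Sciences]; N [General Natural Sciences]
Subject classification codes
07; 0710; 09
Abstract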
Background. DNA sequences are increasingly seen as one of the primary information sources for species identification in many organism groups. Such approaches, popularly known as barcoding, are underpinned by the assumption that the reference databases used for comparison are sufficiently complete and feature correctly and informatively annotated entries. Methodology/Principal Findings. The present study uses a large set of fungal DNA sequences from the inclusive International Nucleotide Sequence Database to show that the taxon sampling of fungi is far from complete, that about 20% of the entries may be incorrectly identified to species level, and that the majority of entries lack descriptive and up-to-date annotations. Conclusions. The problems with taxonomic reliability and insufficient annotations in public DNA repositories form a tangible obstacle to sequence-based species identification, and it is manifest that the greatest challenges to biological barcoding will be of taxonomical, rather than technical, nature.
Pages: 4