Database size positively correlates with the loss of species-level taxonomic resolution for the 16S rRNA and other prokaryotic marker genes

Cited by: 0
Authors
Commichaux, Seth [1]
Luan, Tu [2, 3]
Muralidharan, Harihara Subrahmaniam [2, 3]
Pop, Mihai [2, 3]
Affiliations
[1] Food & Drug Adm, Ctr Food Safety & Nutr, Laurel, MD 20708 USA
[2] Univ Maryland, Dept Comp Sci, College Pk, MD USA
[3] Univ Maryland, Ctr Bioinformat & Computat Biol, College Pk, MD USA
Funding
National Institutes of Health (USA)
Keywords
CATALOG
DOI
10.1371/journal.pcbi.1012343
Chinese Library Classification number
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
For decades, the 16S rRNA gene has been used to taxonomically classify prokaryotic species and to taxonomically profile microbial communities. However, the 16S rRNA gene has been criticized for being too conserved to differentiate between distinct species. We argue that the inability to differentiate between species is not a unique feature of the 16S rRNA gene. Rather, we observe the gradual loss of species-level resolution for other nearly-universal prokaryotic marker genes as the number of gene sequences increases in reference databases. This trend was strongly correlated with how represented a taxonomic group was in the database and indicates that, at the gene-level, the boundaries between many species might be fuzzy. Through our study, we argue that any approach that relies on a single marker to distinguish bacterial taxa is fraught even if some markers appear to be discriminative in current databases.
Pages: 12