RNRdb, a curated database of the universal enzyme family ribonucleotide reductase, reveals a high level of misannotation in sequences deposited to Genbank

被引:92
作者
Lundin, Daniel [1 ]
Torrents, Eduard [1 ,2 ]
Poole, Anthony M. [1 ,3 ]
Sjoberg, Britt-Marie [1 ]
机构
[1] Stockholm Univ, Dept Mol Biol & Funct Genom, S-10691 Stockholm, Sweden
[2] IBEC, Barcelona, Spain
[3] Univ Canterbury, Sch Biol Sci, Christchurch 1, New Zealand
基金
瑞典研究理事会;
关键词
ESCHERICHIA-COLI; UREAPLASMA-UREALYTICUM; ENTAMOEBA-HISTOLYTICA; BORRELIA-BURGDORFERI; PROTEIN SEQUENCES; ACTIVITY PROFILE; GENOME; ANNOTATION; EVOLUTION; COFACTOR;
D O I
10.1186/1471-2164-10-589
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Background: Ribonucleotide reductases (RNRs) catalyse the only known de novo pathway for deoxyribonucleotide synthesis, and are therefore essential to DNA-based life. While ribonucleotide reduction has a single evolutionary origin, significant differences between RNRs nevertheless exist, notably in cofactor requirements, subunit composition and allosteric regulation. These differences result in distinct operational constraints (anaerobicity, iron/oxygen dependence and cobalamin dependence), and form the basis for the classification of RNRs into three classes. Description: In RNRdb (Ribonucleotide Reductase database), we have collated and curated all known RNR protein sequences with the aim of providing a resource for exploration of RNR diversity and distribution. By comparing expert manual annotations with annotations stored in Genbank, we find that significant inaccuracies exist in larger databases. To our surprise, only 23% of protein sequences included in RNRdb are correctly annotated across the key attributes of class, role and function, with 17% being incorrectly annotated across all three categories. This illustrates the utility of specialist databases for applications where a high degree of annotation accuracy may be important. The database houses information on annotation, distribution and diversity of RNRs, and links to solved RNR structures, and can be searched through a BLAST interface. RNRdb is accessible through a public web interface at http://rnrdb.molbio.su.se. Conclusion: RNRdb is a specialist database that provides a reliable annotation and classification resource for RNR proteins, as well as a tool to explore distribution patterns of RNR classes. The recent expansion in available genome sequence data have provided us with a picture of RNR distribution that is more complex than believed only a few years ago; our database indicates that RNRs of all three classes are found across all three cellular domains. Moreover, we find a number of organisms that encode all three classes.
引用
收藏
页数:8
相关论文
共 38 条
[1]   THE BIOLOGY OF GIARDIA SPP [J].
ADAM, RD .
MICROBIOLOGICAL REVIEWS, 1991, 55 (04) :706-732
[2]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[3]  
Aravind L, 2000, J MOL MICROB BIOTECH, V2, P191
[4]  
BAUM KF, 1989, J BIOL CHEM, V264, P21087
[5]   Predicting functions from protein sequences - where are the bottlenecks? [J].
Bork, P ;
Koonin, EV .
NATURE GENETICS, 1998, 18 (04) :313-318
[6]  
BoursauxEude C, 1997, FEMS MICROBIOL LETT, V151, P257, DOI 10.1111/j.1574-6968.1997.tb12579.x
[7]   REPRESENTATION OF THE SECONDARY AND TERTIARY STRUCTURE OF GROUP-I INTRONS [J].
CECH, TR ;
DAMBERGER, SH ;
GUTELL, RR .
NATURE STRUCTURAL BIOLOGY, 1994, 1 (05) :273-280
[8]   NrdI, a flavodoxin involved in maintenance of the diferric-tyrosyl radical cofactor in Escherichia coli class Ib ribonucleotide reductase [J].
Cotruvo, Joseph A., Jr. ;
Stubbe, JoAnne .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2008, 105 (38) :14383-14388
[9]   Profile hidden Markov models [J].
Eddy, SR .
BIOINFORMATICS, 1998, 14 (09) :755-763
[10]   The Pfam protein families database [J].
Finn, Robert D. ;
Tate, John ;
Mistry, Jaina ;
Coggill, Penny C. ;
Sammut, Stephen John ;
Hotz, Hans-Rudolf ;
Ceric, Goran ;
Forslund, Kristoffer ;
Eddy, Sean R. ;
Sonnhammer, Erik L. L. ;
Bateman, Alex .
NUCLEIC ACIDS RESEARCH, 2008, 36 :D281-D288