Evaluating the annotation of protein-coding genes in bacterial genomes: Chloroflexus aurantiacus strain J-10-fl and Natrinema sp J7-2 as case studies

被引:2
作者
Zhang, H. X. [1 ]
Li, S. J. [1 ]
Zhou, H. Q. [1 ]
机构
[1] Univ Elect Sci & Technol China, Sch Life Sci & Technol, Key Lab Neuroinformat, Minist Educ, Chengdu 610054, Peoples R China
关键词
Gene density; Genome re-annotation; Missed genes; Over-annotated genes; AEROPYRUM-PERNIX K1; RE-ANNOTATION; GRAPHICAL REPRESENTATION; CODON USAGE; SEQUENCE; RECOGNITION; PREDICTION; PATTERNS; SYSTEM; ZCURVE;
D O I
10.4238/2014.December.19.10
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Gene annotation plays a key role in subsequent biochemical and molecular biological studies of various organisms. There are some errors in the original annotation of sequenced genomes because of the lack of sufficient data, and these errors may propagate into other genomes. Therefore, genome annotation must be checked from time to time to evaluate newly accumulated data. In this study, we evaluated the gene density of 2606 bacteria or archaea, and identified 2 with extreme values, the minimum value (Chloroflexus aurantiacus strain J-10-fl) and maximum value (Natrinema sp J7-2), to conduct genome re-annotation. In the genome of C. aurantiacus strain J-10-fl, we identified 17 new genes with definite functions and eliminated 34 non-coding open-reading frames; in the genome of Natrinema sp J7-2, we eliminated 118 non-coding open reading frames. Our re-annotation procedure may provide a reference for improving the annotation of other bacterial genomes.
引用
收藏
页码:10891 / 10897
页数:7
相关论文
共 20 条
[1]   Re-annotation of genome microbial CoDing-Sequences:: finding new genes and inaccurately annotated genes -: art. no. 5 [J].
Bocs, S ;
Danchin, A ;
Médigue, C .
BMC BIOINFORMATICS, 2002, 3 (1)
[2]   Re-annotation of the genome sequence of Mycobacterium tuberculosis H37Rv [J].
Camus, JC ;
Pryor, MJ ;
Médigue, C ;
Cole, ST .
MICROBIOLOGY-SGM, 2002, 148 :2967-2973
[3]   Seven GC-rich microbial genomes adopt similar codon usage patterns regardless of their phylogenetic lineages [J].
Chen, LL ;
Zhang, CT .
BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS, 2003, 306 (01) :310-317
[4]   Gene Re-annotation in Genome of the Extremophile Pyrobaculum Aerophilum by Using Bioinformatics Methods [J].
Du, Meng-Ze ;
Guo, Feng-Biao ;
Chen, Yue-Yun .
JOURNAL OF BIOMOLECULAR STRUCTURE & DYNAMICS, 2011, 29 (02) :391-401
[5]   The Complete Genome Sequence of Natrinema sp J7-2, a Haloarchaeon Capable of Growth on Synthetic Media without Amino Acid Supplements [J].
Feng, Jie ;
Liu, Bin ;
Zhang, Ziqian ;
Ren, Yan ;
Li, Yang ;
Gan, Fei ;
Huang, Yuping ;
Chen, Xiangdong ;
Shen, Ping ;
Wang, Lei ;
Tang, Bing ;
Tang, Xiao-Feng .
PLOS ONE, 2012, 7 (07)
[6]   Re-annotation and re-analysis of the Campylobacter jejuni NCTC11168 genome sequence [J].
Gundogdu, Ozan ;
Bentley, Stephen D. ;
Holden, Matt T. ;
Parkhill, Julian ;
Dorrell, Nick ;
Wren, Brendan W. .
BMC GENOMICS, 2007, 8
[7]  
Guo FB, 2006, BMC BIOINFORMATICS, V7, DOI 10.1186/1471-2105-7-9
[8]   Gene recognition based on nucleotide distribution of ORFs in a hyper-thermophilic crenarchaeon, Aeropyrum pernix K1 [J].
Guo, FB ;
Wang, J ;
Zhang, CT .
DNA RESEARCH, 2004, 11 (06) :361-370
[9]   ZCURVE: a new system for recognizing protein-coding genes in bacterial and archaeal genomes [J].
Guo, FB ;
Ou, HY ;
Zhang, CT .
NUCLEIC ACIDS RESEARCH, 2003, 31 (06) :1780-1789
[10]   Re-prediction of protein-coding genes in the genome of Amsacta moorei entomopoxvirus [J].
Guo, Feng-Biao ;
Yub, Xiu-Juan .
JOURNAL OF VIROLOGICAL METHODS, 2007, 146 (1-2) :389-392