GenDB -: an open source genome annotation system for prokaryote genomes

被引:570
作者
Meyer, F [1 ]
Goesmann, A
McHardy, AC
Bartels, D
Bekel, T
Clausen, J
Kalinowski, J
Linke, B
Rupp, O
Giegerich, R
Pühler, A
机构
[1] Univ Bielefeld, Ctr Gene Res, Tech Fak, D-4800 Bielefeld, Germany
[2] Univ Bielefeld, Dept Biol, D-4800 Bielefeld, Germany
关键词
D O I
10.1093/nar/gkg312
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The flood of sequence data resulting from the large number of current genome projects has increased the need for a flexible, open source genome annotation system, which so far has not existed. To account for the individual needs of different projects, such a system should be modular and easily extensible. We present a genome annotation system for prokaryote genomes, which is well tested and readily adaptable to different tasks. The modular system was developed using an object-oriented approach, and it relies on a relational database backend. Using a well defined application programmers interface (API), the system can be linked easily to other systems. GenDB supports manual as well as automatic annotation strategies. The software currently is in use in more than a dozen microbial genome annotation projects. In addition to its use as a production genome annotation system, it can be employed as a flexible framework for the large-scale evaluation of different annotation strategies. The system is open source.
引用
收藏
页码:2187 / 2195
页数:9
相关论文
共 20 条
  • [1] Gapped BLAST and PSI-BLAST: a new generation of protein database search programs
    Altschul, SF
    Madden, TL
    Schaffer, AA
    Zhang, JH
    Zhang, Z
    Miller, W
    Lipman, DJ
    [J]. NUCLEIC ACIDS RESEARCH, 1997, 25 (17) : 3389 - 3402
  • [2] Automated genome sequence analysis and annotation
    Andrade, MA
    Brown, NP
    Leroy, C
    Hoersch, S
    de Daruvar, A
    Reich, C
    Franchini, A
    Tamames, J
    Valencia, A
    Ouzounis, C
    Sander, C
    [J]. BIOINFORMATICS, 1999, 15 (05) : 391 - 412
  • [3] The InterPro database, an integrated documentation resource for protein families, domains and functional sites
    Apweiler, R
    Attwood, TK
    Bairoch, A
    Bateman, A
    Birney, E
    Biswas, M
    Bucher, P
    Cerutti, T
    Corpet, F
    Croning, MDR
    Durbin, R
    Falquet, L
    Fleischmann, W
    Gouzy, J
    Hermjakob, H
    Hulo, N
    Jonassen, I
    Kahn, D
    Kanapin, A
    Karavidopoulou, Y
    Lopez, R
    Marx, B
    Mulder, NJ
    Oinn, TM
    Pagni, M
    Servant, F
    Sigrist, CJA
    Zdobnov, EM
    [J]. NUCLEIC ACIDS RESEARCH, 2001, 29 (01) : 37 - 40
  • [4] CRITICA: Coding region identification tool invoking comparative analysis
    Badger, JH
    Olsen, GJ
    [J]. MOLECULAR BIOLOGY AND EVOLUTION, 1999, 16 (04) : 512 - 524
  • [5] Improved microbial gene identification with GLIMMER
    Delcher, AL
    Harmon, D
    Kasif, S
    White, O
    Salzberg, SL
    [J]. NUCLEIC ACIDS RESEARCH, 1999, 27 (23) : 4636 - 4641
  • [6] Profile hidden Markov models
    Eddy, SR
    [J]. BIOINFORMATICS, 1998, 14 (09) : 755 - 763
  • [7] ETZOLD T, 1993, COMPUT APPL BIOSCI, V9, P49
  • [8] Functional and structural genomics using PEDANT
    Frishman, D
    Albermann, K
    Hani, J
    Heumann, K
    Metanomski, A
    Zollner, A
    Mewes, HW
    [J]. BIOINFORMATICS, 2001, 17 (01) : 44 - 57
  • [9] MAGPIE: Automated genome interpretation
    Gaasterland, T
    Sensen, CW
    [J]. TRENDS IN GENETICS, 1996, 12 (02) : 76 - 78
  • [10] The composite genome of the legume symbiont Sinorhizobium meliloti
    Galibert, F
    Finan, TM
    Long, SR
    Pühler, A
    Abola, P
    Ampe, F
    Barloy-Hubler, F
    Barnett, MJ
    Becker, A
    Boistard, P
    Bothe, G
    Boutry, M
    Bowser, L
    Buhrmester, J
    Cadieu, E
    Capela, D
    Chain, P
    Cowie, A
    Davis, RW
    Dréano, S
    Federspiel, NA
    Fisher, RF
    Gloux, S
    Godrie, T
    Goffeau, A
    Golding, B
    Gouzy, J
    Gurjal, M
    Hernandez-Lucas, I
    Hong, A
    Huizar, L
    Hyman, RW
    Jones, T
    Kahn, D
    Kahn, ML
    Kalman, S
    Keating, DH
    Kiss, E
    Komp, C
    Lalaure, V
    Masuy, D
    Palm, C
    Peck, MC
    Pohl, TM
    Portetelle, D
    Purnelle, B
    Ramsperger, U
    Surzycki, R
    Thébault, P
    Vandenbol, M
    [J]. SCIENCE, 2001, 293 (5530) : 668 - 672