AUGUSTUS: a web server for gene finding in eukaryotes

Cited by: 915
Authors
Stanke, M
Steinkamp, R
Waack, S
Morgenstern, B
Affiliations
[1] Univ Gottingen, Inst Mikrobiol & Genet, D-37077 Gottingen, Germany
[2] Univ Gottingen, Inst Numer & Angew Math, D-37083 Gottingen, Germany
DOI
10.1093/nar/gkh379
Chinese Library Classification
Q5 [Biochemistry]; Q7 [Molecular Biology]
Subject classification codes
071010; 081704
Abstract
We present a web server for AUGUSTUS, a novel software program for ab initio gene prediction in eukaryotic genomic sequences. Our method is based on a generalized Hidden Markov Model with a new method for modeling the intron length distribution. This method approximates the true intron length distribution more accurately than existing programs do. For genomic sequence data from human and Drosophila melanogaster, the accuracy of AUGUSTUS is superior to that of existing gene-finding approaches. The advantage of our program is most apparent for larger input sequences containing more than one gene. The server is available at http://augustus.gobics.de.
Pages: W309-W312
Page count: 4