A HIDDEN MARKOV MODEL THAT FINDS GENES IN ESCHERICHIA-COLI DNA

Cited by: 174
Authors
KROGH, A
MIAN, IS
HAUSSLER, D
Affiliations
[1] UNIV CALIF SANTA CRUZ, SANTA CRUZ, CA 95064 USA
[2] UNIV CALIF SANTA CRUZ, SINSHEIMER LABS, SANTA CRUZ, CA 95064 USA
[3] NORDITA, DK-2100 COPENHAGEN, DENMARK
Keywords
DOI
10.1093/nar/22.22.4768
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification code
071010 ; 081704 ;
Abstract
A hidden Markov model (HMM) has been developed to find protein-coding genes in E. coli DNA, using E. coli genomic DNA sequence from the EcoSeq6 database maintained by Kenn Rudd. The HMM includes states that model the codons and their frequencies in E. coli genes, as well as the patterns found in the intergenic region, including repetitive extragenic palindromic sequences and the Shine-Dalgarno motif. To account for potential sequencing errors and/or frameshifts in raw genomic DNA sequence, it allows for the (very unlikely) possibility of insertions and deletions of individual nucleotides within a codon. The parameters of the HMM are estimated using approximately one million nucleotides of annotated DNA in EcoSeq6, and the model is tested on a disjoint set of contigs containing about 325,000 nucleotides. The HMM finds the exact locations of about 80% of the known E. coli genes, and approximate locations for about 10%. It also finds several potentially new genes, and locates several places where insertion or deletion errors and/or frameshifts may be present in the contigs.
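The general idea of labelling DNA with an HMM can be illustrated with a much smaller model than the one the abstract describes. The sketch below is not the authors' model: it assumes a hypothetical two-state HMM ("intergenic" and "coding") that emits single nucleotides, with start, transition, and emission probabilities invented purely for illustration. It decodes the most probable state path for a non-empty sequence with the standard Viterbi algorithm, computed in log space. Codon states, the Shine-Dalgarno motif, and insertion/deletion states are all omitted.

```python
import math

STATES = ("intergenic", "coding")

# Hypothetical parameters, invented for illustration (not taken from the paper).
START = {"intergenic": 0.9, "coding": 0.1}
TRANS = {
    "intergenic": {"intergenic": 0.9, "coding": 0.1},
    "coding": {"intergenic": 0.1, "coding": 0.9},
}
EMIT = {
    "intergenic": {"A": 0.4, "C": 0.1, "G": 0.1, "T": 0.4},
    "coding": {"A": 0.1, "C": 0.4, "G": 0.4, "T": 0.1},
}


def viterbi(seq):
    """Return the most probable state path for a non-empty DNA string."""
    # score[s]: best log-probability of any path that ends in state s
    score = {s: math.log(START[s]) + math.log(EMIT[s][seq[0]]) for s in STATES}
    backpointers = []
    for symbol in seq[1:]:
        new_score, pointers = {}, {}
        for s in STATES:
            # Best predecessor state for arriving in s at this position
            prev = max(STATES, key=lambda p: score[p] + math.log(TRANS[p][s]))
            new_score[s] = (
                score[prev] + math.log(TRANS[prev][s]) + math.log(EMIT[s][symbol])
            )
            pointers[s] = prev
        score = new_score
        backpointers.append(pointers)
    # Trace back from the best final state to recover the full path
    state = max(STATES, key=lambda s: score[s])
    path = [state]
    for pointers in reversed(backpointers):
        state = pointers[state]
        path.append(state)
    path.reverse()
    return path
```

Calling `viterbi("TTTTTTGCGCGCGCTTTTTT")` returns one state label per nucleotide. A gene finder of the kind described in the abstract would read predicted gene boundaries off the positions where the decoded path changes state.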
Pages: 4768 - 4778
Page count: 11
Related papers
50 in total
  • [1] Finding genes in DNA with a Hidden Markov Model
    Henderson, J
    Salzberg, S
    Fasman, KH
    JOURNAL OF COMPUTATIONAL BIOLOGY, 1997, 4 (02) : 127 - 141
  • [2] CLONING OF STREPTOMYCES DNA INTO ESCHERICHIA-COLI - ABSENCE OF HETEROSPECIFIC GENE-EXPRESSION OF STREPTOMYCES GENES IN ESCHERICHIA-COLI
    HORINOUCHI, S
    UOZUMI, T
    BEPPU, T
    AGRICULTURAL AND BIOLOGICAL CHEMISTRY, 1980, 44 (02) : 367 - 381
  • [3] CLONING OF ESCHERICHIA-COLI DNA-REPAIR GENES ON PLASMIDS
    RUPP, WD
    SANCAR, A
    KENNEDY, WJ
    AYERS, J
    GRISWOLD, J
    JOURNAL OF SUPRAMOLECULAR STRUCTURE, 1978, : 51 - 51
  • [4] ISOLATION OF ESCHERICHIA-COLI GENES FOR THE REPAIR OF ALKYLATION DAMAGE IN DNA
    MARGISON, GP
    COOPER, DP
    BRENNAND, J
    PROCEEDINGS OF THE AMERICAN ASSOCIATION FOR CANCER RESEARCH, 1985, 26 (MAR) : 100 - 100
  • [5] EUKARYOTE GENES IN ESCHERICHIA-COLI
    SHERRATT, D
    NATURE, 1975, 255 (5509) : 523 - 524
  • [6] RIBOSOMAL GENES IN ESCHERICHIA-COLI
    LINDAHL, L
    ZENGEL, JM
    ANNUAL REVIEW OF GENETICS, 1986, 20 : 297 - 326
  • [7] Modeling and predicting transcriptional units of Escherichia coli genes using hidden Markov models
    Yada, T
    Nakao, M
    Totoki, Y
    Nakai, K
    BIOINFORMATICS, 1999, 15 (12) : 987 - 993
  • [8] REGULATION OF THE GENES FOR ESCHERICHIA-COLI DNA GYRASE - HOMEOSTATIC CONTROL OF DNA SUPERCOILING
    MENZEL, R
    GELLERT, M
    CELL, 1983, 34 (01) : 105 - 113
  • [9] REPLICATION OF ESCHERICHIA-COLI DNA
    TESSLER, PM
    SALIVAR, WO
    LOOS, JL
    ARCHIV FUR MIKROBIOLOGIE, 1972, 84 (02) : 161 - &
  • [10] DNA HELICASES OF ESCHERICHIA-COLI
    MATSON, SW
    PROGRESS IN NUCLEIC ACID RESEARCH AND MOLECULAR BIOLOGY, 1991, 40 : 289 - 326