Detecting overlapping coding sequences in virus genomes

被引:72
|
作者
Firth, AE [1 ]
Brown, CM [1 ]
机构
[1] Univ Otago, Dept Biochem, Dunedin, New Zealand
关键词
D O I
10.1186/1471-2105-7-75
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Detecting new coding sequences (CDSs) in viral genomes can be difficult for several reasons. The typically compact genomes often contain a number of overlapping coding and non-coding functional elements, which can result in unusual patterns of codon usage; conservation between related sequences can be difficult to interpret-especially within overlapping genes; and viruses often employ non-canonical translational mechanisms-e.g. frameshifting, stop codon readthrough, leaky-scanning and internal ribosome entry sites-which can conceal potentially coding open reading frames (ORFs). Results: In a previous paper we introduced a new statistic-MLOGD (Maximum Likelihood Overlapping Gene Detector)-for detecting and analysing overlapping CDSs. Here we present (a) an improved MLOGD statistic, (b) a greatly extended suite of software using MLOGD, (c) a database of results for 640 virus sequence alignments, and (d) a web-interface to the software and database. Tests show that, from an alignment with just 20 mutations, MLOGD can discriminate non-overlapping CDSs from non-coding ORFs with a typical accuracy of up to 98%, and can detect CDSs overlapping known CDSs with a typical accuracy of 90%. In addition, the software produces a variety of statistics and graphics, useful for analysing an input multiple sequence alignment. Conclusion: MLOGD is an easy-to-use tool for virus genome annotation, detecting new CDSs in particular overlapping or short CDSs-and for analysing overlapping CDSs following frameshift sites. The software, web-server, database and supplementary material are available at http://guinevere.otago.ac.nz/mlogd.html.
引用
收藏
页数:6
相关论文
共 50 条
  • [41] Overlapping genes in natural and engineered genomes
    Bradley W. Wright
    Mark P. Molloy
    Paul R. Jaschke
    Nature Reviews Genetics, 2022, 23 : 154 - 168
  • [42] Overlapping genes in bacterial and phage genomes
    Scherbakov, DV
    Garber, MB
    MOLECULAR BIOLOGY, 2000, 34 (04) : 485 - 495
  • [43] Overlapping genes in the human and mouse genomes
    Sanna, Chaitanya R.
    Li, Wen-Hsiung
    Zhang, Liqing
    BMC GENOMICS, 2008, 9 (1)
  • [44] Overlapping genes in natural and engineered genomes
    Wright, Bradley W.
    Molloy, Mark P.
    Jaschke, Paul R.
    NATURE REVIEWS GENETICS, 2022, 23 (03) : 154 - 168
  • [45] On dynamics of overlapping genes in bacterial genomes
    Fukuda, Y
    Nakayama, Y
    Tomita, M
    GENE, 2003, 323 : 181 - 187
  • [46] Overlapping genes in the human and mouse genomes
    Chaitanya R Sanna
    Wen-Hsiung Li
    Liqing Zhang
    BMC Genomics, 9
  • [47] Overlapping genes in bacterial and phage genomes
    D. V. Scherbakov
    M. B. Garber
    Molecular Biology, 2000, 34 : 485 - 495
  • [48] Occurrence, function and evolutionary origins of '2A-like' sequences in virus genomes
    Luke, Garry A.
    de Felipe, Pablo
    Lukashev, Alexander
    Kallioinen, Susanna E.
    Bruno, Elizabeth A.
    Ryan, Martin D.
    JOURNAL OF GENERAL VIROLOGY, 2008, 89 : 1036 - 1042
  • [49] Systematic identification and characterization of repeat sequences in African swine fever virus genomes
    Zhu, Zhaozhong
    Ge, Shengqiang
    Cai, Zena
    Wu, Yifan
    Lu, Congyu
    Zhang, Zheng
    Fu, Ping
    Mao, Longfei
    Wu, Xiaodong
    Peng, Yousong
    VETERINARY RESEARCH, 2022, 53 (01) : 101
  • [50] Abundance of G-Quadruplex Forming Sequences in the Hepatitis Delta Virus Genomes
    Brazda, Vaclav
    Valkova, Natalia
    Dobrovolna, Michaela
    Mergny, Jean-Louis
    ACS OMEGA, 2024, 9 (03): : 4096 - 4101