Detecting overlapping coding sequences in virus genomes

被引:72
|
作者
Firth, AE [1 ]
Brown, CM [1 ]
机构
[1] Univ Otago, Dept Biochem, Dunedin, New Zealand
关键词
D O I
10.1186/1471-2105-7-75
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Detecting new coding sequences (CDSs) in viral genomes can be difficult for several reasons. The typically compact genomes often contain a number of overlapping coding and non-coding functional elements, which can result in unusual patterns of codon usage; conservation between related sequences can be difficult to interpret-especially within overlapping genes; and viruses often employ non-canonical translational mechanisms-e.g. frameshifting, stop codon readthrough, leaky-scanning and internal ribosome entry sites-which can conceal potentially coding open reading frames (ORFs). Results: In a previous paper we introduced a new statistic-MLOGD (Maximum Likelihood Overlapping Gene Detector)-for detecting and analysing overlapping CDSs. Here we present (a) an improved MLOGD statistic, (b) a greatly extended suite of software using MLOGD, (c) a database of results for 640 virus sequence alignments, and (d) a web-interface to the software and database. Tests show that, from an alignment with just 20 mutations, MLOGD can discriminate non-overlapping CDSs from non-coding ORFs with a typical accuracy of up to 98%, and can detect CDSs overlapping known CDSs with a typical accuracy of 90%. In addition, the software produces a variety of statistics and graphics, useful for analysing an input multiple sequence alignment. Conclusion: MLOGD is an easy-to-use tool for virus genome annotation, detecting new CDSs in particular overlapping or short CDSs-and for analysing overlapping CDSs following frameshift sites. The software, web-server, database and supplementary material are available at http://guinevere.otago.ac.nz/mlogd.html.
引用
收藏
页数:6
相关论文
共 50 条
  • [21] Seforta, an integrated tool for detecting the signature of selection in coding sequences
    Camiolo S.
    Melito S.
    Milia G.
    Porceddu A.
    BMC Research Notes, 7 (1)
  • [22] A Coding Theoretic Model for Error-detecting in DNA Sequences
    Debata, Prajna Paramita
    Mishra, Debahuti
    Shaw, Kailash
    Mishra, Sashikala
    INTERNATIONAL CONFERENCE ON MODELLING OPTIMIZATION AND COMPUTING, 2012, 38 : 1773 - 1777
  • [23] A PROBABILISTIC MODEL FOR DETECTING CODING REGIONS IN DNA-SEQUENCES
    THOMAS, A
    SKOLNICK, MH
    IMA JOURNAL OF MATHEMATICS APPLIED IN MEDICINE AND BIOLOGY, 1994, 11 (03): : 149 - 160
  • [25] Sequences of Zika Virus Genomes from a Pediatric Cohort in Nicaragua
    Oldfield, Lauren M.
    Fedorova, Nadia
    Puri, Vinita
    Shrivastava, Susmita
    Amedeo, Paolo
    Durbin, Alan
    Rocchi, Iara
    Williams, Torrey
    Shabman, Reed S.
    Tan, Gene S.
    Balmaseda, Angel
    Kuan, Guillermina
    Saborio, Saira
    Gordon, Aubree
    Harris, Eva
    Pickett, Brett E.
    GENOME ANNOUNCEMENTS, 2018, 6 (24)
  • [26] Correlations between coding and contiguous non-coding sequences in isochore families from vertebrate genomes
    Costantini, Maria
    Bernardi, Giorgio
    GENE, 2008, 410 (02) : 241 - 248
  • [27] Detection of Signature Sequences in Overlapping Genes and Prediction of a Novel Overlapping Gene in Hepatitis G Virus
    Angelo Pavesi
    Journal of Molecular Evolution, 2000, 50 : 284 - 295
  • [28] PanCoreGen - Profiling, detecting, annotating protein-coding genes in microbial genomes
    Paul, Sandip
    Bhardwaj, Archana
    Bag, Sumit K.
    Sokurenko, Evgeni V.
    Chattopadhyay, Sujay
    GENOMICS, 2015, 106 (06) : 367 - 372
  • [29] Overlapping genes in vertebrate genomes
    Makalowska, I
    Lin, CF
    Makalowski, W
    COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2005, 29 (01) : 1 - 12
  • [30] Reticuloendotheliosis virus sequences within the genomes of field strains of fowlpox virus display variability
    Singh, P
    Schnitzlein, WA
    Tripathy, DN
    JOURNAL OF VIROLOGY, 2003, 77 (10) : 5855 - 5862