Identification of coding and non-coding sequences using local Holder exponent formalism

被引:19
|
作者
Kulkarni, OC [1 ]
Vigneshwar, R [1 ]
Jayaraman, VK [1 ]
Kulkarni, BD [1 ]
机构
[1] Natl Chem Lab, Pune 411008, Maharashtra, India
关键词
D O I
10.1093/bioinformatics/bti639
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Accurate prediction of genes in genomes has always been a challenging task for bioinformaticians and computational biologists. The discovery of existence of distinct scaling relations in coding and non-coding sequences has led to new perspectives in the understanding of the DNA sequences. This has motivated us to exploit the differences in the local singularity distributions for characterization and classification of coding and non-coding sequences. Results: The local singularity density distribution in the coding and non-coding sequences of four genomes was first estimated using the wavelet transform modulus maxima methodology. Support vector machines classifier was then trained with the extracted features. The trained classifier is able to provide an average test accuracy of 97.7%. The local singularity features in a DNA sequence can be exploited for successful identification of coding and non-coding sequences.
引用
收藏
页码:3818 / 3823
页数:6
相关论文
共 50 条
  • [1] Local Symmetry of Non-Coding Genetic Sequences
    Radavicius, Marijus
    Rekasius, Tomas
    Zidanaviciute, Jurgita
    INFORMATICA, 2019, 30 (03) : 553 - 571
  • [2] Scaling properties of coding and non-coding DNA sequences
    Provata, A
    Almirantis, Y
    PHYSICA A, 1997, 247 (1-4): : 482 - 496
  • [3] Levy statistics in coding and non-coding nucleotide sequences
    Scafetta, N
    Latora, V
    Grigolini, P
    PHYSICS LETTERS A, 2002, 299 (5-6) : 565 - 570
  • [4] Homology in coding and non-coding DNA sequences: a parsimony perspective
    Ochoterena, Helga
    PLANT SYSTEMATICS AND EVOLUTION, 2009, 282 (3-4) : 151 - 168
  • [5] Universal Features for the Classification of Coding and Non-coding DNA Sequences
    Carels, Nicolas
    Vidal, Ramon
    Fras, Diego
    BIOINFORMATICS AND BIOLOGY INSIGHTS, 2009, 3 : 37 - 49
  • [6] Homology in coding and non-coding DNA sequences: a parsimony perspective
    Helga Ochoterena
    Plant Systematics and Evolution, 2009, 282 : 151 - 168
  • [7] BASiNET-BiologicAl Sequences NETwork: a case study on coding and non-coding RNAs identification
    Ito, Eric Augusto
    Katahira, Isaque
    da Rocha Vicente, Fabio Fernandes
    Protasio Pereira, Luiz Filipe
    Lopes, Fabricio Martins
    NUCLEIC ACIDS RESEARCH, 2018, 46 (16)
  • [8] REPETITIVE SEQUENCES AND NON-CODING DNA
    WALKER, PMB
    HEREDITY, 1978, 41 (DEC) : 414 - 414
  • [9] Identification of non-coding RNA
    Gaspin, C
    Thébault, P
    BIOFUTUR, 2005, (251) : 33 - 36
  • [10] Identification of non-coding and coding RNAs in porcine endometrium
    Wang, Yueying
    Hu, Tao
    Wu, Lihang
    Liu, Xiaoran
    Xue, Songyi
    Lei, Minggang
    GENOMICS, 2017, 109 (01) : 43 - 50