iRNA(m6A)-PseDNC: Identifying N6-methyladenosine sites using pseudo dinucleotide composition

被引:151
作者
Chen, Wei [1 ,2 ,4 ]
Ding, Hui [3 ]
Zhou, Xu [1 ]
Lin, Hao [3 ,4 ]
Chou, Kuo-Chen [3 ,4 ]
机构
[1] North China Univ Sci & Technol, Ctr Genom & Computat Biol, Sch Sci, Tangshan 063000, Peoples R China
[2] Chengdu Univ Tradit Chinese Med, Innovat Inst Chinese Med & Pharm, Chengdu 611730, Sichuan, Peoples R China
[3] Univ Elect Sci & Technol China, Key Lab Neuroinformat, Ctr Informat Biol, Minist Educ,Sch Life Sci & Technol, Chengdu 610054, Sichuan, Peoples R China
[4] Gordon Life Sci Inst, Boston, MA 02478 USA
关键词
N-6-methyladenosine; Pseudo nucleotide composition; RNA modification; Support vector machine; 5-step rules; AMINO-ACID-COMPOSITION; SEQUENCE-BASED PREDICTOR; LYSINE SUCCINYLATION SITES; PROTEIN-STRUCTURE CLASSES; ALIGNMENT-FREE METHOD; CHOUS GENERAL PSEAAC; 3 DIFFERENT MODES; RECOMBINATION SPOTS; K-TUPLE; SUBCELLULAR-LOCALIZATION;
D O I
10.1016/j.ab.2018.09.002
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
As a prevalent post-transcriptional modification, N-6-methyladenosine (m(6)A) plays key roles in a series of biological processes. Although experimental technologies have been developed and applied to identify m(6)A sites, they are still cost-ineffective for transcriptome-wide detections of m(6)A. As good complements to the experimental techniques, some computational methods have been proposed to identify m(6)A sites. However, their performance remains unsatisfactory. In this study, we firstly proposed an Euclidean distance based method to construct a high quality benchmark dataset. By encoding the RNA sequences using pseudo nucleotide composition, a new predictor called iRNA(m6A)-PseDNC was developed to identify m(6)A sites in the Saccharomyces cerevisiae genome. It has been demonstrated by the 10-fold cross validation test that the performance of iRNA(m6A)-PseDNC is superior to the existing methods. Meanwhile, for the convenience of most experimental scientists, established at the site http://lin-group.cn/server/iRNA(m6A)-PseDNC.php is its web-server, by which users can easily get their desired results without need to go through the detailed mathematics. It is anticipated that iRNA(m6A)-PseDNC will become a useful high throughput tool for identifying m(6)A sites in the S. cerevisiae genome.
引用
收藏
页码:59 / 65
页数:7
相关论文
共 174 条
  • [91] Suppression of RNA recognition by Toll-like receptors:: The impact of nucleoside modification and the evolutionary origin of RNA
    Karikó, K
    Buckstein, M
    Ni, HP
    Weissman, D
    [J]. IMMUNITY, 2005, 23 (02) : 165 - 175
  • [92] iPhosT-PseAAC: Identify phosphothreonine sites by incorporating sequence statistical moments into PseAAC
    Khan, Yaser Daanial
    Rasool, Nouman
    Hussain, Waqar
    Khan, Sher Afzal
    Chou, Kuo-Chen
    [J]. ANALYTICAL BIOCHEMISTRY, 2018, 550 : 109 - 116
  • [93] Predicting Antibacterial Peptides by the Concept of Chou's Pseudo-amino Acid Composition and Machine Learning Methods
    Khosravian, Maede
    Faramarzi, Fateme Kazemi
    Beigi, Majid Mohammad
    Behbahani, Mandana
    Mohabatkar, Hassan
    [J]. PROTEIN AND PEPTIDE LETTERS, 2013, 20 (02) : 180 - 186
  • [94] Using Chou's general PseAAC to analyze the evolutionary relationship of receptor associated proteins (RAP) with various folding patterns of protein domains
    Krishnan, S. Muthu
    [J]. JOURNAL OF THEORETICAL BIOLOGY, 2018, 445 : 62 - 74
  • [95] Prediction of β-lactamase and its class by Chou's pseudo-amino acid composition and support vector machine
    Kumar, Ravindra
    Srivastava, Abhishikha
    Kumari, Bandana
    Kumar, Manish
    [J]. JOURNAL OF THEORETICAL BIOLOGY, 2015, 365 : 96 - 103
  • [96] Lahbabi N, 2017, IEEE ICC
  • [97] Quokka: a comprehensive tool for rapid and accurate prediction of kinase family-specific phosphorylation sites in the human proteome
    Li, Fuyi
    Li, Chen
    Marquez-Lago, Tatiana T.
    Leier, Andre
    Akutsu, Tatsuya
    Purcell, Anthony W.
    Smith, A. Ian
    Lithgow, Trevor
    Daly, Roger J.
    Song, Jiangning
    Chou, Kuo-Chen
    [J]. BIOINFORMATICS, 2018, 34 (24) : 4223 - 4231
  • [98] Identify Gram-negative bacterial secreted protein types by incorporating different modes of PSSM into Chou's general PseAAC via Kullback-Leibler divergence
    Liang, Yunyun
    Zhang, Shengli
    [J]. JOURNAL OF THEORETICAL BIOLOGY, 2018, 454 : 22 - 29
  • [99] iPro54-PseKNC: a sequence-based predictor for identifying sigma-54 promoters in prokaryote with pseudo k-tuple nucleotide composition
    Lin, Hao
    Deng, En-Ze
    Ding, Hui
    Chen, Wei
    Chou, Kuo-Chen
    [J]. NUCLEIC ACIDS RESEARCH, 2014, 42 (21) : 12961 - 12972
  • [100] Linder B, 2015, NAT METHODS, V12, P767, DOI [10.1038/NMETH.3453, 10.1038/nmeth.3453]