iRNA(m6A)-PseDNC: Identifying N6-methyladenosine sites using pseudo dinucleotide composition

Cited by: 151
Authors
Chen, Wei [1, 2, 4]
Ding, Hui [3]
Zhou, Xu [1]
Lin, Hao [3, 4]
Chou, Kuo-Chen [3, 4]
Affiliations
[1] North China Univ Sci & Technol, Ctr Genom & Computat Biol, Sch Sci, Tangshan 063000, Peoples R China
[2] Chengdu Univ Tradit Chinese Med, Innovat Inst Chinese Med & Pharm, Chengdu 611730, Sichuan, Peoples R China
[3] Univ Elect Sci & Technol China, Key Lab Neuroinformat, Ctr Informat Biol, Minist Educ,Sch Life Sci & Technol, Chengdu 610054, Sichuan, Peoples R China
[4] Gordon Life Sci Inst, Boston, MA 02478 USA
Keywords
N-6-methyladenosine; Pseudo nucleotide composition; RNA modification; Support vector machine; 5-step rules; AMINO-ACID-COMPOSITION; SEQUENCE-BASED PREDICTOR; LYSINE SUCCINYLATION SITES; PROTEIN-STRUCTURE CLASSES; ALIGNMENT-FREE METHOD; CHOUS GENERAL PSEAAC; 3 DIFFERENT MODES; RECOMBINATION SPOTS; K-TUPLE; SUBCELLULAR-LOCALIZATION;
DOI
10.1016/j.ab.2018.09.002
Chinese Library Classification (CLC) number
Q5 [Biochemistry];
Discipline classification code
071010 ; 081704 ;
Abstract
As a prevalent post-transcriptional modification, N6-methyladenosine (m6A) plays key roles in a range of biological processes. Although experimental technologies have been developed and applied to identify m6A sites, they remain too costly for transcriptome-wide detection of m6A. As a complement to the experimental techniques, several computational methods have been proposed to identify m6A sites, but their performance remains unsatisfactory. In this study, we first proposed a Euclidean distance-based method to construct a high-quality benchmark dataset. By encoding the RNA sequences using pseudo nucleotide composition, a new predictor called iRNA(m6A)-PseDNC was developed to identify m6A sites in the Saccharomyces cerevisiae genome. The 10-fold cross-validation test demonstrated that the performance of iRNA(m6A)-PseDNC is superior to that of the existing methods. For the convenience of most experimental scientists, a web server for the predictor has been established at http://lin-group.cn/server/iRNA(m6A)-PseDNC.php, through which users can easily obtain their desired results without needing to go through the detailed mathematics. It is anticipated that iRNA(m6A)-PseDNC will become a useful high-throughput tool for identifying m6A sites in the S. cerevisiae genome.
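The abstract names pseudo dinucleotide composition as the sequence encoding but gives no details. As a rough illustration only, the sketch below follows the general PseDNC form: 16 normalized dinucleotide frequencies followed by `lam` sequence-order correlation factors scaled by `weight`. The function name `pse_dnc`, the values of `lam` and `weight`, and the two toy properties (GC fraction and purine fraction of each dinucleotide) are assumptions made for this example; they are not the physicochemical properties or parameter settings used by the authors.

```python
from itertools import product

NUCLEOTIDES = "ACGU"
DINUCLEOTIDES = ["".join(p) for p in product(NUCLEOTIDES, repeat=2)]

# ASSUMPTION: toy stand-in properties, computed from the letters themselves.
# A real predictor would use standardized physicochemical property tables.
TOY_PROPERTIES = {
    "gc_fraction": {d: (d.count("G") + d.count("C")) / 2 for d in DINUCLEOTIDES},
    "purine_fraction": {d: (d.count("A") + d.count("G")) / 2 for d in DINUCLEOTIDES},
}


def pse_dnc(seq, lam=2, weight=0.5, properties=TOY_PROPERTIES):
    """Return a (16 + lam)-dimensional PseDNC-style feature vector.

    lam and weight are hypothetical defaults chosen for illustration.
    """
    seq = seq.upper().replace("T", "U")
    if any(ch not in NUCLEOTIDES for ch in seq):
        raise ValueError("sequence may contain only A, C, G, U (or T)")

    # Overlapping dinucleotides along the sequence.
    dinucs = [seq[i:i + 2] for i in range(len(seq) - 1)]
    if lam >= len(dinucs):
        raise ValueError("lam must be smaller than the number of dinucleotides")

    # Local composition: normalized occurrence frequency of each dinucleotide.
    freqs = [dinucs.count(d) / len(dinucs) for d in DINUCLEOTIDES]

    # Sequence-order information: for each gap j, average the mean squared
    # property difference between dinucleotides that are j positions apart.
    props = list(properties.values())
    thetas = []
    for j in range(1, lam + 1):
        pairs = list(zip(dinucs, dinucs[j:]))
        total = sum(
            sum((p[a] - p[b]) ** 2 for p in props) / len(props)
            for a, b in pairs
        )
        thetas.append(total / len(pairs))

    # Joint normalization so that all components sum to 1.
    denom = sum(freqs) + weight * sum(thetas)
    return [f / denom for f in freqs] + [weight * t / denom for t in thetas]


if __name__ == "__main__":
    features = pse_dnc("GGACUAGGACUUCAGGACA", lam=2, weight=0.5)
    print(len(features))
```

A vector of this kind could then be passed to a support vector machine, the classifier named in the keywords, and assessed with 10-fold cross-validation.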
Pages: 59-65
Number of pages: 7
Related papers
174 in total
  • [131] iRNAm5C-PseDNC: identifying RNA 5-methylcytosine sites by incorporating physical-chemical properties into pseudo dinucleotide composition
    Qiu, Wang-Ren
    Jiang, Shi-Yu
    Xu, Zhao-Chun
    Xiao, Xuan
    Chou, Kuo-Chen
    [J]. ONCOTARGET, 2017, 8 (25) : 41178 - 41188
  • [132] iPhos-PseEn: Identifying phosphorylation sites in proteins by fusing different pseudo components into an ensemble classifier
    Qiu, Wang-Ren
    Xiao, Xuan
    Xu, Zhao-Chun
    Chou, Kuo-Chen
    [J]. ONCOTARGET, 2016, 7 (32) : 51270 - 51283
  • [133] iHyd-PseCp: Identify hydroxyproline and hydroxylysine in proteins by incorporating sequence-coupled effects into general PseAAC
    Qiu, Wang-Ren
    Sun, Bi-Qian
    Xiao, Xuan
    Xu, Zhao-Chun
    Chou, Kuo-Chen
    [J]. ONCOTARGET, 2016, 7 (28) : 44310 - 44321
  • [134] iPTM-mLys: identifying multiple lysine PTM sites and their different types
    Qiu, Wang-Ren
    Sun, Bi-Qian
    Xiao, Xuan
    Xu, Zhao-Chun
    Chou, Kuo-Chen
    [J]. BIOINFORMATICS, 2016, 32 (20) : 3116 - 3123
  • [135] iUbiq-Lys: prediction of lysine ubiquitination sites in proteins by extracting sequence evolution information via a gray system model
    Qiu, Wang-Ren
    Xiao, Xuan
    Lin, Wei-Zhong
    Chou, Kuo-Chen
    [J]. JOURNAL OF BIOMOLECULAR STRUCTURE & DYNAMICS, 2015, 33 (08) : 1731 - 1742
  • [136] iMethyl-PseAAC: Identification of Protein Methylation Sites via a Pseudo Amino Acid Composition Approach
    Qiu, Wang-Ren
    Xiao, Xuan
    Lin, Wei-Zhong
    Chou, Kuo-Chen
    [J]. BIOMED RESEARCH INTERNATIONAL, 2014, 2014
  • [137] iRSpot-TNCPseAAC: Identify Recombination Spots with Trinucleotide Composition and Pseudo Amino Acid Components
    Qiu, Wang-Ren
    Xiao, Xuan
    Chou, Kuo-Chen
    [J]. INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2014, 15 (02) : 1746 - 1766
  • [138] OOgenesis_Pred: A sequence-based method for predicting oogenesis proteins by six different modes of Chou's pseudo amino acid composition
    Rahimi, Maryam
    Bakhtiarizadeh, Mohammad Reza
    Mohammadi-Sangcheshmeh, Abdollah
    [J]. JOURNAL OF THEORETICAL BIOLOGY, 2017, 414 : 128 - 136
  • [139] DPP-PseAAC: A DNA-binding protein prediction model using Chou's general PseAAC
    Rahman, M. Saifur
    Shatabda, Swakkhar
    Saha, Sanjay
    Kaykobad, M.
    Rahman, M. Sohel
    [J]. JOURNAL OF THEORETICAL BIOLOGY, 2018, 452 : 22 - 34
  • [140] A novel feature representation method based on Chou's pseudo amino acid composition for protein structural class prediction
    Sahu, Sitanshu Sekhar
    Panda, Ganapati
    [J]. COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2010, 34 (5-6) : 320 - 327