iRNA(m6A)-PseDNC: Identifying N6-methyladenosine sites using pseudo dinucleotide composition

被引:151
作者
Chen, Wei [1 ,2 ,4 ]
Ding, Hui [3 ]
Zhou, Xu [1 ]
Lin, Hao [3 ,4 ]
Chou, Kuo-Chen [3 ,4 ]
机构
[1] North China Univ Sci & Technol, Ctr Genom & Computat Biol, Sch Sci, Tangshan 063000, Peoples R China
[2] Chengdu Univ Tradit Chinese Med, Innovat Inst Chinese Med & Pharm, Chengdu 611730, Sichuan, Peoples R China
[3] Univ Elect Sci & Technol China, Key Lab Neuroinformat, Ctr Informat Biol, Minist Educ,Sch Life Sci & Technol, Chengdu 610054, Sichuan, Peoples R China
[4] Gordon Life Sci Inst, Boston, MA 02478 USA
关键词
N-6-methyladenosine; Pseudo nucleotide composition; RNA modification; Support vector machine; 5-step rules; AMINO-ACID-COMPOSITION; SEQUENCE-BASED PREDICTOR; LYSINE SUCCINYLATION SITES; PROTEIN-STRUCTURE CLASSES; ALIGNMENT-FREE METHOD; CHOUS GENERAL PSEAAC; 3 DIFFERENT MODES; RECOMBINATION SPOTS; K-TUPLE; SUBCELLULAR-LOCALIZATION;
D O I
10.1016/j.ab.2018.09.002
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
As a prevalent post-transcriptional modification, N-6-methyladenosine (m(6)A) plays key roles in a series of biological processes. Although experimental technologies have been developed and applied to identify m(6)A sites, they are still cost-ineffective for transcriptome-wide detections of m(6)A. As good complements to the experimental techniques, some computational methods have been proposed to identify m(6)A sites. However, their performance remains unsatisfactory. In this study, we firstly proposed an Euclidean distance based method to construct a high quality benchmark dataset. By encoding the RNA sequences using pseudo nucleotide composition, a new predictor called iRNA(m6A)-PseDNC was developed to identify m(6)A sites in the Saccharomyces cerevisiae genome. It has been demonstrated by the 10-fold cross validation test that the performance of iRNA(m6A)-PseDNC is superior to the existing methods. Meanwhile, for the convenience of most experimental scientists, established at the site http://lin-group.cn/server/iRNA(m6A)-PseDNC.php is its web-server, by which users can easily get their desired results without need to go through the detailed mathematics. It is anticipated that iRNA(m6A)-PseDNC will become a useful high throughput tool for identifying m(6)A sites in the S. cerevisiae genome.
引用
收藏
页码:59 / 65
页数:7
相关论文
共 174 条
  • [41] pLoc-mAnimal: predict subcellular localization of animal proteins with both single and multiple sites
    Cheng, Xiang
    Zhao, Shu-Guang
    Lin, Wei-Zhong
    Xiao, Xuan
    Chou, Kuo-Chen
    [J]. BIOINFORMATICS, 2017, 33 (22) : 3524 - 3531
  • [42] iATC-mHyb: a hybrid multi-label classifier for predicting the classification of anatomical therapeutic chemicals
    Cheng, Xiang
    Zhao, Shu-Guang
    Xiao, Xuan
    Chou, Kuo-Chen
    [J]. ONCOTARGET, 2017, 8 (35) : 58494 - 58503
  • [43] pLoc-mPlant: predict subcellular localization of multi-location plant proteins by incorporating the optimal GO information into general PseAAC
    Cheng, Xiang
    Xiao, Xuan
    Chou, Kuo-Chen
    [J]. MOLECULAR BIOSYSTEMS, 2017, 13 (09) : 1722 - 1727
  • [44] iATC-mISF: a multi-label classifier for predicting the classes of anatomical therapeutic chemicals (vlo 33, pg 341, 2017)
    Cheng, Xiang
    Zhao, Shu-Guang
    Xiao, Xuan
    Chou, Kuo-Chen
    [J]. BIOINFORMATICS, 2017, 33 (16) : 2610 - 2610
  • [45] iATC-mISF: a multi-label classifier for predicting the classes of anatomical therapeutic chemicals
    Cheng, Xiang
    Zhao, Shu-Guang
    Xiao, Xuan
    Chou, Kuo-Chen
    [J]. BIOINFORMATICS, 2017, 33 (03) : 341 - 346
  • [46] GRAPHICAL RULES FOR ENZYME-CATALYZED RATE LAWS
    CHOU, KC
    FORSEN, S
    [J]. BIOCHEMICAL JOURNAL, 1980, 187 (03) : 829 - 835
  • [47] Using amphiphilic pseudo amino acid composition to predict enzyme subfamily classes
    Chou, KC
    [J]. BIOINFORMATICS, 2005, 21 (01) : 10 - 19
  • [48] Predicting protein quaternary structure by pseudo amino acid composition
    Chou, KC
    Cai, YD
    [J]. PROTEINS-STRUCTURE FUNCTION AND GENETICS, 2003, 53 (02): : 282 - 289
  • [49] A new hybrid approach to predict subcellular localization of proteins by incorporating gene ontology
    Chou, KC
    Cai, YD
    [J]. BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS, 2003, 311 (03) : 743 - 747
  • [50] Prediction of signal peptides using scaled window
    Chou, KC
    [J]. PEPTIDES, 2001, 22 (12) : 1973 - 1979