iRNA(m6A)-PseDNC: Identifying N6-methyladenosine sites using pseudo dinucleotide composition

被引:151
作者
Chen, Wei [1 ,2 ,4 ]
Ding, Hui [3 ]
Zhou, Xu [1 ]
Lin, Hao [3 ,4 ]
Chou, Kuo-Chen [3 ,4 ]
机构
[1] North China Univ Sci & Technol, Ctr Genom & Computat Biol, Sch Sci, Tangshan 063000, Peoples R China
[2] Chengdu Univ Tradit Chinese Med, Innovat Inst Chinese Med & Pharm, Chengdu 611730, Sichuan, Peoples R China
[3] Univ Elect Sci & Technol China, Key Lab Neuroinformat, Ctr Informat Biol, Minist Educ,Sch Life Sci & Technol, Chengdu 610054, Sichuan, Peoples R China
[4] Gordon Life Sci Inst, Boston, MA 02478 USA
关键词
N-6-methyladenosine; Pseudo nucleotide composition; RNA modification; Support vector machine; 5-step rules; AMINO-ACID-COMPOSITION; SEQUENCE-BASED PREDICTOR; LYSINE SUCCINYLATION SITES; PROTEIN-STRUCTURE CLASSES; ALIGNMENT-FREE METHOD; CHOUS GENERAL PSEAAC; 3 DIFFERENT MODES; RECOMBINATION SPOTS; K-TUPLE; SUBCELLULAR-LOCALIZATION;
D O I
10.1016/j.ab.2018.09.002
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
As a prevalent post-transcriptional modification, N-6-methyladenosine (m(6)A) plays key roles in a series of biological processes. Although experimental technologies have been developed and applied to identify m(6)A sites, they are still cost-ineffective for transcriptome-wide detections of m(6)A. As good complements to the experimental techniques, some computational methods have been proposed to identify m(6)A sites. However, their performance remains unsatisfactory. In this study, we firstly proposed an Euclidean distance based method to construct a high quality benchmark dataset. By encoding the RNA sequences using pseudo nucleotide composition, a new predictor called iRNA(m6A)-PseDNC was developed to identify m(6)A sites in the Saccharomyces cerevisiae genome. It has been demonstrated by the 10-fold cross validation test that the performance of iRNA(m6A)-PseDNC is superior to the existing methods. Meanwhile, for the convenience of most experimental scientists, established at the site http://lin-group.cn/server/iRNA(m6A)-PseDNC.php is its web-server, by which users can easily get their desired results without need to go through the detailed mathematics. It is anticipated that iRNA(m6A)-PseDNC will become a useful high throughput tool for identifying m(6)A sites in the S. cerevisiae genome.
引用
收藏
页码:59 / 65
页数:7
相关论文
共 174 条