iRNA(m6A)-PseDNC: Identifying N6-methyladenosine sites using pseudo dinucleotide composition

Cited by: 155
Authors
Chen, Wei [1 ,2 ,4 ]
Ding, Hui [3 ]
Zhou, Xu [1 ]
Lin, Hao [3 ,4 ]
Chou, Kuo-Chen [3 ,4 ]
Affiliations
[1] North China Univ Sci & Technol, Ctr Genom & Computat Biol, Sch Sci, Tangshan 063000, Peoples R China
[2] Chengdu Univ Tradit Chinese Med, Innovat Inst Chinese Med & Pharm, Chengdu 611730, Sichuan, Peoples R China
[3] Univ Elect Sci & Technol China, Key Lab Neuroinformat, Ctr Informat Biol, Minist Educ,Sch Life Sci & Technol, Chengdu 610054, Sichuan, Peoples R China
[4] Gordon Life Sci Inst, Boston, MA 02478 USA
Keywords
N-6-methyladenosine; Pseudo nucleotide composition; RNA modification; Support vector machine; 5-step rules; AMINO-ACID-COMPOSITION; SEQUENCE-BASED PREDICTOR; LYSINE SUCCINYLATION SITES; PROTEIN-STRUCTURE CLASSES; ALIGNMENT-FREE METHOD; CHOUS GENERAL PSEAAC; 3 DIFFERENT MODES; RECOMBINATION SPOTS; K-TUPLE; SUBCELLULAR-LOCALIZATION;
DOI
10.1016/j.ab.2018.09.002
Chinese Library Classification number
Q5 [Biochemistry];
Discipline classification codes
071010 ; 081704 ;
Abstract
As a prevalent post-transcriptional modification, N6-methyladenosine (m6A) plays key roles in a series of biological processes. Although experimental technologies have been developed and applied to identify m6A sites, they are still not cost-effective for transcriptome-wide detection of m6A. As complements to the experimental techniques, some computational methods have been proposed to identify m6A sites. However, their performance remains unsatisfactory. In this study, we first proposed a Euclidean distance-based method to construct a high-quality benchmark dataset. By encoding the RNA sequences using pseudo nucleotide composition, we developed a new predictor called iRNA(m6A)-PseDNC to identify m6A sites in the Saccharomyces cerevisiae genome. In a 10-fold cross-validation test, iRNA(m6A)-PseDNC outperformed the existing methods. For the convenience of most experimental scientists, a web server has been established at http://lin-group.cn/server/iRNA(m6A)-PseDNC.php, through which users can easily obtain their desired results without needing to go through the detailed mathematics. It is anticipated that iRNA(m6A)-PseDNC will become a useful high-throughput tool for identifying m6A sites in the S. cerevisiae genome.
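The abstract names two ingredients, a nucleotide-composition encoding and a Euclidean distance between samples, without giving formulas. The following is a minimal Python sketch of those two ingredients only, not the authors' implementation. It computes the plain 16-dimensional dinucleotide composition; the "pseudo" correlation terms, which depend on physicochemical property tables not given in this record, are omitted. The function names `dinucleotide_composition` and `euclidean_distance` are illustrative assumptions.

```python
from itertools import product
from math import sqrt

NUCLEOTIDES = "ACGU"
# The 16 RNA dinucleotides in a fixed order: AA, AC, ..., UU.
DINUCLEOTIDES = ["".join(p) for p in product(NUCLEOTIDES, repeat=2)]


def dinucleotide_composition(seq):
    """Return the 16 normalized dinucleotide frequencies of an RNA sequence.

    Assumption: this is the plain composition only; the pseudo components
    of PseDNC (physicochemical correlation terms) are not included.
    """
    seq = seq.upper().replace("T", "U")  # accept DNA-style input as well
    counts = dict.fromkeys(DINUCLEOTIDES, 0)
    total = 0
    for i in range(len(seq) - 1):
        pair = seq[i:i + 2]
        if pair in counts:  # skip pairs containing ambiguous symbols such as N
            counts[pair] += 1
            total += 1
    if total == 0:
        return [0.0] * len(DINUCLEOTIDES)
    return [counts[d] / total for d in DINUCLEOTIDES]


def euclidean_distance(u, v):
    """Euclidean distance between two equal-length feature vectors."""
    return sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))
```

A distance of this kind could, for example, be used to compare candidate samples when assembling a dataset; how the authors apply it to build their benchmark is not stated in the abstract.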
Pages: 59-65
Page count: 7