iMethylK-PseAAC: Improving Accuracy of Lysine Methylation Sites Identification by Incorporating Statistical Moments and Position Relative Features into General PseAAC via Chou's 5-steps Rule

被引:33
作者
Ilyas, Sarah [1 ]
Hussain, Waqar [1 ]
Ashraf, Adeel [1 ]
Khan, Yaser Daanial [1 ]
Khan, Sher Afzal [2 ,4 ]
Chou, Kuo-Chen [3 ]
机构
[1] Univ Management & Technol, Sch Syst & Technol, Dept Comp Sci, POB 10033,C-2, Lahore 54770, Pakistan
[2] Fac Comp & Informat Technol Rabigh, Jeddah 21577, Saudi Arabia
[3] Gordon Life Sci Inst, Boston, MA 02478 USA
[4] Abdul Wali Khan Univ, Dept Comp Sci, Mardan, Pakistan
关键词
Methylation; lysine methylation; PseAAC; statistical moments; 5-steps rule; prediction; PREDICT SUBCELLULAR-LOCALIZATION; IDENTIFY RECOMBINATION SPOTS; LABEL LEARNING CLASSIFIER; SEQUENCE-BASED PREDICTOR; CRITICAL SPHERICAL-SHELL; S-NITROSYLATION SITES; AMINO-ACID PAIRS; N-6-METHYLADENOSINE SITES; SUCCINYLATION SITES; HISTONE METHYLATION;
D O I
10.2174/1389202920666190809095206
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Background: Methylation is one of the most important post-translational modifications in the human body which usually arises on lysine among the most intensely modified residues. It performs a dynamic role in numerous biological procedures, such as regulation of gene expression, regulation of protein function and RNA processing. Therefore, to identity lysine methylation sites is an important challenge as some experimental procedures are time-consuming. Objective: Herein, we propose a computational predictor named iMethylK-PseAAC to identify lysine methylation sites. Methods: Firstly, we constructed feature vectors based on PseAAC using position and composition relative features and statistical moments. A neural network is trained based on the extracted features. The performance of the proposed method is then validated using cross-validation and jackknife testing. Results: The objective evaluation of the predictor showed accuracy of 96.7% for self-consistency, 91.61 A, for 10-fold cross-validation and 93.42% for jackknife testing. Conclusion: It is concluded that iMethylK-PseAAC outperforms the counterparts to identify lysine methylation sites such as iMethyl-PseACC, BPB-PPMS and PMeS,
引用
收藏
页码:275 / 292
页数:18
相关论文
共 172 条
[21]   iRNA(m6A)-PseDNC: Identifying N6-methyladenosine sites using pseudo dinucleotide composition [J].
Chen, Wei ;
Ding, Hui ;
Zhou, Xu ;
Lin, Hao ;
Chou, Kuo-Chen .
ANALYTICAL BIOCHEMISTRY, 2018, 561 :59-65
[22]   iRNA-3typeA: Identifying Three Types of Modification at RNA's Adenosine Sites [J].
Chen, Wei ;
Feng, Pengmian ;
Yang, Hui ;
Ding, Hui ;
Lin, Hao ;
Chou, Kuo-Chen .
MOLECULAR THERAPY-NUCLEIC ACIDS, 2018, 11 :468-474
[23]   iDNA4mC: identifying DNA N4-methylcytosine sites based on nucleotide chemical properties [J].
Chen, Wei ;
Yang, Hui ;
Feng, Pengmian ;
Ding, Hui ;
Lin, Hao .
BIOINFORMATICS, 2017, 33 (22) :3518-3523
[24]   iRNA-AI: identifying the adenosine to inosine editing sites in RNA sequences [J].
Chen, Wei ;
Feng, Pengmian ;
Yang, Hui ;
Ding, Hui ;
Lin, Hao ;
Chou, Kuo-Chen .
ONCOTARGET, 2017, 8 (03) :4208-4217
[25]   Detecting N6-methyladenosine sites from RNA transcriptomes using ensemble Support Vector Machines [J].
Chen, Wei ;
Xing, Pengwei ;
Zou, Quan .
SCIENTIFIC REPORTS, 2017, 7
[26]   iRNA-PseU: Identifying RNA pseudouridine sites [J].
Chen, Wei ;
Tang, Hua ;
Ye, Jing ;
Lin, Hao ;
Chou, Kuo-Chen .
MOLECULAR THERAPY-NUCLEIC ACIDS, 2016, 5 :e332
[27]   IACP: a sequence-based tool for identifying anticancer peptides [J].
Chen, Wei ;
Ding, Hui ;
Feng, Pengmian ;
Lin, Hao ;
Chou, Kuo-Chen .
ONCOTARGET, 2016, 7 (13) :16895-16909
[28]   Using deformation energy to analyze nucleosome positioning in genomes [J].
Chen, Wei ;
Feng, Pengmian ;
Ding, Hui ;
Lin, Hao ;
Chou, Kuo-Chen .
GENOMICS, 2016, 107 (2-3) :69-75
[29]   iRNA-Methyl: Identifying N6-methyladenosine sites using pseudo nucleotide composition [J].
Chen, Wei ;
Feng, Pengmian ;
Ding, Hui ;
Lin, Hao ;
Chou, Kuo-Chen .
ANALYTICAL BIOCHEMISTRY, 2015, 490 :26-33
[30]   iTIS-PseTNC: A sequence-based predictor for identifying translation initiation site in human genes using pseudo trinucleotide composition [J].
Chen, Wei ;
Feng, Peng-Mian ;
Deng, En-Ze ;
Lin, Hao ;
Chou, Kuo-Chen .
ANALYTICAL BIOCHEMISTRY, 2014, 462 :76-83