iMethylK-PseAAC: Improving Accuracy of Lysine Methylation Sites Identification by Incorporating Statistical Moments and Position Relative Features into General PseAAC via Chou's 5-steps Rule

被引:33
作者
Ilyas, Sarah [1 ]
Hussain, Waqar [1 ]
Ashraf, Adeel [1 ]
Khan, Yaser Daanial [1 ]
Khan, Sher Afzal [2 ,4 ]
Chou, Kuo-Chen [3 ]
机构
[1] Univ Management & Technol, Sch Syst & Technol, Dept Comp Sci, POB 10033,C-2, Lahore 54770, Pakistan
[2] Fac Comp & Informat Technol Rabigh, Jeddah 21577, Saudi Arabia
[3] Gordon Life Sci Inst, Boston, MA 02478 USA
[4] Abdul Wali Khan Univ, Dept Comp Sci, Mardan, Pakistan
关键词
Methylation; lysine methylation; PseAAC; statistical moments; 5-steps rule; prediction; PREDICT SUBCELLULAR-LOCALIZATION; IDENTIFY RECOMBINATION SPOTS; LABEL LEARNING CLASSIFIER; SEQUENCE-BASED PREDICTOR; CRITICAL SPHERICAL-SHELL; S-NITROSYLATION SITES; AMINO-ACID PAIRS; N-6-METHYLADENOSINE SITES; SUCCINYLATION SITES; HISTONE METHYLATION;
D O I
10.2174/1389202920666190809095206
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Background: Methylation is one of the most important post-translational modifications in the human body which usually arises on lysine among the most intensely modified residues. It performs a dynamic role in numerous biological procedures, such as regulation of gene expression, regulation of protein function and RNA processing. Therefore, to identity lysine methylation sites is an important challenge as some experimental procedures are time-consuming. Objective: Herein, we propose a computational predictor named iMethylK-PseAAC to identify lysine methylation sites. Methods: Firstly, we constructed feature vectors based on PseAAC using position and composition relative features and statistical moments. A neural network is trained based on the extracted features. The performance of the proposed method is then validated using cross-validation and jackknife testing. Results: The objective evaluation of the predictor showed accuracy of 96.7% for self-consistency, 91.61 A, for 10-fold cross-validation and 93.42% for jackknife testing. Conclusion: It is concluded that iMethylK-PseAAC outperforms the counterparts to identify lysine methylation sites such as iMethyl-PseACC, BPB-PPMS and PMeS,
引用
收藏
页码:275 / 292
页数:18
相关论文
共 172 条
[1]   iMethyl-STTNC: Identification of N6-methyladenosine sites by extending the idea of SAAC into Chou's PseAAC to formulate RNA sequences [J].
Akbar, Shahid ;
Hayat, Maqsood .
JOURNAL OF THEORETICAL BIOLOGY, 2018, 455 :205-211
[2]   Prediction of N-linked glycosylation sites using position relative features and statistical moments [J].
Akmal, Muhammad Aizaz ;
Rasool, Nouman ;
Khan, Yaser Daanial .
PLOS ONE, 2017, 12 (08)
[3]  
ALTHAUS IW, 1993, J BIOL CHEM, V268, P14875
[4]  
ALTHAUS IW, 1993, J BIOL CHEM, V268, P6119
[5]   KINETIC-STUDIES WITH THE NONNUCLEOSIDE HUMAN-IMMUNODEFICIENCY-VIRUS TYPE-1 REVERSE-TRANSCRIPTASE INHIBITOR U-90152E [J].
ALTHAUS, IW ;
CHOU, JJ ;
GONZALES, AJ ;
DEIBEL, MR ;
CHOU, KC ;
KEZDY, FJ ;
ROMERO, DL ;
THOMAS, RC ;
ARISTOFF, PA ;
TARPLEY, WG ;
REUSSER, F .
BIOCHEMICAL PHARMACOLOGY, 1994, 47 (11) :2017-2028
[6]  
[Anonymous], BRIEF BIOINFORM
[7]  
[Anonymous], J PHYS CHEM
[8]  
[Anonymous], BIOMED RES INT
[9]  
[Anonymous], BIOINFORMATICS
[10]  
[Anonymous], BIOINFORMATICS