iMethylK-PseAAC: Improving Accuracy of Lysine Methylation Sites Identification by Incorporating Statistical Moments and Position Relative Features into General PseAAC via Chou's 5-steps Rule

被引:33
作者
Ilyas, Sarah [1 ]
Hussain, Waqar [1 ]
Ashraf, Adeel [1 ]
Khan, Yaser Daanial [1 ]
Khan, Sher Afzal [2 ,4 ]
Chou, Kuo-Chen [3 ]
机构
[1] Univ Management & Technol, Sch Syst & Technol, Dept Comp Sci, POB 10033,C-2, Lahore 54770, Pakistan
[2] Fac Comp & Informat Technol Rabigh, Jeddah 21577, Saudi Arabia
[3] Gordon Life Sci Inst, Boston, MA 02478 USA
[4] Abdul Wali Khan Univ, Dept Comp Sci, Mardan, Pakistan
关键词
Methylation; lysine methylation; PseAAC; statistical moments; 5-steps rule; prediction; PREDICT SUBCELLULAR-LOCALIZATION; IDENTIFY RECOMBINATION SPOTS; LABEL LEARNING CLASSIFIER; SEQUENCE-BASED PREDICTOR; CRITICAL SPHERICAL-SHELL; S-NITROSYLATION SITES; AMINO-ACID PAIRS; N-6-METHYLADENOSINE SITES; SUCCINYLATION SITES; HISTONE METHYLATION;
D O I
10.2174/1389202920666190809095206
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Background: Methylation is one of the most important post-translational modifications in the human body which usually arises on lysine among the most intensely modified residues. It performs a dynamic role in numerous biological procedures, such as regulation of gene expression, regulation of protein function and RNA processing. Therefore, to identity lysine methylation sites is an important challenge as some experimental procedures are time-consuming. Objective: Herein, we propose a computational predictor named iMethylK-PseAAC to identify lysine methylation sites. Methods: Firstly, we constructed feature vectors based on PseAAC using position and composition relative features and statistical moments. A neural network is trained based on the extracted features. The performance of the proposed method is then validated using cross-validation and jackknife testing. Results: The objective evaluation of the predictor showed accuracy of 96.7% for self-consistency, 91.61 A, for 10-fold cross-validation and 93.42% for jackknife testing. Conclusion: It is concluded that iMethylK-PseAAC outperforms the counterparts to identify lysine methylation sites such as iMethyl-PseACC, BPB-PPMS and PMeS,
引用
收藏
页码:275 / 292
页数:18
相关论文
共 172 条
[81]   SPalmitoylC-PseAAC: A sequence-based model developed via Chou's 5-steps rule and general PseAAC for identifying S-palmitoylation sites in proteins [J].
Hussain, Waqar ;
Khan, Yaser Daanial ;
Rasool, Nouman ;
Khan, Sher Afzal ;
Chou, Kuo-Chen .
ANALYTICAL BIOCHEMISTRY, 2019, 568 :14-23
[82]   SPrenylC-PseAAC: A sequence-based model developed via Chou's 5-steps rule and general PseAAC for identifying S-prenylation sites in proteins [J].
Hussain, Waqpar ;
Khan, Yaser Daanial ;
Rasool, Nouman ;
Khan, Sher Afzal ;
Chou, Kuo-Chen .
JOURNAL OF THEORETICAL BIOLOGY, 2019, 468 :1-11
[83]   Predicting subcellular localization of multi-label proteins by incorporating the sequence features into Chou's PseAAC [J].
Javed, Faisal ;
Hayat, Maqsood .
GENOMICS, 2019, 111 (06) :1325-1332
[84]   Prediction of Protein S-Nitrosylation Sites Based on Adapted Normal Distribution Bi-Profile Bayes and Chou's Pseudo Amino Acid Composition [J].
Jia, Cangzhi ;
Lin, Xin ;
Wang, Zhiping .
INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2014, 15 (06) :10410-10423
[85]   iPPI-PseAAC(CGR): Identify protein-protein interactions by incorporating chaos game representation into PseAAC [J].
Jia, Jianhua ;
Li, Xiaoyan ;
Qiu, Wangren ;
Xiao, Xuan ;
Chou, Kuo-Chen .
JOURNAL OF THEORETICAL BIOLOGY, 2019, 460 :195-203
[86]   pSumo-CD: predicting sumoylation sites in proteins with covariance discriminant algorithm by incorporating sequence-coupled effects into general PseAAC [J].
Jia, Jianhua ;
Zhang, Liuxia ;
Liu, Zi ;
Xiao, Xuan ;
Chou, Kuo-Chen .
BIOINFORMATICS, 2016, 32 (20) :3133-3141
[87]   iCar-PseCp: identify carbonylation sites in proteins by Monto Carlo sampling and incorporating sequence coupled effects into general PseAAC [J].
Jia, Jianhua ;
Liu, Zi ;
Xiao, Xuan ;
Liu, Bingxiang ;
Chou, Kuo-Chen .
ONCOTARGET, 2016, 7 (23) :34558-34570
[88]   pSuc-Lys: Predict lysine succinylation sites in proteins with PseAAC and ensemble random forest approach [J].
Jia, Jianhua ;
Liu, Zi ;
Xiao, Xuan ;
Liu, Bingxiang ;
Chou, Kuo-Chen .
JOURNAL OF THEORETICAL BIOLOGY, 2016, 394 :223-230
[89]   iSuc-PseOpt: Identifying lysine succinylation sites in proteins by incorporating sequence-coupling effects into pseudo components and optimizing imbalanced training dataset [J].
Jia, Jianhua ;
Liu, Zi ;
Xiao, Xuan ;
Liu, Bingxiang ;
Chou, Kuo-Chen .
ANALYTICAL BIOCHEMISTRY, 2016, 497 :48-56
[90]   Identification of protein-protein binding sites by incorporating the physicochemical properties and stationary wavelet transforms into pseudo amino acid composition [J].
Jia, Jianhua ;
Liu, Zi ;
Xiao, Xuan ;
Liu, Bingxiang ;
Chou, Kuo-Chen .
JOURNAL OF BIOMOLECULAR STRUCTURE & DYNAMICS, 2016, 34 (09) :1946-1961