iMethylK-PseAAC: Improving Accuracy of Lysine Methylation Sites Identification by Incorporating Statistical Moments and Position Relative Features into General PseAAC via Chou's 5-steps Rule

Cited by: 33
Authors
Ilyas, Sarah [1]
Hussain, Waqar [1]
Ashraf, Adeel [1]
Khan, Yaser Daanial [1]
Khan, Sher Afzal [2,4]
Chou, Kuo-Chen [3]
Affiliations
[1] Univ Management & Technol, Sch Syst & Technol, Dept Comp Sci, POB 10033,C-2, Lahore 54770, Pakistan
[2] Fac Comp & Informat Technol Rabigh, Jeddah 21577, Saudi Arabia
[3] Gordon Life Sci Inst, Boston, MA 02478 USA
[4] Abdul Wali Khan Univ, Dept Comp Sci, Mardan, Pakistan
Keywords
Methylation; lysine methylation; PseAAC; statistical moments; 5-steps rule; prediction; PREDICT SUBCELLULAR-LOCALIZATION; IDENTIFY RECOMBINATION SPOTS; LABEL LEARNING CLASSIFIER; SEQUENCE-BASED PREDICTOR; CRITICAL SPHERICAL-SHELL; S-NITROSYLATION SITES; AMINO-ACID PAIRS; N-6-METHYLADENOSINE SITES; SUCCINYLATION SITES; HISTONE METHYLATION;
DOI
10.2174/1389202920666190809095206
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification code
071010; 081704;
Abstract
Background: Methylation is one of the most important post-translational modifications in the human body, and it usually occurs on lysine, one of the most heavily modified residues. It plays a dynamic role in numerous biological processes, such as regulation of gene expression, regulation of protein function, and RNA processing. Identifying lysine methylation sites is therefore an important challenge, because some experimental procedures are time-consuming. Objective: Herein, we propose a computational predictor named iMethylK-PseAAC to identify lysine methylation sites. Methods: First, we constructed feature vectors based on PseAAC using position- and composition-relative features and statistical moments. A neural network was then trained on the extracted features. The performance of the proposed method was validated using cross-validation and jackknife testing. Results: The objective evaluation of the predictor showed an accuracy of 96.7% for self-consistency, 91.61% for 10-fold cross-validation, and 93.42% for jackknife testing. Conclusion: It is concluded that iMethylK-PseAAC outperforms its counterparts for identifying lysine methylation sites, such as iMethyl-PseAAC, BPB-PPMS, and PMeS.
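As a rough illustration of the "statistical moments" idea mentioned in the Methods, the following minimal sketch computes raw and central moments of a numerically encoded peptide window. The residue-to-integer encoding, the function names, and the choice of moment orders are assumptions made for this example only; the abstract does not specify how the paper encodes sequences or which moments it uses.

```python
from typing import List

# Hypothetical encoding (an assumption, not taken from the paper):
# each residue maps to its 1-based position in this alphabet.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def encode(peptide: str) -> List[int]:
    """Map each residue to an integer code; raises ValueError on unknown letters."""
    return [AMINO_ACIDS.index(residue) + 1 for residue in peptide]


def raw_moment(values: List[int], order: int) -> float:
    """Raw moment of the given order: the mean of v**order."""
    return sum(v ** order for v in values) / len(values)


def central_moment(values: List[int], order: int) -> float:
    """Central moment of the given order: the mean of (v - mean)**order."""
    mean = raw_moment(values, 1)
    return sum((v - mean) ** order for v in values) / len(values)


def moment_features(peptide: str, max_order: int = 3) -> List[float]:
    """Concatenate raw moments (orders 1..max_order) and
    central moments (orders 2..max_order) into one feature list."""
    values = encode(peptide)
    raw = [raw_moment(values, k) for k in range(1, max_order + 1)]
    central = [central_moment(values, k) for k in range(2, max_order + 1)]
    return raw + central
```

A feature list like this could then be fed to any classifier; the neural network and the validation protocol described in the abstract are not reproduced here.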
Pages: 275-292
Number of pages: 18