Accurately Predicting Glutarylation Sites Using Sequential Bi-Peptide-Based Evolutionary Features

被引:17
作者
Arafat, Md Easin [1 ]
Ahmad, Md Wakil [1 ]
Shovan, S. M. [2 ]
Dehzangi, Abdollah [3 ,4 ]
Dipta, Shubhashis Roy [1 ]
Hasan, Md Al Mehedi [2 ]
Taherzadeh, Ghazaleh [5 ]
Shatabda, Swakkhar [1 ]
Sharma, Alok [6 ,7 ,8 ,9 ]
机构
[1] United Int Univ, Dept Comp Sci & Engn, Dhaka 1212, Bangladesh
[2] Rajshahi Univ Engn & Technol, Dept Comp Sci & Engn, Rajshahi 6204, Bangladesh
[3] Rutgers State Univ, Dept Comp Sci, Camden, NJ 08102 USA
[4] Rutgers State Univ, Ctr Computat & Integrat Biol, Camden, NJ 08102 USA
[5] Univ Maryland, Inst Biosci & Biotechnol Res, College Pk, MD 20742 USA
[6] Griffith Univ, Inst Integrated & Intelligent Syst, Brisbane, Qld 4111, Australia
[7] Tokyo Med & Dent Univ TMDU, Dept Med Sci Math, Tokyo 1138510, Japan
[8] RIKEN Ctr Integrat Med Sci, Lab Med Sci Math, Yokohama, Kanagawa 2300045, Japan
[9] Univ South Pacific, Fac Sci Technol & Environm, Sch Engn & Phys, Suva, Fiji
关键词
post-translational modification; lysine Glutarylation; machine learning; extra-trees classifier; bi-peptide evolutionary features; LYSINE SUCCINYLATION SITES; HOMOLOGY-BASED PREDICTION; POSTTRANSLATIONAL MODIFICATIONS; SCORING MATRIX; MALONYLATION; PROTEINS; IDENTIFICATION; LOCALIZATION; LOCATIONS; RESIDUES;
D O I
10.3390/genes11091023
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Post Translational Modification (PTM) is defined as the alteration of protein sequence upon interaction with different macromolecules after the translation process. Glutarylation is considered one of the most important PTMs, which is associated with a wide range of cellular functioning, including metabolism, translation, and specified separate subcellular localizations. During the past few years, a wide range of computational approaches has been proposed to predict Glutarylation sites. However, despite all the efforts that have been made so far, the prediction performance of the Glutarylation sites has remained limited. One of the main challenges to tackle this problem is to extract features with significant discriminatory information. To address this issue, we propose a new machine learning method called BiPepGlut using the concept of a bi-peptide-based evolutionary method for feature extraction. To build this model, we also use the Extra-Trees (ET) classifier for the classification purpose, which, to the best of our knowledge, has never been used for this task. Our results demonstrate BiPepGlut is able to significantly outperform previously proposed models to tackle this problem. BiPepGlut achieves 92.0%, 84.8%, 95.6%, 0.82, and 0.88 in accuracy, sensitivity, specificity, Matthew's Correlation Coefficient, and F1-score, respectively. BiPepGlut is implemented as a publicly available online predictor.
引用
收藏
页数:16
相关论文
共 58 条
  • [11] iFeature: a Python']Python package and web server for features extraction and selection from protein and peptide sequences
    Chen, Zhen
    Zhao, Pei
    Li, Fuyi
    Leier, Andre
    Marquez-Lago, Tatiana T.
    Wang, Yanan
    Webb, Geoffrey I.
    Smith, A. Ian
    Daly, Roger J.
    Chou, Kuo-Chen
    Song, Jiangning
    [J]. BIOINFORMATICS, 2018, 34 (14) : 2499 - 2502
  • [12] iDNAProt-ES: Identification of DNA-binding Proteins Using Evolutionary and Structural Features
    Chowdhury, Shahana Yasmin
    Shatabda, Swakkhar
    Dehzangi, Abdollah
    [J]. SCIENTIFIC REPORTS, 2017, 7
  • [13] COMB DG, 1966, J BIOL CHEM, V241, P1857
  • [14] Dehzangi Abdollah, 2013, Pattern Recognition in Bioinformatics. 8th IAPR International Conference, PRIB 2013. Proceedings: LNCS 7986, P196, DOI 10.1007/978-3-642-39159-0_18
  • [15] Improving succinylation prediction accuracy by incorporating the secondary structure via helix, strand and coil, and evolutionary information from profile bigrams
    Dehzangi, Abdollah
    Lopez, Yosvany
    Lal, Sunil Pranit
    Taherzadeh, Ghazaleh
    Sattar, Abdul
    Tsunoda, Tatsuhiko
    Sharma, Alok
    [J]. PLOS ONE, 2018, 13 (02):
  • [16] PSSM-Suc: Accurately predicting succinylation using position specific scoring matrix into bigram for feature extraction
    Dehzangi, Abdollah
    Lopez, Yosvany
    Lal, Sunil Pranit
    Taherzadeh, Ghazaleh
    Michaelson, Jacob
    Sattar, Abdul
    Tsunoda, Tatsuhiko
    Sharma, Alok
    [J]. JOURNAL OF THEORETICAL BIOLOGY, 2017, 425 : 97 - 102
  • [17] Gram-positive and Gram-negative protein subcellular localization by incorporating evolutionary-based descriptors into Chou's general PseAAC
    Dehzangi, Abdollah
    Heffernan, Rhys
    Sharma, Alok
    Lyons, James
    Paliwal, Kuldip
    Sattar, Abdul
    [J]. JOURNAL OF THEORETICAL BIOLOGY, 2015, 364 : 284 - 294
  • [18] A Combination of Feature Extraction Methods with an Ensemble of Different Classifiers for Protein Structural Class Prediction Problem
    Dehzangi, Abdollah
    Paliwal, Kuldip
    Sharma, Alok
    Dehzangi, Omid
    Sattar, Abdul
    [J]. IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2013, 10 (03) : 564 - 575
  • [19] A Comprehensive Review of In silico Analysis for Protein S-sulfenylation Sites
    Hasan, Md Mehedi
    Khatun, Mst Shamima
    Kurata, Hiroyuki
    [J]. PROTEIN AND PEPTIDE LETTERS, 2018, 25 (09) : 815 - 821
  • [20] Metabolic Regulation by Lysine Malonylation, Succinylation, and Glutarylation
    Hirschey, Matthew D.
    Zhao, Yingming
    [J]. MOLECULAR & CELLULAR PROTEOMICS, 2015, 14 (09) : 2308 - 2315