Gly-LysPred: Identification of Lysine Glycation Sites in Protein Using Position Relative Features and Statistical Moments via Chou's 5 Step Rule

Cited by: 3
Authors
Khanum, Shaheena [1]
Ashraf, Muhammad Adeel [2]
Karim, Asim [1]
Shoaib, Bilal [3]
Khan, Muhammad Adnan [4]
Naqvi, Rizwan Ali [5]
Siddique, Kamran [6]
Alswaitti, Mohammed [6]
Affiliations
[1] Lahore Univ Management Sci, Dept Comp Sci, Lahore 54792, Pakistan
[2] Univ Management & Technol, Dept Comp Sci, Lahore 54770, Pakistan
[3] Minhaj Univ Lahore, Sch Comp Sci, Lahore 54770, Pakistan
[4] Lahore Garrison Univ, Dept Comp Sci, Lahore 54000, Pakistan
[5] Sejong Univ, Dept Unmanned Vehicle Engn, Seoul, South Korea
[6] Xiamen Univ Malaysia, Sch Elect & Comp Engn, Dept Informat & Commun Technol, Sepang 43900, Malaysia
Source
CMC-COMPUTERS MATERIALS & CONTINUA | 2021, Vol. 66, No. 2
Keywords
Gly-LysPred; PseAAC; post-translational modification; lysine glycation; Chou's 5 step rule; position relative features; POSTTRANSLATIONAL MODIFICATIONS; END-PRODUCTS; PREDICTION; PHOSPHORYLATION; DOMAINS; BINDING; AGENTS;
DOI
10.32604/cmc.2020.013646
Chinese Library Classification
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Glycation is a non-enzymatic post-translational modification that attaches sugar molecules and residues to a peptide. It is clinically implicated in numerous age-related, metabolic, and chronic diseases such as diabetes, Alzheimer's disease, and renal failure. Identifying the sites of a non-enzymatic reaction is challenging, and manual identification in the laboratory is costly and time-consuming. In this research, we developed an accurate, valid, and robust model named Gly-LysPred to differentiate glycated sites from non-glycated sites. Comprehensive techniques based on position relative features are used for feature extraction. A random forest, combined with preprocessing and feature engineering techniques, was used to train the computational model. The model was evaluated with self-consistency testing, jackknife testing, and cross-validation testing. The overall accuracy under self-consistency, jackknife, and cross-validation testing was 100%, 99.92%, and 99.88%, with MCC values of 1.00, 0.99, and 0.997, respectively. A user-friendly web server was also developed to encapsulate the whole procedure. These feature vectorization methods may also play a critical role in other web servers developed to classify lysine glycation.
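The abstract reports accuracy and the Matthews correlation coefficient (MCC) for a binary glycated/non-glycated classifier. As a minimal sketch of how these two metrics are computed from a confusion matrix, the following uses the standard definitions; the function names and the toy labels are hypothetical and are not taken from the paper or its data.

```python
import math


def confusion_counts(y_true, y_pred):
    """Count TP, TN, FP, FN for binary labels (1 = glycated, 0 = non-glycated)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    return tp, tn, fp, fn


def accuracy(tp, tn, fp, fn):
    """Fraction of sites classified correctly."""
    total = tp + tn + fp + fn
    return (tp + tn) / total if total else 0.0


def mcc(tp, tn, fp, fn):
    """Matthews correlation coefficient; returns 0.0 when the denominator is zero."""
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    if denom == 0:
        return 0.0
    return (tp * tn - fp * fn) / denom


# Toy labels for illustration only (hypothetical, not the paper's benchmark).
y_true = [1, 1, 0, 0]
y_pred = [1, 0, 0, 0]
counts = confusion_counts(y_true, y_pred)
print(accuracy(*counts), mcc(*counts))
```

MCC ranges from -1 to 1 and accounts for all four confusion-matrix cells, which is why it is often reported alongside accuracy when the positive and negative classes are imbalanced.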
Pages: 2165-2181
Page count: 17