Leveraging Protein Dynamics to Identify Functional Phosphorylation Sites using Deep Learning Models

被引:11
|
作者
Zhu, Fei [1 ,4 ]
Yang, Sijie [4 ]
Meng, Fanwang [5 ]
Zheng, Yuxiang [1 ]
Ku, Xin [2 ]
Luo, Cheng [3 ]
Hu, Guang [1 ]
Liang, Zhongjie [1 ,2 ,3 ]
机构
[1] Soochow Univ, Ctr Syst Biol, Sch Biol & Basic Med Sci, Dept Bioinformat, Suzhou 215123, Peoples R China
[2] Shanghai Jiao Tong Univ, Shanghai Ctr Syst Biomed, Key Lab Syst Biomed, Minist Educ, Shanghai 200240, Peoples R China
[3] Chinese Acad Sci, Shanghai Inst Mat Med, State Key Lab Drug Res, Shanghai 201203, Peoples R China
[4] Soochow Univ, Sch Comp Sci & Technol, Suzhou 215006, Peoples R China
[5] McMaster Univ, Dept Chem & Chem Biol, Hamilton, ON L8S 4L8, Canada
基金
中国国家自然科学基金;
关键词
POSTTRANSLATIONAL MODIFICATION SITES; REGULATORY ELEMENTS; STRUCTURAL-ANALYSIS; SEQUENCE EVOLUTION; UBIQUITINATION; CROSSTALK; MUTATIONS; DBPTM; TOOL;
D O I
10.1021/acs.jcim.2c00484
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
Accurate prediction of post-translational modifications (PTMs) is of great significance in understanding cellular processes, by modulating protein structure and dynamics. Nowadays, with the rapid growth of protein data at different "omics " levels, machine learning models largely enriched the prediction of PTMs. However, most machine learning models only rely on protein sequence and little structural information. The lack of the systematic dynamics analysis underlying PTMs largely limits the PTM functional predictions. In this research, we present two dynamics-centric deep learning models, namely, cDL-PAU and cDL-FuncPhos, by incorporating sequence, structure, and dynamics-based features to elucidate the molecular basis and underlying functional landscape of PTMs. cDL-PAU achieved satisfactory area under the curve (AUC) scores of 0.804- 0.888 for predicting phosphorylation, acetylation, and ubiquitination (PAU) sites, while cDL-FuncPhos achieved an AUC value of 0.771 for predicting functional phosphorylation (FuncPhos) sites, displaying reliable improvements. Through a feature selection, the dynamics-based coupling and commute ability show large contributions in discovering PAU sites and FuncPhos sites, suggesting the allosteric propensity for important PTMs. The application of cDL-FuncPhos in three oncoproteins not only corroborates its strong performance in FuncPhos prioritization but also gains insight into the physical basis for the functions. The source code and data set of cDL-PAU and cDL-FuncPhos are available at https://github.com/ComputeSuda/PTM_ML.
引用
收藏
页码:3331 / 3345
页数:15
相关论文
共 50 条
  • [31] leveraging protein dynamics to identify mutational hotspot communities in cancer driver genes
    Kumar, Sushant
    Clarke, Declan
    Gerstein, Mark
    CANCER RESEARCH, 2019, 79 (13)
  • [32] PREDICTIVE DEEP LEARNING MODELS TO IDENTIFY TRAUMATIC BRAIN INJURIES USING MRI DATA
    Nidamanuri, Sreecharita
    Masood, Jafar ali ibrahim syed
    Asirvatham, David
    Abas, Fazly salleh bin
    Veerappan, Kousik nalliyanna goundar
    Budiman, Faisal
    Alam, Nur
    Sugumaran, Vijayan
    JOURNAL OF ENGINEERING SCIENCE AND TECHNOLOGY, 2025, 20 (02): : 362 - 379
  • [33] Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models
    Chua, Kurtland
    Calandra, Roberto
    McAllister, Rowan
    Levine, Sergey
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [34] Predicting phosphorylation sites using machine learning by integrating the sequence, structure, and functional information of proteins
    Salma Jamal
    Waseem Ali
    Priya Nagpal
    Abhinav Grover
    Sonam Grover
    Journal of Translational Medicine, 19
  • [35] Predicting phosphorylation sites using machine learning by integrating the sequence, structure, and functional information of proteins
    Jamal, Salma
    Ali, Waseem
    Nagpal, Priya
    Grover, Abhinav
    Grover, Sonam
    JOURNAL OF TRANSLATIONAL MEDICINE, 2021, 19 (01)
  • [36] A simple way to identify functional sites in protein .4. Active sites of trypsin
    Kiho, Y
    Bhandari, G
    Okada, Y
    PROCEEDINGS OF THE JAPAN ACADEMY SERIES B-PHYSICAL AND BIOLOGICAL SCIENCES, 1996, 72 (01): : 11 - 15
  • [37] Incorporating Deep Learning With Word Embedding to Identify Plant Ubiquitylation Sites
    Wang, Hongfei
    Wang, Zhuo
    Li, Zhongyan
    Lee, Tzong-Yi
    FRONTIERS IN CELL AND DEVELOPMENTAL BIOLOGY, 2020, 8
  • [38] Deep Learning to Identify Transcription Start Sites from CAGE Data
    Zheng, Hansi
    Li, Xiaoman
    Hu, Haiyan
    2020 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2020, : 168 - 172
  • [39] Application of Machine Learning Techniques to Predict Protein Phosphorylation Sites
    Zhang, Shengli
    Li, Xian
    Fan, Chengcheng
    Wu, Zhehui
    Liu, Qian
    LETTERS IN ORGANIC CHEMISTRY, 2019, 16 (04) : 247 - 257
  • [40] A deep learning based approach for prediction of Chlamydomonas reinhardtii phosphorylation sites
    Niraj Thapa
    Meenal Chaudhari
    Anthony A. Iannetta
    Clarence White
    Kaushik Roy
    Robert H. Newman
    Leslie M. Hicks
    Dukka B. KC
    Scientific Reports, 11