Leveraging Protein Dynamics to Identify Functional Phosphorylation Sites using Deep Learning Models

被引:11
|
作者
Zhu, Fei [1 ,4 ]
Yang, Sijie [4 ]
Meng, Fanwang [5 ]
Zheng, Yuxiang [1 ]
Ku, Xin [2 ]
Luo, Cheng [3 ]
Hu, Guang [1 ]
Liang, Zhongjie [1 ,2 ,3 ]
机构
[1] Soochow Univ, Ctr Syst Biol, Sch Biol & Basic Med Sci, Dept Bioinformat, Suzhou 215123, Peoples R China
[2] Shanghai Jiao Tong Univ, Shanghai Ctr Syst Biomed, Key Lab Syst Biomed, Minist Educ, Shanghai 200240, Peoples R China
[3] Chinese Acad Sci, Shanghai Inst Mat Med, State Key Lab Drug Res, Shanghai 201203, Peoples R China
[4] Soochow Univ, Sch Comp Sci & Technol, Suzhou 215006, Peoples R China
[5] McMaster Univ, Dept Chem & Chem Biol, Hamilton, ON L8S 4L8, Canada
基金
中国国家自然科学基金;
关键词
POSTTRANSLATIONAL MODIFICATION SITES; REGULATORY ELEMENTS; STRUCTURAL-ANALYSIS; SEQUENCE EVOLUTION; UBIQUITINATION; CROSSTALK; MUTATIONS; DBPTM; TOOL;
D O I
10.1021/acs.jcim.2c00484
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
Accurate prediction of post-translational modifications (PTMs) is of great significance in understanding cellular processes, by modulating protein structure and dynamics. Nowadays, with the rapid growth of protein data at different "omics " levels, machine learning models largely enriched the prediction of PTMs. However, most machine learning models only rely on protein sequence and little structural information. The lack of the systematic dynamics analysis underlying PTMs largely limits the PTM functional predictions. In this research, we present two dynamics-centric deep learning models, namely, cDL-PAU and cDL-FuncPhos, by incorporating sequence, structure, and dynamics-based features to elucidate the molecular basis and underlying functional landscape of PTMs. cDL-PAU achieved satisfactory area under the curve (AUC) scores of 0.804- 0.888 for predicting phosphorylation, acetylation, and ubiquitination (PAU) sites, while cDL-FuncPhos achieved an AUC value of 0.771 for predicting functional phosphorylation (FuncPhos) sites, displaying reliable improvements. Through a feature selection, the dynamics-based coupling and commute ability show large contributions in discovering PAU sites and FuncPhos sites, suggesting the allosteric propensity for important PTMs. The application of cDL-FuncPhos in three oncoproteins not only corroborates its strong performance in FuncPhos prioritization but also gains insight into the physical basis for the functions. The source code and data set of cDL-PAU and cDL-FuncPhos are available at https://github.com/ComputeSuda/PTM_ML.
引用
收藏
页码:3331 / 3345
页数:15
相关论文
共 50 条
  • [41] A deep learning based approach for prediction of Chlamydomonas reinhardtii phosphorylation sites
    Thapa, Niraj
    Chaudhari, Meenal
    Iannetta, Anthony A.
    White, Clarence
    Roy, Kaushik
    Newman, Robert H.
    Hicks, Leslie M.
    Kc, Dukka B.
    SCIENTIFIC REPORTS, 2021, 11 (01)
  • [42] DF-Phos: Prediction of Protein Phosphorylation Sites by Deep Forest
    Zahiri, Zeynab
    Mehrshad, Nasser
    Mehrshad, Maliheh
    JOURNAL OF BIOCHEMISTRY, 2024, 175 (04): : 447 - 456
  • [43] PhosAF: An integrated deep learning architecture for predicting protein phosphorylation sites with AlphaFold2 predicted structures
    Yu, Ziyuan
    Yu, Jialin
    Wang, Hongmei
    Zhang, Shuai
    Zhao, Long
    Shi, Shaoping
    ANALYTICAL BIOCHEMISTRY, 2024, 690
  • [44] Leveraging YOLO deep learning models to enhance plant disease identification
    Alhwaiti, Yousef
    Khan, Muntazir
    Asim, Muhammad
    Siddiqi, Muhammad Hameed
    Ishaq, Muhammad
    Alruwaili, Madallah
    SCIENTIFIC REPORTS, 2025, 15 (01):
  • [45] Leveraging multimodal microscopy to optimize deep learning models for cell segmentation
    Cameron, William D.
    Bennett, Alex M.
    Bui, Cindy V.
    Chang, Huntley H.
    Rocheleau, Jonathan V.
    APL BIOENGINEERING, 2021, 5 (01)
  • [46] An accurate, sensitive, and scalable method to identify functional sites in protein structures
    Yao, H
    Kristensen, DM
    Mihalek, I
    Sowa, ME
    Shaw, C
    Kimmel, M
    Kavraki, L
    Lichtarge, O
    JOURNAL OF MOLECULAR BIOLOGY, 2003, 326 (01) : 255 - 261
  • [47] PTG-PLM: Predicting Post-Translational Glycosylation and Glycation Sites Using Protein Language Models and Deep Learning
    Alkuhlani, Alhasan
    Gad, Walaa
    Roushdy, Mohamed
    Voskoglou, Michael Gr
    Salem, Abdel-badeeh M.
    AXIOMS, 2022, 11 (09)
  • [48] Language models can identify enzymatic binding sites in protein sequences
    Teukam, Yves Gaetan Nana
    Dassi, Loic Kwate
    Manica, Matteo
    Probst, Daniel
    Schwaller, Philippe
    Laino, Teodoro
    COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2024, 23 : 1929 - 1937
  • [49] Effect of sequence padding on the performance of deep learning models in archaeal protein functional prediction
    Angela Lopez-del Rio
    Maria Martin
    Alexandre Perera-Lluna
    Rabie Saidi
    Scientific Reports, 10
  • [50] Effect of sequence padding on the performance of deep learning models in archaeal protein functional prediction
    Lopez-del Rio, Angela
    Martin, Maria
    Perera-Lluna, Alexandre
    Saidi, Rabie
    SCIENTIFIC REPORTS, 2020, 10 (01)