Predicting protein-peptide binding residues via interpretable deep learning

被引:36
|
作者
Wang, Ruheng [1 ,2 ]
Jin, Junru [1 ,2 ]
Zou, Quan [3 ]
Nakai, Kenta [4 ]
Wei, Leyi [1 ,2 ]
机构
[1] Shandong Univ, Sch Software, Jinan 250101, Peoples R China
[2] Shandong Univ, Joint SDU NTU Ctr Artificial Intelligence Res C F, Jinan 250101, Peoples R China
[3] Univ Elect Sci & Technol China, Inst Fundamental & Frontier Sci, Chengdu 610054, Peoples R China
[4] Univ Tokyo, Inst Med Sci, Human Genome Ctr, Tokyo 1088639, Japan
基金
中国国家自然科学基金;
关键词
SEQUENCE-BASED PREDICTION; SITES; DNA;
D O I
10.1093/bioinformatics/btac352
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
A Summary: Identifying the protein-peptide binding residues is fundamentally important to understand the mechanisms of protein functions and explore drug discovery. Although several computational methods have been developed, most of them highly rely on third-party tools or complex data preprocessing for feature design, easily resulting in low computational efficacy and suffering from low predictive performance. To address the limitations, we propose PepBCL, a novel BERT (Bidirectional Encoder Representation from Transformers) -based contrastive learning framework to predict the protein-peptide binding residues based on protein sequences only. PepBCL is an end-to-end predictive model that is independent of feature engineering. Specifically, we introduce a well pre-trained protein language model that can automatically extract and learn high-latent representations of protein sequences relevant for protein structures and functions. Further, we design a novel contrastive learning module to optimize the feature representations of binding residues underlying the imbalanced dataset. We demonstrate that our proposed method significantly outperforms the state-of-the-art methods under benchmarking comparison, and achieves more robust performance. Moreover, we found that we further improve the performance via the integration of traditional features and our learnt features. Interestingly, the interpretable analysis of our model highlights the flexibility and adaptability of deep learning-based protein language model to capture both conserved and non-conserved sequential characteristics of peptide-binding residues. Finally, to facilitate the use of our method, we establish an online predictive platform as the implementation of the proposed PepBCL, which is now available at http://server.wei-group.net/PepBCL/.
引用
收藏
页码:3351 / 3360
页数:10
相关论文
共 50 条
  • [41] SPPPred: Sequence-Based Protein-Peptide Binding Residue Prediction Using Genetic Programming and Ensemble Learning
    Shafiee, Shima
    Fathi, Abdolhossein
    Taherzadeh, Ghazaleh
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2023, 20 (03) : 2029 - 2040
  • [42] MDockPeP2: Predicting protein-peptide complex structures by accounting for peptide flexibility in long peptides
    Xu, Xianjin
    Zou, Xiaoqin
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2019, 258
  • [43] Protein embeddings and deep learning predict binding residues for various ligand classes
    Maria Littmann
    Michael Heinzinger
    Christian Dallago
    Konstantin Weissenow
    Burkhard Rost
    Scientific Reports, 11
  • [44] Protein embeddings and deep learning predict binding residues for various ligand classes
    Littmann, Maria
    Heinzinger, Michael
    Dallago, Christian
    Weissenow, Konstantin
    Rost, Burkhard
    SCIENTIFIC REPORTS, 2021, 11 (01)
  • [45] DP-site: A dual deep learning-based method for protein-peptide interaction site prediction
    Shafiee, Shima
    Fathi, Abdolhossein
    Taherzadeh, Ghazaleh
    METHODS, 2024, 229 : 17 - 29
  • [46] Industry return prediction via interpretable deep learning
    Zografopoulos, Lazaros
    Iannino, Maria Chiara
    Psaradellis, Ioannis
    Sermpinis, Georgios
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2025, 321 (01) : 257 - 268
  • [47] Recent advances in structure-based prediction of protein-peptide binding affinities
    Beuming, Thijs
    Li, Hubert
    Feyfant, Eric
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2017, 253
  • [48] Conformational Contribution to Thermodynamics of Binding in Protein-Peptide Complexes through Microscopic Simulation
    Das, Amit
    Chakrabarti, J.
    Ghosh, Mahua
    BIOPHYSICAL JOURNAL, 2013, 104 (06) : 1274 - 1284
  • [49] COMP 457-Predicting bound protein-peptide conformations: Application to MHC-peptide complexes
    Antes, Iris
    Lengauer, Thomas
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2007, 234
  • [50] Impact of Halogen Bonds on Protein-Peptide Binding and Protein Structural Stability Revealed by Computational Approaches
    Li, Jintian
    Zhou, Liping
    Han, Zijian
    Wu, Leyun
    Zhang, Jianfang
    Zhu, Weiliang
    Xu, Zhijian
    JOURNAL OF MEDICINAL CHEMISTRY, 2024, 67 (06) : 4782 - 4792