Predicting the involvement of polyQ- and polyA in protein-protein interactions by their amino acid context

Authors
Mier, Pablo [1]
Andrade-Navarro, Miguel A. [1]
Affiliations
[1] Johannes Gutenberg Univ Mainz, Inst Organism & Mol Evolut, Fac Biol, Hans Dieter Husch Weg 15, D-55128 Mainz, Germany
Keywords
Homorepeat; Polyglutamine; Polyalanine; Protein-protein interaction; Machine learning; STRUCTURAL BASIS; AGGREGATION; RECOGNITION; HOMOREPEATS; POLYALANINE; EVOLUTION; EXPANSION; REGIONS; FIR;
DOI
10.1016/j.heliyon.2024.e37861
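Chinese Library Classification
O [Mathematical Sciences and Chemistry]; P [Astronomy and Earth Sciences]; Q [Biological Sciences]; N [General Natural Sciences];
Subject classification codes
07 ; 0710 ; 09 ;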
Abstract
Homorepeats, specifically polyglutamine (polyQ) and polyalanine (polyA), are often implicated in protein-protein interactions (PPIs). So far, a method to predict the participation of homorepeats in protein interactions is lacking. We propose a machine learning approach to identify PPI-involved polyQ and polyA regions within the human proteome based on known interacting regions. Using the dataset of human homorepeats, we identified 157 polyQ and 745 polyA regions potentially involved in PPIs. Machine learning models, trained on amino acid context and homorepeat length, demonstrated high precision (0.90-0.98) but variable recall (0.42-0.85). Random forest outperformed the other models (AUC polyQ = 0.686, AUC polyA = 0.732) when using the positions surrounding the homorepeat, from -10 to +10. Integrating paralog information marginally improved predictions but was excluded for model simplicity. Further optimization revealed that, for polyQ, restricting the surrounding amino acid positions to -6 to +6 increased the AUC to 0.715. For polyA, no improvement was found. Incorporating coiled-coil overlap information enhanced polyA predictions (AUC = 0.745) but not polyQ predictions. Finally, we applied these models to predict PPI involvement across all polyQ and polyA regions, identifying potential interactions. Case studies illustrated the method's predictive capacity, highlighting known interacting regions with high scores and elucidating potential false negatives.
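
The feature construction described in the abstract (amino acid context around the homorepeat plus homorepeat length) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the paper: the function names, the definition of a homorepeat as a pure run of at least 4 identical residues, the padding symbol for positions beyond the protein termini, and the one-hot encoding. The resulting vector is the kind of input a classifier such as a random forest could be trained on.

```python
import re

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues
PAD = "-"  # assumed placeholder for positions beyond the protein termini


def find_homorepeats(seq, residue, min_len=4):
    """Return (start, end) spans (0-based, end exclusive) of pure runs of
    `residue`. The min_len threshold is an illustrative assumption."""
    pattern = re.compile(f"{re.escape(residue)}{{{min_len},}}")
    return [m.span() for m in pattern.finditer(seq)]


def context_window(seq, start, end, flank=10):
    """Return the `flank` residues before and after the repeat, padded
    with PAD where the window runs past either end of the sequence."""
    left = seq[max(0, start - flank):start].rjust(flank, PAD)
    right = seq[end:end + flank].ljust(flank, PAD)
    return left + right


def encode(seq, start, end, flank=10):
    """One-hot encode the flanking window and append the repeat length.
    Padding positions are encoded as all zeros."""
    features = []
    for aa in context_window(seq, start, end, flank):
        features.extend(1 if aa == ref else 0 for ref in AMINO_ACIDS)
    features.append(end - start)  # homorepeat length as the final feature
    return features


# Hypothetical toy sequence with a single polyQ stretch.
example = "MKTQQQQQQLAPWS"
spans = find_homorepeats(example, "Q")
vectors = [encode(example, s, e, flank=6) for s, e in spans]
```

With `flank=10` each region yields 20 x 20 context features plus one length feature; with `flank=6`, the window the abstract reports as better for polyQ, it yields 12 x 20 plus one.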
Pages: 11