Predicting the involvement of polyQ- and polyA in protein-protein interactions by their amino acid context

被引:0
作者
Mier, Pablo [1 ]
Andrade-Navarro, Miguel A. [1 ]
机构
[1] Johannes Gutenberg Univ Mainz, Inst Organism & Mol Evolut, Fac Biol, Hans Dieter Husch Weg 15, D-55128 Mainz, Germany
关键词
Homorepeat; Polyglutamine; Polyalanine; Protein-protein interaction; Machine learning; STRUCTURAL BASIS; AGGREGATION; RECOGNITION; HOMOREPEATS; POLYALANINE; EVOLUTION; EXPANSION; REGIONS; FIR;
D O I
10.1016/j.heliyon.2024.e37861
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Homorepeats, specifically polyglutamine (polyQ) and polyalanine (polyA), are often implicated in protein-protein interactions (PPIs). So far, a method to predict the participation of homorepeats in protein interactions is lacking. We propose a machine learning approach to identify PPI-involved polyQ and polyA regions within the human proteome based on known interacting regions. Using the dataset of human homorepeats, we identified 157 polyQ and 745 polyA regions potentially involved in PPIs. Machine learning models, trained on amino acid context and homorepeat length, demonstrated high precision (0.90-0.98) but variable recall (0.42-0.85). Random forest outperformed other models (AUC polyQ = 0.686, AUC polyA = 0.732) using the positions surrounding the homorepeat -10 to +10. Integrating paralog information marginally improved predictions but was excluded for model simplicity. Further optimization revealed that for polyQ, using amino acid surrounding positions from -6 to +6 increased AUC to 0.715. For polyA, no improvement was found. Incorporating coiled coil overlap information enhanced polyA predictions (AUC = 0.745) but not polyQ. Finally, we applied these models to predict PPI involvement across all polyQ and polyA regions, identifying potential interactions. Case studies illustrated the method's predictive capacity, highlighting known interacting regions with high scores and elucidating potential false negatives.
引用
收藏
页数:11
相关论文
共 50 条
  • [41] Progress and challenges in predicting protein-protein interaction sites
    Ezkurdia, Lakes
    Bartoli, Lisa
    Fariselli, Piero
    Casadio, Rita
    Valencia, Alfonso
    Tress, Michael L.
    BRIEFINGS IN BIOINFORMATICS, 2009, 10 (03) : 233 - 246
  • [42] Geometric and amino acid type determinants for protein-protein interaction interfaces
    Yang, Yongxiao
    Wang, Wei
    Lou, Yuan
    Yin, Jianxin
    Gong, Xinqi
    QUANTITATIVE BIOLOGY, 2018, 6 (02) : 163 - 174
  • [43] Protein-protein binding affinity prediction from amino acid sequence
    Yugandhar, K.
    Gromiha, M. Michael
    BIOINFORMATICS, 2014, 30 (24) : 3583 - 3589
  • [44] Supervised learning approaches for predicting Ebola-Human Protein-Protein interactions
    Dey, Lopamudra
    Chakraborty, Sanjay
    GENE, 2025, 942
  • [45] Using ensemble methods to deal with imbalanced data in predicting protein-protein interactions
    Zhang, Yongqing
    Zhang, Danling
    Mi, Gang
    Ma, Daichuan
    Li, Gongbing
    Guo, Yanzhi
    Li, Menglong
    Zhu, Min
    COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2012, 36 : 36 - 41
  • [46] Modeling of Protein-Protein Interactions in Cytokinin Signal Transduction
    Arkhipov, Dmitry V.
    Lomin, Sergey N.
    Myakushina, Yulia A.
    Savelieva, Ekaterina M.
    Osolodkin, Dmitry I.
    Romanov, Georgy A.
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2019, 20 (09)
  • [47] Galvanization of Protein-Protein Interactions in a Dynamic Zinc Interactome
    Kocyla, Anna
    Tran, Jozef Ba
    Krezel, Artur
    TRENDS IN BIOCHEMICAL SCIENCES, 2021, 46 (01) : 64 - 79
  • [48] Transient protein-protein interactions visualized by solution NMR
    Liu, Zhu
    Gong, Zhou
    Dong, Xu
    Tang, Chun
    BIOCHIMICA ET BIOPHYSICA ACTA-PROTEINS AND PROTEOMICS, 2016, 1864 (01): : 115 - 122
  • [49] Coevolutive, evolutive and stochastic information in protein-protein interactions
    Andrade, Miguel
    Pontes, Camila
    Treptow, Werner
    COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2019, 17 : 1429 - 1435
  • [50] Structural prediction of protein-protein interactions in Saccharomyces cerevisiae
    Paradesi, Martin S. R.
    Caragea, Doina
    Hsu, William H.
    PROCEEDINGS OF THE 7TH IEEE INTERNATIONAL SYMPOSIUM ON BIOINFORMATICS AND BIOENGINEERING, VOLS I AND II, 2007, : 1270 - 1274