Predicting the involvement of polyQ- and polyA in protein-protein interactions by their amino acid context

被引:0
作者
Mier, Pablo [1 ]
Andrade-Navarro, Miguel A. [1 ]
机构
[1] Johannes Gutenberg Univ Mainz, Inst Organism & Mol Evolut, Fac Biol, Hans Dieter Husch Weg 15, D-55128 Mainz, Germany
关键词
Homorepeat; Polyglutamine; Polyalanine; Protein-protein interaction; Machine learning; STRUCTURAL BASIS; AGGREGATION; RECOGNITION; HOMOREPEATS; POLYALANINE; EVOLUTION; EXPANSION; REGIONS; FIR;
D O I
10.1016/j.heliyon.2024.e37861
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Homorepeats, specifically polyglutamine (polyQ) and polyalanine (polyA), are often implicated in protein-protein interactions (PPIs). So far, a method to predict the participation of homorepeats in protein interactions is lacking. We propose a machine learning approach to identify PPI-involved polyQ and polyA regions within the human proteome based on known interacting regions. Using the dataset of human homorepeats, we identified 157 polyQ and 745 polyA regions potentially involved in PPIs. Machine learning models, trained on amino acid context and homorepeat length, demonstrated high precision (0.90-0.98) but variable recall (0.42-0.85). Random forest outperformed other models (AUC polyQ = 0.686, AUC polyA = 0.732) using the positions surrounding the homorepeat -10 to +10. Integrating paralog information marginally improved predictions but was excluded for model simplicity. Further optimization revealed that for polyQ, using amino acid surrounding positions from -6 to +6 increased AUC to 0.715. For polyA, no improvement was found. Incorporating coiled coil overlap information enhanced polyA predictions (AUC = 0.745) but not polyQ. Finally, we applied these models to predict PPI involvement across all polyQ and polyA regions, identifying potential interactions. Case studies illustrated the method's predictive capacity, highlighting known interacting regions with high scores and elucidating potential false negatives.
引用
收藏
页数:11
相关论文
共 50 条
  • [31] NMR studies of weak protein-protein interactions
    Lian, Lu-Yun
    PROGRESS IN NUCLEAR MAGNETIC RESONANCE SPECTROSCOPY, 2013, 71 : 59 - 72
  • [32] Osmolytes and Protein-Protein Interactions
    Rydeen, Amy E.
    Brustad, Eric M.
    Pielak, Gary J.
    JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, 2018, 140 (24) : 7441 - 7444
  • [33] Protein-Protein Interactions in Plants
    Fukao, Yoichiro
    PLANT AND CELL PHYSIOLOGY, 2012, 53 (04) : 617 - 625
  • [34] Probabilistic prediction of protein-protein interactions from the protein sequences
    Chinnasamy, Arunkumar
    Mittal, Ankush
    Sung, Wing-Kin
    COMPUTERS IN BIOLOGY AND MEDICINE, 2006, 36 (10) : 1143 - 1154
  • [35] A deep learning algorithm for predicting protein-protein interactions with nonnegative latent factorization
    Wang, Liwei
    Hu, Lun
    2021 INTERNATIONAL CONFERENCE ON CYBER-PHYSICAL SOCIAL INTELLIGENCE (ICCSI), 2021,
  • [36] Clustered complementary amino acid pairing (CCAAP) for protein-protein interaction
    Baek, Christina Kyung Eun
    Baek, Chang-Ho
    BIOTECHNOLOGY LETTERS, 2019, 41 (01) : 79 - 90
  • [37] Inhibitors of protein-protein interactions
    Ockey, DA
    Gadek, TR
    EXPERT OPINION ON THERAPEUTIC PATENTS, 2002, 12 (03) : 393 - 400
  • [38] Prediction of Protein-Protein Interactions from Amino Acid Sequences Based on Continuous and Discrete Wavelet Transform Features
    Wang, Tao
    Li, Liping
    Huang, Yu-An
    Zhang, Hui
    Ma, Yahong
    Zhou, Xing
    MOLECULES, 2018, 23 (04):
  • [39] SAAMBE-3D: Predicting Effect of Mutations on Protein-Protein Interactions
    Pahari, Swagata
    Li, Gen
    Murthy, Adithya Krishna
    Liang, Siqi
    Fragoza, Robert
    Yu, Haiyuan
    Alexov, Emil
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2020, 21 (07)
  • [40] Identifying Dominant Amino Acid Pairs of Known Protein-Protein Interactions via K-Means Clustering
    Ngamsuriyaroj, Sudsanguan
    Thepsutum, Kittirat
    2017 19TH IEEE INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS (HPCC) / 2017 15TH IEEE INTERNATIONAL CONFERENCE ON SMART CITY (SMARTCITY) / 2017 3RD IEEE INTERNATIONAL CONFERENCE ON DATA SCIENCE AND SYSTEMS (DSS), 2017, : 286 - 291