Predicting the involvement of polyQ- and polyA in protein-protein interactions by their amino acid context

Times cited: 0
Authors
Mier, Pablo [1]
Andrade-Navarro, Miguel A. [1]
Affiliations
[1] Johannes Gutenberg Univ Mainz, Inst Organism & Mol Evolut, Fac Biol, Hans Dieter Husch Weg 15, D-55128 Mainz, Germany
Keywords
Homorepeat; Polyglutamine; Polyalanine; Protein-protein interaction; Machine learning; STRUCTURAL BASIS; AGGREGATION; RECOGNITION; HOMOREPEATS; POLYALANINE; EVOLUTION; EXPANSION; REGIONS; FIR;
DOI
10.1016/j.heliyon.2024.e37861
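Chinese Library Classification (CLC) number
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences];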
Discipline classification code
07 ; 0710 ; 09 ;
Abstract
Homorepeats, specifically polyglutamine (polyQ) and polyalanine (polyA), are often implicated in protein-protein interactions (PPIs). So far, a method to predict the participation of homorepeats in protein interactions has been lacking. We propose a machine learning approach to identify PPI-involved polyQ and polyA regions within the human proteome based on known interacting regions. Using the dataset of human homorepeats, we identified 157 polyQ and 745 polyA regions potentially involved in PPIs. Machine learning models, trained on amino acid context and homorepeat length, demonstrated high precision (0.90-0.98) but variable recall (0.42-0.85). Random forest outperformed the other models (AUC polyQ = 0.686, AUC polyA = 0.732) using the positions surrounding the homorepeat, from -10 to +10. Integrating paralog information marginally improved predictions but was excluded for model simplicity. Further optimization revealed that for polyQ, using the surrounding amino acid positions from -6 to +6 increased the AUC to 0.715. For polyA, no improvement was found. Incorporating coiled coil overlap information enhanced polyA predictions (AUC = 0.745) but not polyQ predictions. Finally, we applied these models to predict PPI involvement across all polyQ and polyA regions, identifying potential interactions. Case studies illustrated the method's predictive capacity, highlighting known interacting regions with high scores and elucidating potential false negatives.
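The abstract states that the models were trained on amino acid context and homorepeat length, using positions from -10 to +10 around the homorepeat. The record gives no implementation details, so the following is only a minimal sketch of how such a feature vector might be assembled. The one-hot encoding, the `-` padding symbol, the pure-run repeat definition with `min_len`, and all function names are assumptions made for illustration and are not taken from the paper.

```python
from typing import List, Tuple

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
PAD = "-"  # assumed placeholder for positions beyond the protein termini


def find_homorepeats(seq: str, residue: str, min_len: int = 4) -> List[Tuple[int, int]]:
    """Return half-open (start, end) spans of uninterrupted runs of `residue`.

    Assumption: a homorepeat is a pure run of at least `min_len` residues.
    The paper's actual definition may differ (for example, it may allow impurities).
    """
    spans = []
    i = 0
    while i < len(seq):
        if seq[i] == residue:
            j = i
            while j < len(seq) and seq[j] == residue:
                j += 1
            if j - i >= min_len:
                spans.append((i, j))
            i = j
        else:
            i += 1
    return spans


def context_window(seq: str, start: int, end: int, flank: int = 10) -> str:
    """Concatenate the `flank` residues before and after the repeat, padded at the termini."""
    left = seq[max(0, start - flank):start].rjust(flank, PAD)
    right = seq[end:end + flank].ljust(flank, PAD)
    return left + right


def encode(seq: str, start: int, end: int, flank: int = 10) -> List[float]:
    """Build a feature vector: repeat length followed by a one-hot code per context position."""
    features = [float(end - start)]
    for aa in context_window(seq, start, end, flank):
        # Padding and non-standard residues encode as an all-zero block.
        features.extend(1.0 if aa == ref else 0.0 for ref in AMINO_ACIDS)
    return features
```

Vectors built this way could be passed to any standard classifier, for example scikit-learn's `RandomForestClassifier`, with labels derived from known interacting regions. Setting `flank=6` would correspond to the narrower -6 to +6 window that the abstract reports for polyQ.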
Pages: 11
Related papers
50 in total
  • [21] Protein-Protein Interactions in Translesion Synthesis
    Dash, Radha Charan
    Hadden, Kyle
    MOLECULES, 2021, 26 (18):
  • [22] Predicting Protein-Protein Interactions based on Biological Information using Extreme Gradient Boosting
    Beltran, Jerome Cary
    Valdez, Paolo
    Naval, Prospero, Jr.
    2019 16TH IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY - CIBCB 2019, 2019, : 346 - 351
  • [23] Predator: Predicting the Impact of Cancer Somatic Mutations on Protein-Protein Interactions
    Berber, Ibrahim
    Erten, Cesim
    Kazan, Hilal
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2023, 20 (05) : 3163 - 3172
  • [24] Analyzing Effect of Multi-Modality in Predicting Protein-Protein Interactions
    Jha, Kanchan
    Saha, Sriparna
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2023, 20 (01) : 162 - 173
  • [25] Predicting Protein-Protein Interactions with Weighted PSSM Histogram and Random Forests
    Wei, Zhi-Sen
    Yang, Jing-Yu
    Yu, Dong-Jun
    INTELLIGENCE SCIENCE AND BIG DATA ENGINEERING: BIG DATA AND MACHINE LEARNING TECHNIQUES, ISCIDE 2015, PT II, 2015, 9243 : 326 - 335
  • [26] Generation of an Orthogonal Protein-Protein Interface with a Noncanonical Amino Acid
    Koh, Minseob
    Nasertorabi, Fariborz
    Han, Gye Won
    Stevens, Raymond C.
    Schultz, Peter G.
    JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, 2017, 139 (16) : 5728 - 5731
  • [27] A Bifunctional Amino Acid Enables Both Covalent Chemical Capture and Isolation of in Vivo Protein-Protein Interactions
    Joiner, Cassandra M.
    Breen, Meghan E.
    Clayton, James
    Mapp, Anna K.
    CHEMBIOCHEM, 2017, 18 (02) : 181 - 184
  • [28] Combined chemical shift changes and amino acid specific chemical shift mapping of protein-protein interactions
    Schumann, Frank H.
    Riepl, Hubert
    Maurer, Till
    Gronwald, Wolfram
    Neidig, Klaus-Peter
    Kalbitzer, Hans Robert
    JOURNAL OF BIOMOLECULAR NMR, 2007, 39 (04) : 275 - 289
  • [29] A Bayesian Framework for Combining Protein and Network Topology Information for Predicting Protein-Protein Interactions
    Birlutiu, Adriana
    d'Alche-Buc, Florence
    Heskes, Tom
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2015, 12 (03) : 538 - 550
  • [30] Sequence-based machine learning method for predicting the effects of phosphorylation on protein-protein interactions
    Hong, Xiaokun
    Lv, Jiyang
    Li, Zhengxin
    Xiong, Yi
    Zhang, Jian
    Chen, Hai-Feng
    INTERNATIONAL JOURNAL OF BIOLOGICAL MACROMOLECULES, 2023, 243