ProteinPrompt: a webserver for predicting protein-protein interactions

被引:6
|
作者
Canzler, Sebastian [1 ,2 ]
Fischer, Markus [3 ]
Ulbricht, David [3 ]
Ristic, Nikola [3 ]
Hildebrand, Peter W. [3 ,4 ,5 ,6 ,7 ]
Staritzbichler, Rene [1 ,3 ]
机构
[1] Immuthera GmbH, D-04275 Leipzig, Germany
[2] UFZ Helmholtz Ctr Environm Res, Dept Computat Biol, D-04318 Leipzig, Germany
[3] Univ Leipzig, Inst Med Phys & Biophys, D-04107 Leipzig, Germany
[4] Charite Univ Med Berlin, D-10117 Berlin, Germany
[5] Free Univ Berlin, D-10117 Berlin, Germany
[6] Humboldt Univ, Inst Med Phys & Biophys, D-10117 Berlin, Germany
[7] Charite Univ Med Berlin, Berlin Inst Hlth, D-10117 Berlin, Germany
来源
BIOINFORMATICS ADVANCES | 2022年 / 2卷 / 01期
关键词
CD-HIT; DATABASE; HYDROPHOBICITY; CLASSIFICATION; MEMBRANE; SCALE; LINKS;
D O I
10.1093/bioadv/vbac059
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Motivation: Protein-protein interactions (PPIs) play an essential role in a great variety of cellular processes and are therefore of significant interest for the design of new therapeutic compounds as well as the identification of side effects due to unexpected binding. Here, we present ProteinPrompt, a webserver that uses machine learning algorithms to calculate specific, currently unknown PPIs. Our tool is designed to quickly and reliably predict contact propensities based on an input sequence in order to scan large sequence libraries for potential binding partners, with the goal to accelerate and assure the quality of the laborious process of drug target identification. Results: We collected and thoroughly filtered a comprehensive database of known binders from several sources, which is available as download. ProteinPrompt provides two complementary search methods of similar accuracy for comparison and consensus building. The default method is a random forest (RF) algorithm that uses the auto-correlations of seven amino acid scales. Alternatively, a graph neural network (GNN) implementation can be selected. Additionally, a consensus prediction is available. For each query sequence, potential binding partners are identified from a protein sequence database. The proteom of several organisms are available and can be searched for binders. To evaluate the predictive power of the algorithms, we prepared a test dataset that was rigorously filtered for redundancy. No sequence pairs similar to the ones used for training were included in this dataset. With this challenging dataset, the RF method achieved an accuracy rate of 0.88 and an area under the curve of 0.95. The GNN achieved an accuracy rate of 0.86 using the same dataset. Since the underlying learning approaches are unrelated, comparing the results of RF and GNNs reduces the likelihood of errors. The consensus reached an accuracy of 0.89.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] Predicting global protein-protein interactions
    Rachel Brem
    Genome Biology, 1 (1)
  • [2] Hyperplanes for predicting protein-protein interactions
    Nanni, L
    NEUROCOMPUTING, 2005, 69 (1-3) : 257 - 263
  • [3] Predicting protein-protein interactions in the context of protein evolution
    Lewis, Anna C. F.
    Saeed, Ramazan
    Deane, Charlotte M.
    MOLECULAR BIOSYSTEMS, 2010, 6 (01) : 55 - 64
  • [4] SAMBE: The New Webserver for Predicting the Effect of nssNP on the Protein-Protein Binding Free Energy
    Petukh, Marharyta
    Morrison, Jacon
    Li, Minghui
    Panchenko, Anna
    Alexov, Emil
    BIOPHYSICAL JOURNAL, 2015, 108 (02) : 157A - 158A
  • [6] Fusion of classifiers for predicting protein-protein interactions
    Nanni, L
    NEUROCOMPUTING, 2005, 68 : 289 - 296
  • [7] Computational Resources for Predicting Protein-Protein Interactions
    Tanwar, Himani
    Doss, C. George Priya
    PROTEIN-PROTEIN INTERACTIONS IN HUMAN DISEASE, PT A, 2018, 110 : 251 - 275
  • [8] A database server for predicting protein-protein interactions
    Han, K
    Park, B
    COMPUTATIONAL SCIENCE - ICCS 2004, PT 1, PROCEEDINGS, 2004, 3036 : 271 - 278
  • [9] Computational Methods For Predicting Protein-Protein Interactions
    Pitre, Sylvain
    Alamgir, Md
    Green, James R.
    Dumontier, Michel
    Dehne, Frank
    Golshani, Ashkan
    PROTEIN - PROTEIN INTERACTION, 2008, 110 : 247 - 267
  • [10] Role of ChatGPT in predicting protein-protein interactions
    Ray, Partha Ratim
    Majumder, Poulami
    CURRENT SCIENCE, 2023, 125 (02): : 114 - 114