PEPPI: Whole-proteome Protein-protein Interaction Prediction through Structure and Sequence Similarity, Functional Association, and Machine Learning

被引:35
作者
Bell, Eric W. [1 ]
Schwartz, Jacob H. [1 ]
Freddolino, Peter L. [1 ,2 ]
Zhang, Yang [1 ,2 ]
机构
[1] Univ Michigan, Dept Computat Med & Bioinformat, 100 Washtenaw Ave, Ann Arbor, MI 48109 USA
[2] Univ Michigan, Dept Biol Chem, 1150 W Med Ctr Dr, Ann Arbor, MI 48109 USA
基金
美国国家科学基金会;
关键词
direct interaction prediction; interology; dimer threading; interactome; SARS-CoV-2; WEB-SERVER; FAT10;
D O I
10.1016/j.jmb.2022.167530
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Proteome-wide identification of protein-protein interactions is a formidable task which has yet to be sufficiently addressed by experimental methodologies. Many computational methods have been developed to predict proteome-wide interaction networks, but few leverage both the sensitivity of structural information and the wide availability of sequence data. We present PEPPI, a pipeline which integrates structural similarity, sequence similarity, functional association data, and machine learning-based classification through a nai spacing diaeresis ve Bayesian classifier model to accurately predict protein-protein interactions at a proteomic scale. Through benchmarking against a set of 798 ground truth interactions and an equal number of non interactions, we have found that PEPPI attains 4.5% higher AUROC than the best of other state-of-theart methods. As a proteomic-scale application, PEPPI was applied to model the interactions which occur between SARS-CoV-2 and human host cells during coronavirus infection, where 403 high-confidence interactions were identified with predictions covering 73% of a gold standard dataset from PSICQUIC and demonstrating significant complementarity with the most recent high-throughput experiments. PEPPI is available both as a webserver and in a standalone version and should be a powerful and generally applicable tool for computational screening of protein-protein interactions.(c) 2022 Elsevier Ltd. All rights reserved.
引用
收藏
页数:9
相关论文
共 34 条
  • [1] BASIC LOCAL ALIGNMENT SEARCH TOOL
    ALTSCHUL, SF
    GISH, W
    MILLER, W
    MYERS, EW
    LIPMAN, DJ
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) : 403 - 410
  • [2] Apweiler R, 2004, NUCLEIC ACIDS RES, V32, pD115, DOI [10.1093/nar/gkw1099, 10.1093/nar/gkh131]
  • [3] PSICQUIC and PSISCORE: accessing and scoring molecular interactions
    Aranda, Bruno
    Blankenburg, Hagen
    Kerrien, Samuel
    Brinkman, Fiona S. L.
    Ceol, Arnaud
    Chautard, Emilie
    Dana, Jose M.
    De Las Rivas, Javier
    Dumousseau, Marine
    Galeota, Eugenia
    Gaulton, Anna
    Goll, Johannes
    Hancock, Robert E. W.
    Isserlin, Ruth
    Jimenez, Rafael C.
    Kerssemakers, Jules
    Khadake, Jyoti
    Lynn, David J.
    Michaut, Magali
    O'Kelly, Gavin
    Ono, Keiichiro
    Orchard, Sandra
    Prieto, Carlos
    Razick, Sabry
    Rigina, Olga
    Salwinski, Lukasz
    Simonovic, Milan
    Velankar, Sameer
    Winter, Andrew
    Wu, Guanming
    Bader, Gary D.
    Cesareni, Gianni
    Donaldson, Ian M.
    Eisenberg, David
    Kleywegt, Gerard J.
    Overington, John
    Ricard-Blum, Sylvie
    Tyers, Mike
    Albrecht, Mario
    Hermjakob, Henning
    [J]. NATURE METHODS, 2011, 8 (07) : 528 - 529
  • [4] The ubiquitin-like modifier FAT10 in antigen processing and antimicrobial defense
    Basler, Michael
    Buerger, Stefanie
    Groettrup, Marcus
    [J]. MOLECULAR IMMUNOLOGY, 2015, 68 (02) : 129 - 132
  • [5] PRISM: a web server and repository for prediction of protein-protein interactions and modeling their 3D complexes
    Baspinar, Alper
    Cukuroglu, Engin
    Nussinov, Ruth
    Keskin, Ozlem
    Gursoy, Attila
    [J]. NUCLEIC ACIDS RESEARCH, 2014, 42 (W1) : W285 - W289
  • [6] Negatome 2.0: a database of non-interacting proteins derived by literature mining, manual annotation and protein structure analysis
    Blohm, Philipp
    Frishman, Goar
    Smialowski, Pawel
    Goebels, Florian
    Wachinger, Benedikt
    Ruepp, Andreas
    Frishman, Dmitrij
    [J]. NUCLEIC ACIDS RESEARCH, 2014, 42 (D1) : D396 - D400
  • [7] Computational design of a synthetic PD-1 agonist
    Cassie, M. Bryana
    Rocklin, Gabriel J.
    Bick, Matthew J.
    Ford, Alex
    Majri-Morrison, Sonia
    Kroll, Ashley V.
    Miller, Chad J.
    Carter, Lauren
    Goreshnik, Inna
    Kang, Alex
    DiMaio, Frank
    Tarbell, Kristin V.
    Baker, David
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2021, 118 (29)
  • [8] Multifaceted protein-protein interaction prediction based on Siamese residual RCNN
    Chen, Muhao
    Ju, Chelsea J. -T.
    Zhou, Guangyu
    Chen, Xuelu
    Zhang, Tianran
    Chang, Kai-Wei
    Zaniolo, Carlo
    Wang, Wei
    [J]. BIOINFORMATICS, 2019, 35 (14) : I305 - I314
  • [9] High-throughput two-hybrid analysis
    Fields, S
    [J]. FEBS JOURNAL, 2005, 272 (21) : 5391 - 5399
  • [10] A computational interactome and functional annotation for the human proteome
    Garzon, Jose Ignacio
    Deng, Lei
    Murray, Diana
    Shapira, Sagi
    Petrey, Donald
    Honig, Barry
    [J]. ELIFE, 2016, 5