Flexible protein-protein docking with a multitrack iterative transformer

被引:5
|
作者
Chu, Lee-Shin [1 ]
Ruffolo, Jeffrey A. [2 ]
Harmalkar, Ameya [1 ]
Gray, Jeffrey J. [1 ,2 ,3 ]
机构
[1] Johns Hopkins Univ, Dept Chem & Biomol Engn, Baltimore, MD USA
[2] Johns Hopkins Univ, Program Mol Biophys, Baltimore, MD USA
[3] Johns Hopkins Univ, Dept Chem & Biomol Engn, Baltimore, MD 21218 USA
基金
美国国家卫生研究院;
关键词
deep learning; flexible protein docking; protein-protein interaction; SHAPE COMPLEMENTARITY; PREDICTION; ELECTROSTATICS; BACKBONE; CAPRI; OPTIMIZATION; RECOGNITION; PERFORMANCE; BENCHMARK; HADDOCK;
D O I
10.1002/pro.4862
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Conventional protein-protein docking algorithms usually rely on heavy candidate sampling and reranking, but these steps are time-consuming and hinder applications that require high-throughput complex structure prediction, for example, structure-based virtual screening. Existing deep learning methods for protein-protein docking, despite being much faster, suffer from low docking success rates. In addition, they simplify the problem to assume no conformational changes within any protein upon binding (rigid docking). This assumption precludes applications when binding-induced conformational changes play a role, such as allosteric inhibition or docking from uncertain unbound model structures. To address these limitations, we present GeoDock, a multitrack iterative transformer network to predict a docked structure from separate docking partners. Unlike deep learning models for protein structure prediction that input multiple sequence alignments, GeoDock inputs just the sequences and structures of the docking partners, which suits the tasks when the individual structures are given. GeoDock is flexible at the protein residue level, allowing the prediction of conformational changes upon binding. On the Database of Interacting Protein Structures (DIPS) test set, GeoDock achieves a 43% top-1 success rate, outperforming all other tested methods. However, in the standard DIPS train/test splits, we discovered contamination of close homologs in the training set. After decontaminating the training set, the success rate is 31%. On the DB5.5 test set and a benchmark dataset of antibody-antigen complexes, GeoDock outperforms the deep learning models trained using the same dataset but falls behind most of the conventional methods and AlphaFold-Multimer. GeoDock attains an average inference speed of under 1 s on a single GPU, enabling its application in large-scale structure screening. Although binding-induced conformational changes are still a challenge owing to limited training and evaluation data, our architecture sets up the foundation to capture this backbone flexibility. Code and a demonstration Jupyter notebook are available at .
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Flexible protein-protein docking
    Bonvin, AM
    CURRENT OPINION IN STRUCTURAL BIOLOGY, 2006, 16 (02) : 194 - 200
  • [2] Principles of flexible protein-protein docking
    Andrusier, Nelly
    Mashiach, Efrat
    Nussinov, Ruth
    Wolfson, Haim J.
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2008, 73 (02) : 271 - 289
  • [3] Using MELD in flexible protein-protein docking
    Brini, Emiliano
    Dill, Ken
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2018, 255
  • [4] SwarmDock: a server for flexible protein-protein docking
    Torchala, Mieczyslaw
    Moal, Iain H.
    Chaleil, Raphael A. G.
    Fernandez-Recio, Juan
    Bates, Paul A.
    BIOINFORMATICS, 2013, 29 (06) : 807 - 809
  • [5] Flexible protein-protein docking with discrete molecular dynamics
    Emperador, A.
    FEBS JOURNAL, 2012, 279 : 532 - 532
  • [6] Efficient flexible backbone protein-protein docking for challenging targets
    Marze, Nicholas A.
    Burman, Shourya S. Roy
    Sheffler, William
    Gray, Jeffrey J.
    BIOINFORMATICS, 2018, 34 (20) : 3461 - 3469
  • [7] A Web Interface for Easy Flexible Protein-Protein Docking with ATTRACT
    de Vries, Sjoerd J.
    Schindler, Christina E. M.
    de Beauchene, Isaure Chauvot
    Zacharias, Martin
    BIOPHYSICAL JOURNAL, 2015, 108 (03) : 462 - 465
  • [8] Protein-Protein Docking
    Ehrlich, Lutz P.
    Wade, Rebecca C.
    REVIEWS IN COMPUTATIONAL CHEMISTRY, VOL 17, 2001, 17 : 61 - 97
  • [9] A Large Decoy Set of Protein-Protein Complexes Produced by Flexible Docking
    Launay, Guillaume
    Simonson, Thomas
    JOURNAL OF COMPUTATIONAL CHEMISTRY, 2011, 32 (01) : 106 - 120
  • [10] Inclusion of Cryo-EM data in flexible protein-protein docking
    Zacharias, Martin
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2018, 256