Combining Docking Pose Rank and Structure with Deep Learning Improves Protein-Ligand Binding Mode Prediction over a Baseline Docking Approach

Cited by: 70
Authors
Morrone, Joseph A. [1]
Weber, Jeffrey K. [1]
Huynh, Tien [1]
Luo, Heng [1]
Cornell, Wendy D. [1]
Affiliations
[1] IBM Corp, TJ Watson Res Ctr, Healthcare & Life Sci Res, Yorktown Hts, NY 10598 USA
Keywords
SCORING FUNCTIONS; BENCHMARK; VALIDATION; SIMILARITY; SELECTION; SETS; OPTIMIZATION; DESCRIPTOR; ACCURACY; SEARCH
DOI
10.1021/acs.jcim.9b00927
Chinese Library Classification number
R914 [Medicinal Chemistry]
Discipline classification code
100701
Abstract
We present a simple, modular graph-based convolutional neural network that takes structural information from protein-ligand complexes as input to generate models for activity and binding mode prediction. Complex structures are generated by a standard docking procedure and fed into a dual-graph architecture that includes separate subnetworks for the ligand bonded topology and the ligand-protein contact map. Recent work has indicated that data set bias drives many past promising results derived from combining deep learning and docking. Our dual-graph network allows contributions from ligand identity that give rise to such biases to be distinguished from effects of protein-ligand interactions on classification. We show that our neural network is capable of learning from protein structural information when, as in the case of binding mode prediction, an unbiased data set is constructed. We next develop a deep learning model for binding mode prediction that uses docking ranking as input in combination with docking structures. This strategy mirrors past consensus models and outperforms a baseline docking program (AutoDock Vina) in a variety of tests, including on cross-docking data sets that mimic real-world docking use cases. Furthermore, the magnitudes of network predictions serve as reliable measures of model confidence.
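The dual-graph idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the mean-neighbor aggregation, the 4.0 angstrom contact cutoff, the reciprocal-rank encoding of the docking rank, and all function names (`aggregate`, `pool`, `contact_neighbors`, `score_pose`) are assumptions made for illustration only. It shows one subnetwork reading the ligand bonded topology, a second reading ligand-protein contacts, and the docking rank joined to both before a final sigmoid.

```python
import math

def aggregate(features, neighbors):
    """One message-passing step: average each node's features with its neighbors'."""
    out = []
    for i, feat in enumerate(features):
        group = [feat] + [features[j] for j in neighbors[i]]
        out.append([sum(col) / len(group) for col in zip(*group)])
    return out

def pool(features):
    """Sum node features into one fixed-length graph vector."""
    return [sum(col) for col in zip(*features)]

def contact_neighbors(ligand_xyz, protein_xyz, cutoff=4.0):
    """For each ligand atom, list protein atoms within `cutoff` (assumed value)."""
    return [[j for j, p in enumerate(protein_xyz) if math.dist(l, p) <= cutoff]
            for l in ligand_xyz]

def score_pose(lig_feats, bonds, lig_xyz, prot_feats, prot_xyz, rank, weights, bias):
    """Return a pose score in (0, 1) from the two graph branches plus docking rank."""
    # Branch 1: ligand bonded topology only (carries ligand-identity information).
    lig_vec = pool(aggregate(lig_feats, bonds))

    # Branch 2: ligand-protein contacts only; each ligand atom receives the
    # mean feature vector of the protein atoms it touches (zeros if none).
    contacts = contact_neighbors(lig_xyz, prot_xyz)
    width = len(prot_feats[0])
    contact_feats = []
    for partners in contacts:
        if partners:
            rows = [prot_feats[j] for j in partners]
            contact_feats.append([sum(col) / len(rows) for col in zip(*rows)])
        else:
            contact_feats.append([0.0] * width)
    contact_vec = pool(contact_feats)

    # Docking rank enters as one extra input (reciprocal rank is an assumption).
    x = lig_vec + contact_vec + [1.0 / rank]
    z = sum(w * v for w, v in zip(weights, x)) + bias
    return 1.0 / (1.0 + math.exp(-z))
```

Keeping the two branches separate until the final layer is what would let the ligand-only contribution be inspected apart from the contact contribution, which is the property the abstract attributes to the dual-graph design.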
Pages: 4170-4179
Page count: 10