Parameter estimation for scoring protein-ligand interactions using negative training data

被引:137
作者
Pham, Tuan A.
Jain, Ajay N.
机构
[1] Univ Calif San Francisco, Inst Canc Res, Dept Biopharmaceut Sci, San Francisco, CA 94143 USA
[2] Univ Calif San Francisco, Dept Lab Med, San Francisco, CA 94143 USA
关键词
D O I
10.1021/jm050040j
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
Surflex-Dock employs an empirically derived scoring function to rank putative protein-ligand interactions by flexible docking of small molecules to proteins of known structure. The scoring function employed by Surflex was developed purely on the basis of positive data, comprising noncovalent protein-ligand complexes with known binding affinities. Consequently, scoring function terms for improper interactions received little weight in parameter estimation, and an ad hoc scheme for avoiding protein-ligand interpenetration was adopted. We present a generalized method for incorporating synthetically generated negative training data, which allows for rigorous estimation of all scoring function parameters. Geometric docking accuracy remained excellent under the new parametrization. In addition, a test of screening utility covering a diverse set of 29 proteins and corresponding ligand sets showed improved performance. Maximal enrichment of true ligands over nonligands exceeded 20-fold in over 80% of cases, with enrichment of greater than 100-fold in over 50% of cases.
引用
收藏
页码:5856 / 5868
页数:13
相关论文
共 50 条
[21]   A Bayesian Statistical Approach of Improving KnowledgeBased Scoring Functions for Protein-Ligand Interactions [J].
Grinter, Sam Z. ;
Zou, Xiaoqin .
JOURNAL OF COMPUTATIONAL CHEMISTRY, 2014, 35 (12) :932-943
[22]   Convolutional neural networks for protein-ligand scoring [J].
Ragoza, Matt ;
Collins, Jasmine ;
Koes, David .
ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2016, 252
[23]   Protein-ligand scoring with convolutional neural networks [J].
Koes, David ;
Ragoza, Matthew ;
Idrobo, Elisa ;
Hochuli, Joshua ;
Sunseri, Jocelyn .
ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2017, 253
[24]   Using Thrombin to Study Protein-Ligand Interactions. [J].
Banner, David .
ACTA CRYSTALLOGRAPHICA A-FOUNDATION AND ADVANCES, 2004, 60 :S27-S27
[25]   Alternative network architectures for protein-ligand scoring [J].
Turner, Lillian ;
Mittal, Pulkit ;
Ragoza, Matthew ;
Koes, David .
ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2018, 255
[26]   Survey of the scoring functions for protein-ligand docking [J].
Sapundzhi, Fatima ;
Prodanova, Krasimira ;
Lazarova, Meglena .
PROCEEDINGS OF THE 45TH INTERNATIONAL CONFERENCE ON APPLICATION OF MATHEMATICS IN ENGINEERING AND ECONOMICS (AMEE'19), 2019, 2172
[27]   Evaluation of Scoring Functions for Protein-ligand Docking [J].
Pakpahan, M. T. ;
Rusmerryani, M. ;
Kawaguchi, K. ;
Saito, H. ;
Nagao, H. .
4TH INTERNATIONAL SYMPOSIUM ON SLOW DYNAMICS IN COMPLEX SYSTEMS: KEEP GOING TOHOKU, 2013, 1518 :645-648
[28]   Protein-ligand docking strategies and scoring functions [J].
Godden, J ;
Stahura, F ;
Bajorath, J .
JOURNAL OF MOLECULAR GRAPHICS & MODELLING, 1998, 16 (4-6) :292-292
[29]   Protein-Ligand Scoring with Convolutional Neural Networks [J].
Ragoza, Matthew ;
Hochuli, Joshua ;
Idrobo, Elisa ;
Sunseri, Jocelyn ;
Koes, David Ryan .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2017, 57 (04) :942-957
[30]   Convolutional neural networks for protein-ligand scoring [J].
Ragoza, Matthew ;
Collins, Jasmine ;
Koes, David .
ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2016, 251