The Impact of Docking Pose Generation Error on the Prediction of Binding Affinity

被引:6
作者
Li, Hongjian [1 ]
Leung, Kwong-Sak [1 ]
Wong, Man-Hon [1 ]
Ballester, Pedro J. [2 ,3 ,4 ,5 ]
机构
[1] Chinese Univ Hong Kong, Dept Comp Sci & Engn, Shatin, Hong Kong, Peoples R China
[2] INSERM, U1068, Canc Res Ctr Marseille, F-13009 Marseille, France
[3] Inst J Paoli I Calmettes, F-13009 Marseille, France
[4] Aix Marseille Univ, F-13284 Marseille, France
[5] CNRS, UMR7258, F-13009 Marseille, France
来源
COMPUTATIONAL INTELLIGENCE METHODS FOR BIOINFORMATICS AND BIOSTATISTICS, CIBB 2014 | 2015年 / 8623卷
关键词
molecular docking; scoring functions; random forest; chemical informatics; structural bioinformatics; SCORING FUNCTIONS; PROTEIN;
D O I
10.1007/978-3-319-24462-4_20
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Docking is a computational technique that predicts the preferred conformation and binding affinity of a ligand molecule as bound to a protein pocket. It is often employed to identify a molecule that binds tightly to the target, so that a small concentration of the molecule is sufficient to modulate its biochemical function. The use of non-parametric machine learning, a data-driven approach that circumvents the need of modeling assumptions, has recently been shown to introduce a large improvement in the accuracy of docking scoring. However, the impact of pose generation error on binding affinity prediction is still to be investigated. Here we show that the impact of pose generation is generally limited to a small decline in the accuracy of scoring. These machine-learning scoring functions retained the highest performance on PDBbind v2007 core set in this common scenario where one has to predict the binding affinity of docked poses instead of that of co-crystallized poses (e.g. drug lead optimization). Nevertheless, we observed that these functions do not perform so well at predicting the near-native pose of a ligand. This suggests that having different scoring functions for different problems is a better approach than using the same scoring function for all problems.
引用
收藏
页码:231 / 241
页数:11
相关论文
共 13 条
[1]  
Ballester PJ, 2012, LECT NOTES COMPUT SC, V7632, P14, DOI 10.1007/978-3-642-34123-6_2
[2]   Does a More Precise Chemical Description of Protein-Ligand Complexes Lead to More Accurate Prediction of Binding Affinity? [J].
Ballester, Pedro J. ;
Schreyer, Adrian ;
Blundell, Tom L. .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2014, 54 (03) :944-955
[3]   Hierarchical virtual screening for the discovery of new molecular scaffolds in antibacterial hit identification [J].
Ballester, Pedro J. ;
Mangold, Martina ;
Howard, Nigel I. ;
Robinson, Richard L. Marchese ;
Abell, Chris ;
Blumberger, Jochen ;
Mitchell, John B. O. .
JOURNAL OF THE ROYAL SOCIETY INTERFACE, 2012, 9 (77) :3196-3207
[4]   Comments on "Leave-Cluster-Out Cross-Validation Is Appropriate for Scoring Functions Derived from Diverse Protein Data Sets": Significance for the Validation of Scoring Functions [J].
Ballester, Pedro J. ;
Mitchell, John B. O. .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2011, 51 (08) :1739-1741
[5]   A machine learning approach to predicting protein-ligand binding affinity with applications to molecular docking [J].
Ballester, Pedro J. ;
Mitchell, John B. O. .
BIOINFORMATICS, 2010, 26 (09) :1169-1175
[6]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[7]   Comparative Assessment of Scoring Functions on a Diverse Test Set [J].
Cheng, Tiejun ;
Li, Xun ;
Li, Yan ;
Liu, Zhihai ;
Wang, Renxiao .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2009, 49 (04) :1079-1093
[8]  
Li H, 2014, CURR EYE RES, V9, P1, DOI DOI 10.1109/JSTARS.2014.2348173
[9]   iview: an interactive WebGL visualizer for protein-ligand complex [J].
Li, Hongjian ;
Leung, Kwong-Sak ;
Nakane, Takanori ;
Wong, Man-Hon .
BMC BIOINFORMATICS, 2014, 15
[10]   AutoDock4 and AutoDockTools4: Automated Docking with Selective Receptor Flexibility [J].
Morris, Garrett M. ;
Huey, Ruth ;
Lindstrom, William ;
Sanner, Michel F. ;
Belew, Richard K. ;
Goodsell, David S. ;
Olson, Arthur J. .
JOURNAL OF COMPUTATIONAL CHEMISTRY, 2009, 30 (16) :2785-2791