PLAS-5k: Dataset of Protein-Ligand Affinities from Molecular Dynamics for Machine Learning Applications

被引:8
作者
Korlepara, Divya B. [1 ]
Vasavi, C. S. [1 ]
Jeurkar, Shruti [1 ]
Pal, Pradeep Kumar [1 ]
Roy, Subhajit [1 ,2 ]
Mehta, Sarvesh [1 ]
Sharma, Shubham [1 ]
Kumar, Vishal [1 ]
Muvva, Charuvaka [1 ]
Sridharan, Bhuvanesh [1 ]
Garg, Akshit [1 ]
Modee, Rohit [1 ]
Bhati, Agastya P. [3 ]
Nayar, Divya [4 ]
Priyakumar, U. Deva [1 ]
机构
[1] Int Inst Informat Technol, Ctr Computat Nat Sci & Bioinformat, Hyderabad 500032, India
[2] Univ Mumbai, UM DAE Ctr Excellence Basic Sci, Mumbai, Maharashtra, India
[3] UCL, Ctr Computat Sci, Dept Chem, London WC1H 0AJ, England
[4] Indian Inst Technol Delhi, Dept Mat Sci & Engn, New Delhi 110016, India
关键词
BOLTZMANN SURFACE-AREA; ENERGY CALCULATIONS; BINDING MECHANISM; MM-PBSA; TUBERCULOSIS; PERFORMANCE; DISCOVERY; ACCURACY; DOCKING; SIMULATIONS;
D O I
10.1038/s41597-022-01631-9
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Computational methods and recently modern machine learning methods have played a key role in structure-based drug design. Though several benchmarking datasets are available for machine learning applications in virtual screening, accurate prediction of binding affinity for a protein-ligand complex remains a major challenge. New datasets that allow for the development of models for predicting binding affinities better than the state-of-the-art scoring functions are important. For the first time, we have developed a dataset, PLAS-5k comprised of 5000 protein-ligand complexes chosen from PDB database. The dataset consists of binding affinities along with energy components like electrostatic, van derWaals, polar and non-polar solvation energy calculated from molecular dynamics simulations using MMPBSA (Molecular Mechanics Poisson-Boltzmann Surface Area) method. The calculated binding affinities outperformed docking scores and showed a good correlation with the available experimental values. The availability of energy components may enable optimization of desired components during machine learning-based drug design. Further, OnionNet model has been retrained on PLAS-5k dataset and is provided as a baseline for the prediction of binding affinities.
引用
收藏
页数:10
相关论文
共 77 条
[1]   Improved Ligand Binding Energies Derived from Molecular Dynamics: Replicate Sampling Enhances the Search of Conformational Space [J].
Adler, Marc ;
Beroza, Paul .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2013, 53 (08) :2065-2072
[2]   Statistical Analysis on the Performance of Molecular Mechanics Poisson-Boltzmann Surface Area versus Absolute Binding Free Energy Calculations: Bromodomains as a Case Study [J].
Aldeghi, Matteo ;
Bodkin, Michael J. ;
Knapp, Stefan ;
Biggin, Philip C. .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2017, 57 (09) :2203-2221
[3]   Protein conformational flexibility modulates kinetics and thermodynamics of drug binding [J].
Amaral, M. ;
Kokh, D. B. ;
Bomke, J. ;
Wegener, A. ;
Buchstaller, H. P. ;
Eggenweiler, H. M. ;
Matias, P. ;
Sirrenberg, C. ;
Wade, R. C. ;
Frech, M. .
NATURE COMMUNICATIONS, 2017, 8
[4]  
[Anonymous], 2015, arXiv
[5]  
[Anonymous], 2009, GAUSSVIEW VERSION 5
[6]   Lessons from repurposing HIV drugs: a prospective novel strategy for drug design [J].
Batman, Gavin ;
Hampson, Lynne ;
Hampson, Ian N. .
FUTURE VIROLOGY, 2011, 6 (09) :1021-1023
[7]   An evaluation of indirubin analogues as phosphorylase kinase inhibitors [J].
Begum, Jaida ;
Skamnaki, Vassiliki T. ;
Moffatt, Colin ;
Bischler, Nicolas ;
Sarrou, Josephine ;
Skaltsounis, Alexios-Leandros ;
Leonidas, Demetres D. ;
Oikonomakos, Nikos G. ;
Hayes, Joseph M. .
JOURNAL OF MOLECULAR GRAPHICS & MODELLING, 2015, 61 :231-242
[8]   TopologyNet: Topology based deep convolutional and multi-task neural networks for biomolecular property predictions [J].
Cang, Zixuan ;
Wei, Guowei .
PLOS COMPUTATIONAL BIOLOGY, 2017, 13 (07)
[9]   The Amber biomolecular simulation programs [J].
Case, DA ;
Cheatham, TE ;
Darden, T ;
Gohlke, H ;
Luo, R ;
Merz, KM ;
Onufriev, A ;
Simmerling, C ;
Wang, B ;
Woods, RJ .
JOURNAL OF COMPUTATIONAL CHEMISTRY, 2005, 26 (16) :1668-1688
[10]   Deciphering the mechanism behind the varied binding activities of COXIBs through Molecular Dynamic Simulations, MM-PBSA binding energy calculations and per-residue energy decomposition studies [J].
Chaudhary, Neha ;
Aparoy, Polamarasetty .
JOURNAL OF BIOMOLECULAR STRUCTURE & DYNAMICS, 2017, 35 (04) :868-882