A multidimensional dataset for structure-based machine learning

被引:0
作者
Holcomb, Matthew [1 ]
Forli, Stefano [1 ]
机构
[1] Scripps Res, Dept Integrat Struct & Computat Biol, La Jolla, CA 92037 USA
来源
NATURE COMPUTATIONAL SCIENCE | 2024年 / 4卷 / 05期
关键词
D O I
10.1038/s43588-024-00631-6
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
MISATO, a dataset for structure-based drug discovery combines quantum mechanics property data and molecular dynamics simulations on similar to 20,000 protein-ligand structures, substantially extends the amount of data available to the community and holds potential for advancing work in drug discovery.
引用
收藏
页码:318 / 319
页数:2
相关论文
共 9 条
[1]  
Aggarwal R., 2021, PREPRINT
[2]   PoseBusters: AI-based docking methods fail to generate physically valid poses or generalise to novel sequences [J].
Buttenschoen, Martin ;
Morris, Garrett M. ;
Deane, Charlotte M. .
CHEMICAL SCIENCE, 2024, 15 (09) :3130-3139
[3]   Quantum chemical benchmark databases of gold-standard dimer interaction energies [J].
Donchev, Alexander G. ;
Taube, Andrew G. ;
Decolvenaere, Elizabeth ;
Hargus, Cory ;
McGibbon, Robert T. ;
Law, Ka-Hei ;
Gregersen, Brent A. ;
Li, Je-Luen ;
Palmo, Kim ;
Siva, Karthik ;
Bergdorf, Michael ;
Klepeis, John L. ;
Shaw, David E. .
SCIENTIFIC DATA, 2021, 8 (01)
[4]   AutoDock Vina 1.2.0: New Docking Methods, Expanded Force Field, and Python']Python Bindings [J].
Eberhardt, Jerome ;
Santos-Martins, Diogo ;
Tillack, Andreas F. ;
Forli, Stefano .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2021, 61 (08) :3891-3898
[5]   PLAS-5k: Dataset of Protein-Ligand Affinities from Molecular Dynamics for Machine Learning Applications [J].
Korlepara, Divya B. ;
Vasavi, C. S. ;
Jeurkar, Shruti ;
Pal, Pradeep Kumar ;
Roy, Subhajit ;
Mehta, Sarvesh ;
Sharma, Shubham ;
Kumar, Vishal ;
Muvva, Charuvaka ;
Sridharan, Bhuvanesh ;
Garg, Akshit ;
Modee, Rohit ;
Bhati, Agastya P. ;
Nayar, Divya ;
Priyakumar, U. Deva .
SCIENTIFIC DATA, 2022, 9 (01)
[6]   FreeSolv: a database of experimental and calculated hydration free energies, with input files [J].
Mobley, David L. ;
Guthrie, J. Peter .
JOURNAL OF COMPUTER-AIDED MOLECULAR DESIGN, 2014, 28 (07) :711-720
[7]  
PDB Statistics, 2024, OVERALL GROWTH RELEA
[8]   MISATO: machine learning dataset of protein-ligand complexes for structure-based drug discovery [J].
Siebenmorgen, Till ;
Menezes, Filipe ;
Benassou, Sabrina ;
Merdivan, Erinc ;
Didi, Kieran ;
Mourao, Andre Santos Dias ;
Kitel, Radoslaw ;
Lio, Pietro ;
Kesselheim, Stefan ;
Piraud, Marie ;
Theis, Fabian J. ;
Sattler, Michael ;
Popowicz, Grzegorz M. .
NATURE COMPUTATIONAL SCIENCE, 2024, 4 (05) :367-378
[9]   The PDBbind database: Collection of binding affinities for protein-ligand complexes with known three-dimensional structures [J].
Wang, RX ;
Fang, XL ;
Lu, YP ;
Wang, SM .
JOURNAL OF MEDICINAL CHEMISTRY, 2004, 47 (12) :2977-2980