Generative Models Should at Least Be Able to Design Molecules That Dock Well: A New Benchmark

被引:18
作者
Cieplinski, Tobiasz [1 ]
Danel, Tomasz [1 ]
Podlewska, Sabina [2 ]
Jastrzebski, Stanislaw [1 ,3 ]
机构
[1] Jagiellonian Univ, Fac Math & Comp Sci, PL-30348 Krakow, Poland
[2] Polish Acad Sci, Maj Inst Pharmacol, PL-31343 Krakow, Poland
[3] Mol One, PL-00807 Warsaw, Poland
关键词
SCORING FUNCTION; DRUG DISCOVERY;
D O I
10.1021/acs.jcim.2c01355
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
Designing compounds with desired properties is a keyelement ofthe drug discovery process. However, measuring progress in the fieldhas been challenging due to the lack of realistic retrospective benchmarks,and the large cost of prospective validation. To close this gap, wepropose a benchmark based on docking, a widely used computationalmethod for assessing molecule binding to a protein. Concretely, thegoal is to generate drug-like molecules that are scored highly bySMINA, a popular docking software. We observe that various graph-basedgenerative models fail to propose molecules with a high docking scorewhen trained using a realistically sized training set. This suggestsa limitation of the current incarnation of models for de novo drug design. Finally, we also include simpler tasks in the benchmarkbased on a simpler scoring function. We release the benchmark as aneasy to use package available at https://github.com/cieplinski-tobiasz/smina-docking-benchmark. We hope that our benchmark will serve as a stepping stone towardthe goal of automatically generating promising drug candidates.
引用
收藏
页码:3238 / 3247
页数:10
相关论文
共 49 条
[1]  
[Anonymous], 2016, RDKIT OPEN SOURCE CH
[2]  
Aumentado-Armstrong T., 2018, ARXIV PREPRINT ARXIV
[3]  
Bickerton GR, 2012, NAT CHEM, V4, P90, DOI [10.1038/nchem.1243, 10.1038/NCHEM.1243]
[4]   GuacaMol: Benchmarking Models for de Novo Molecular Design [J].
Brown, Nathan ;
Fiscato, Marco ;
Segler, Marwin H. S. ;
Vaucher, Alain C. .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2019, 59 (03) :1096-1108
[5]   The rise of deep learning in drug discovery [J].
Chen, Hongming ;
Engkvist, Ola ;
Wang, Yinhai ;
Olivecrona, Marcus ;
Blaschke, Thomas .
DRUG DISCOVERY TODAY, 2018, 23 (06) :1241-1250
[6]  
Cieplinski T., 2020, ARXIV PREPRINT ARXIV
[7]   Autonomous Discovery in the Chemical Sciences Part II: Outlook [J].
Coley, Connor W. ;
Eyke, Natalie S. ;
Jensen, Klavs F. .
ANGEWANDTE CHEMIE-INTERNATIONAL EDITION, 2020, 59 (52) :23414-23436
[8]   AUTOMATED SITE-DIRECTED DRUG DESIGN - A GENERAL ALGORITHM FOR KNOWLEDGE ACQUISITION ABOUT HYDROGEN-BONDING REGIONS AT PROTEIN SURFACES [J].
DANZIGER, DJ ;
DEAN, PM .
PROCEEDINGS OF THE ROYAL SOCIETY SERIES B-BIOLOGICAL SCIENCES, 1989, 236 (1283) :101-+
[9]  
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[10]  
Drotar P., 2021, ARXIV PREPRINTARXIV2