Generative Models Should at Least Be Able to Design Molecules That Dock Well: A New Benchmark

被引:18
作者
Cieplinski, Tobiasz [1 ]
Danel, Tomasz [1 ]
Podlewska, Sabina [2 ]
Jastrzebski, Stanislaw [1 ,3 ]
机构
[1] Jagiellonian Univ, Fac Math & Comp Sci, PL-30348 Krakow, Poland
[2] Polish Acad Sci, Maj Inst Pharmacol, PL-31343 Krakow, Poland
[3] Mol One, PL-00807 Warsaw, Poland
关键词
SCORING FUNCTION; DRUG DISCOVERY;
D O I
10.1021/acs.jcim.2c01355
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
Designing compounds with desired properties is a keyelement ofthe drug discovery process. However, measuring progress in the fieldhas been challenging due to the lack of realistic retrospective benchmarks,and the large cost of prospective validation. To close this gap, wepropose a benchmark based on docking, a widely used computationalmethod for assessing molecule binding to a protein. Concretely, thegoal is to generate drug-like molecules that are scored highly bySMINA, a popular docking software. We observe that various graph-basedgenerative models fail to propose molecules with a high docking scorewhen trained using a realistically sized training set. This suggestsa limitation of the current incarnation of models for de novo drug design. Finally, we also include simpler tasks in the benchmarkbased on a simpler scoring function. We release the benchmark as aneasy to use package available at https://github.com/cieplinski-tobiasz/smina-docking-benchmark. We hope that our benchmark will serve as a stepping stone towardthe goal of automatically generating promising drug candidates.
引用
收藏
页码:3238 / 3247
页数:10
相关论文
共 49 条
[21]   Lessons Learned in Empirical Scoring with smina from the CSAR 2011 Benchmarking Exercise [J].
Koes, David Ryan ;
Baumgartner, Matthew P. ;
Camacho, Carlos J. .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2013, 53 (08) :1893-1904
[22]  
Kusner M. J., 2017, PROC 34 INT C MACH L, P1945
[23]   Virtual Screening Strategies in Drug Discovery: A Critical Review [J].
Lavecchia, A. ;
Di Giovanni, C. .
CURRENT MEDICINAL CHEMISTRY, 2013, 20 (23) :2839-2860
[24]  
Luo Shitong, 2021, Advances in Neural Information Processing Systems, V34
[25]   Mol-CycleGAN: a generative model for molecular optimization [J].
Maziarka, Lukasz ;
Pocha, Agnieszka ;
Kaczmarczyk, Jan ;
Rataj, Krzysztof ;
Danel, Tomasz ;
Warchot, Michal .
JOURNAL OF CHEMINFORMATICS, 2020, 12 (01)
[26]   MEMES: Machine learning framework for Enhanced MolEcular Screening [J].
Mehta, Sarvesh ;
Laghuvarapu, Siddhartha ;
Pathak, Yashaswi ;
Sethi, Aaftaab ;
Alvala, Mallika ;
Priyakumar, U. Deva .
CHEMICAL SCIENCE, 2021, 12 (35) :11710-11721
[27]   DSX: A Knowledge-Based Scoring Function for the Assessment of Protein-Ligand Complexes [J].
Neudert, Gerd ;
Klebe, Gerhard .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2011, 51 (10) :2731-2745
[28]   Parallel tempered genetic algorithm guided by deep neural networks for inverse molecular design [J].
Nigam, AkshatKumar ;
Pollice, Robert ;
Aspuru-Guzik, Alan .
DIGITAL DISCOVERY, 2022, 1 (04) :390-404
[29]   Molecular de-novo design through deep reinforcement learning [J].
Olivecrona, Marcus ;
Blaschke, Thomas ;
Engkvist, Ola ;
Chen, Hongming .
JOURNAL OF CHEMINFORMATICS, 2017, 9
[30]   Learning from the Harvard Clean Energy Project: The Use of Neural Networks to Accelerate Materials Discovery [J].
Pyzer-Knapp, Edward O. ;
Li, Kewei ;
Aspuru-Guzik, Alan .
ADVANCED FUNCTIONAL MATERIALS, 2015, 25 (41) :6495-6502