Exploring Graph Traversal Algorithms in Graph-Based Molecular Generation

被引:13
作者
Mercado, Rocio [1 ]
Bjerrum, Esben J. [1 ]
Engkvist, Ola [1 ,2 ]
机构
[1] AstraZeneca Gothenburg, R&D, Discovery Sci, Mol AI, S-43150 Molndal, Sweden
[2] Chalmers Univ Technol, Dept Comp Sci & Engn, S-41296 Gothenburg, Sweden
关键词
LIBRARIES;
D O I
10.1021/acs.jcim.1c00777
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
Here, we explore the impact of different graph traversal algorithms on molecular graph generation. We do this by training a graph-based deep molecular generative model to build structures using a node order determined via either a breadth- or depth-first search algorithm. "What we observe is that using a breadth-first traversal leads to better coverage of training data features compared to a depth-first traversal. We have quantified these differences using a variety of metrics on a data set of natural products. These metrics include percent validity, molecular coverage, and molecular shape. We also observe that by using either a breadth- or depth-first traversal it is possible to overtrain the generative models, at which point the results with either graph traversal algorithm are identical.
引用
收藏
页码:2093 / 2100
页数:8
相关论文
共 30 条
[1]   Two-and Three-dimensional Rings in Drugs [J].
Aldeghi, Matteo ;
Malhotra, Shipra ;
Selwood, David L. ;
Chan, Ah Wing Edith .
CHEMICAL BIOLOGY & DRUG DESIGN, 2014, 83 (04) :450-461
[2]   REINVENT 2.0: An AI Tool for De Novo Drug Design [J].
Blaschke, Thomas ;
Arus-Pous, Josep ;
Chen, Hongming ;
Margreitter, Christian ;
Tyrchan, Christian ;
Engkvist, Ola ;
Papadopoulos, Kostas ;
Patronov, Atanas .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2020, 60 (12) :5918-5922
[3]  
Bradshaw J., 2020, PREPRINT
[4]  
Bradshaw J, 2019, ADV NEUR IN, V32
[5]   Molecular representations in AI-driven drug discovery: a review and practical guide [J].
David, Laurianne ;
Thakkar, Amol ;
Mercado, Rocio ;
Engkvist, Ola .
JOURNAL OF CHEMINFORMATICS, 2020, 12 (01)
[6]   An algorithm to identify functional groups in organic molecules [J].
Ertl, Peter .
JOURNAL OF CHEMINFORMATICS, 2017, 9
[7]   Automatic Chemical Design Using a Data-Driven Continuous Representation of Molecules [J].
Gomez-Bombarelli, Rafael ;
Wei, Jennifer N. ;
Duvenaud, David ;
Hernandez-Lobato, Jose Miguel ;
Sanchez-Lengeling, Benjamin ;
Sheberla, Dennis ;
Aguilera-Iparraguirre, Jorge ;
Hirzel, Timothy D. ;
Adams, Ryan P. ;
Aspuru-Guzik, Alan .
ACS CENTRAL SCIENCE, 2018, 4 (02) :268-276
[8]  
Gottipati S. K., 2020, PREPRINT
[9]   Artificial intelligence in drug discovery: recent advances and future perspectives [J].
Jimenez-Luna, Jose ;
Grisoni, Francesca ;
Weskamp, Nils ;
Schneider, Gisbert .
EXPERT OPINION ON DRUG DISCOVERY, 2021, 16 (09) :949-959
[10]  
Jin W., 2020, PREPRINT