Exploring Graph Traversal Algorithms in Graph-Based Molecular Generation

被引:13
作者
Mercado, Rocio [1 ]
Bjerrum, Esben J. [1 ]
Engkvist, Ola [1 ,2 ]
机构
[1] AstraZeneca Gothenburg, R&D, Discovery Sci, Mol AI, S-43150 Molndal, Sweden
[2] Chalmers Univ Technol, Dept Comp Sci & Engn, S-41296 Gothenburg, Sweden
关键词
LIBRARIES;
D O I
10.1021/acs.jcim.1c00777
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
Here, we explore the impact of different graph traversal algorithms on molecular graph generation. We do this by training a graph-based deep molecular generative model to build structures using a node order determined via either a breadth- or depth-first search algorithm. "What we observe is that using a breadth-first traversal leads to better coverage of training data features compared to a depth-first traversal. We have quantified these differences using a variety of metrics on a data set of natural products. These metrics include percent validity, molecular coverage, and molecular shape. We also observe that by using either a breadth- or depth-first traversal it is possible to overtrain the generative models, at which point the results with either graph traversal algorithm are identical.
引用
收藏
页码:2093 / 2100
页数:8
相关论文
共 30 条
[21]   Molecular shape diversity of combinatorial libraries: A prerequisite for broad bioactivity [J].
Sauer, WHB ;
Schwarz, MK .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2003, 43 (03) :987-1003
[22]   Generating Focused Molecule Libraries for Drug Discovery with Recurrent Neural Networks [J].
Segler, Marwin H. S. ;
Kogej, Thierry ;
Tyrchan, Christian ;
Waller, Mark P. .
ACS CENTRAL SCIENCE, 2018, 4 (01) :120-131
[23]  
Shi C., 2021, PREPRINT
[24]  
Simm G. N. C., 2020, PREPRINT
[25]  
Skiena S.S., 2008, The Algorithm Design Manual, P103
[26]   COCONUT online: Collection of Open Natural Products database [J].
Sorokina, Maria ;
Merseburger, Peter ;
Rajan, Kohulan ;
Yirik, Mehmet Aziz ;
Steinbeck, Christoph .
JOURNAL OF CHEMINFORMATICS, 2021, 13 (01)
[27]  
Xu M., PREPRINT
[28]  
You J., 2018, PREPRINT
[29]   MoFlow: An Invertible Flow Model for Generating Molecular Graphs [J].
Zang, Chengxi ;
Wang, Fei .
KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2020, :617-626
[30]   Comparative Study of Deep Generative Models on Chemical Space Coverage [J].
Zhang, Jie ;
Mercado, Rocio ;
Engkvist, Ola ;
Chen, Hongming .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2021, 61 (06) :2572-2581