Efficient learning of non-autoregressive graph variational autoencoders for molecular graph generation

被引:44
作者
Kwon, Youngchun [1 ,2 ]
Yoo, Jiho [1 ]
Choi, Youn-Suk [1 ]
Son, Won-Joon [1 ]
Lee, Dongseon [1 ]
Kang, Seokho [3 ]
机构
[1] Samsung Elect Co Ltd, Samsung Adv Inst Technol, 130 Samsung Ro, Suwon, South Korea
[2] Seoul Natl Univ, Dept Comp Sci & Engn, 1 Gwanak Ro, Seoul, South Korea
[3] Sungkyunkwan Univ, Dept Syst Management Engn, 2066 Seobu Ro, Suwon, South Korea
基金
新加坡国家研究基金会;
关键词
Molecular graph; Variational autoencoder; Graph neural network; Deep learning; DRUG DISCOVERY; COMBINATORIAL; LIBRARIES; PREDICTION; DATABASE;
D O I
10.1186/s13321-019-0396-x
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
With the advancements in deep learning, deep generative models combined with graph neural networks have been successfully employed for data-driven molecular graph generation. Early methods based on the non-autoregressive approach have been effective in generating molecular graphs quickly and efficiently but have suffered from low performance. In this paper, we present an improved learning method involving a graph variational autoencoder for efficient molecular graph generation in a non-autoregressive manner. We introduce three additional learning objectives and incorporate them into the training of the model: approximate graph matching, reinforcement learning, and auxiliary property prediction. We demonstrate the effectiveness of the proposed method by evaluating it for molecular graph generation tasks using QM9 and ZINC datasets. The model generates molecular graphs with high chemical validity and diversity compared with existing non-autoregressive methods. It can also conditionally generate molecular graphs satisfying various target conditions.
引用
收藏
页数:10
相关论文
共 42 条
[1]  
[Anonymous], 2017, ARXIV PREPRINT ARXIV
[2]  
[Anonymous], 2018, ICML
[3]  
[Anonymous], 2017, PMLR
[4]  
Battaglia P. W., 2018, ARXIV 180601261
[5]  
Bresson X., 2019, ARXIV190603412
[6]   GuacaMol: Benchmarking Models for de Novo Molecular Design [J].
Brown, Nathan ;
Fiscato, Marco ;
Segler, Marwin H. S. ;
Vaucher, Alain C. .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2019, 59 (03) :1096-1108
[7]   The rise of deep learning in drug discovery [J].
Chen, Hongming ;
Engkvist, Ola ;
Wang, Yinhai ;
Olivecrona, Marcus ;
Blaschke, Thomas .
DRUG DISCOVERY TODAY, 2018, 23 (06) :1241-1250
[8]  
Dai H., 2018, P INT C LEARN REPR
[9]  
De Cao N., 2018, MolGAN: an implicit generative model for small molecular graphs.
[10]   The ChEMBL database in 2017 [J].
Gaulton, Anna ;
Hersey, Anne ;
Nowotka, Michal ;
Bento, A. Patricia ;
Chambers, Jon ;
Mendez, David ;
Mutowo, Prudence ;
Atkinson, Francis ;
Bellis, Louisa J. ;
Cibrian-Uhalte, Elena ;
Davies, Mark ;
Dedman, Nathan ;
Karlsson, Anneli ;
Magarinos, Maria Paula ;
Overington, John P. ;
Papadatos, George ;
Smit, Ines ;
Leach, Andrew R. .
NUCLEIC ACIDS RESEARCH, 2017, 45 (D1) :D945-D954