Multi-Property Molecular Optimization using an Integrated Poly-Cycle Architecture

被引:8
作者
Barshatski, Guy [1 ]
Nordon, Galia [1 ]
Radinsky, Kira [1 ]
机构
[1] Technion Israel Inst Technol, Haifa, Israel
来源
PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021 | 2021年
关键词
AI Multi-property Lead Optimization; Drug Discovery; ML for Healthcare;
D O I
10.1145/3459637.3481938
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Molecular lead optimization is an important task of drug discovery focusing on generating molecules similar to a drug candidate but with enhanced properties. Most prior works focused on optimizing a single property. However, in real settings, we wish to find molecules that satisfy multiple constraints, e.g., potency and safety. Simultaneously optimizing these constraints was shown to be difficult, mostly due to the lack of training examples satisfying all constraints. In this work, we present a novel approach for multi-property optimization. Unlike prior approaches, that require a large training set of pairs of a lead molecule and an enhanced molecule, our approach is unpaired. Our architecture learns a transformation for each property optimization separately, while constraining the latent embedding space between all transformations. This allows generating a molecule which optimizes multiple properties simultaneously. We present a novel adaptive loss which balances the separate transformations and stabilizes the optimization process. We evaluate our method on optimizing for two properties: dopamine receptor (DRD2) and drug likeness (QED), and show our method outperforms previous state-of-the-art, especially when training examples satisfying all constraints are sparse.
引用
收藏
页码:3727 / 3736
页数:10
相关论文
共 27 条
[1]  
[Anonymous], 2017, ARXIV171110907
[2]   Unpaired Generative Molecule-to-Molecule Translation for Lead Optimization [J].
Barshatski, Guy ;
Radinsky, Kira .
KDD '21: PROCEEDINGS OF THE 27TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2021, :2554-2564
[3]  
Bickerton GR, 2012, NAT CHEM, V4, P90, DOI [10.1038/NCHEM.1243, 10.1038/nchem.1243]
[4]  
Cho Kyunghyun, P 2014 C EMP METH NA, DOI 10.3115/v1/D14-1179
[5]  
Dai H., 2018, Syntax-Directed Variational Autoencoder for Structured Data
[6]  
Diederik J., 2015, P 3 INT C LEARN REPR, DOI DOI 10.48550/ARXIV.1412.6980
[7]  
Fu Tianfan, 2020, 34 AAAI C ART INT AA
[8]   Automatic Chemical Design Using a Data-Driven Continuous Representation of Molecules [J].
Gomez-Bombarelli, Rafael ;
Wei, Jennifer N. ;
Duvenaud, David ;
Hernandez-Lobato, Jose Miguel ;
Sanchez-Lengeling, Benjamin ;
Sheberla, Dennis ;
Aguilera-Iparraguirre, Jorge ;
Hirzel, Timothy D. ;
Adams, Ryan P. ;
Aspuru-Guzik, Alan .
ACS CENTRAL SCIENCE, 2018, 4 (02) :268-276
[9]   Accelerating Prototype-Based Drug Discovery using Conditional Diversity Networks [J].
Harel, Shahar ;
Radinsky, Kira .
KDD'18: PROCEEDINGS OF THE 24TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2018, :331-339
[10]  
Jin WG, 2020, PR MACH LEARN RES, V119