Meta Learning for Low-Resource Molecular Optimization

被引：25

作者：

Wang, Jiahao ^{[1
]}

Zheng, Shuangjia ^{[1
,2
]}

Chen, Jianwen ^{[1
]}

Yang, Yuedong ^{[1
,3
]}

机构：

[1] Sun Yat Sen Univ, Sch Comp Sci & Engn, Guangzhou 510006, Peoples R China

[2] Galixir Technol Beijing Ltd, Beijing 100083, Peoples R China

[3] Sun Yat Sen Univ, Key Lab Machine Intelligence & Adv Comp MOE, Guangzhou 510006, Peoples R China

来源：

JOURNAL OF CHEMICAL INFORMATION AND MODELING | 2021年 / 61卷 / 04期

基金：

中国国家自然科学基金; 国家重点研发计划;

关键词：

DISCOVERY;

D O I：

10.1021/acs.jcim.0c01416

中图分类号：

R914 [药物化学];

学科分类号：

100701 ;

摘要：

The goal of molecular optimization (MO) is to discover molecules that acquire improved pharmaceutical properties over a known starting molecule. Despite many recent successes of new approaches for MO, these methods were typically developed for particular properties with rich annotated training examples. Thus, these approaches are difficult to implement in real scenes where only a small amount of pharmaceutical data is usually available due to the expense and significant effort required for the data collection. Here, we propose a new approach, Meta-MO, for molecular optimization with a handful of training samples based on the well-recognized first-order meta-learning algorithms. By using a set of meta tasks with rich training samples, Meta-MO trains a meta model through the meta-learning optimization and adapts the learned model to new low-resource MO tasks. Meta-MO was shown to consistently outperform several pretraining and multitask training procedures, providing an average improvement in the success rate of 4.3% on a large-scale bioactivity data set with diverse target variations. We also observed that Meta-MO resulted in the best performing models across fine-tuning sets with only dozens of samples. To the best of our knowledge, this is the first study to apply meta learning to MO tasks. More importantly, such a strategy could be further extended to many low-resource scenarios in real-world drug design.

引用

页码：1627 / 1636

页数：10

共 36 条

[1] Low Data Drug Discovery with One-Shot Learning [J].

Altae-Tran, Han ;

Ramsundar, Bharath ;

Pappu, Aneesh S. ;

Pande, Vijay .

ACS CENTRAL SCIENCE, 2017, 3 (04) :283-293

[2] SMILES-based deep generative scaffold decorator for de-novo drug design [J].

Arus-Pous, Josep ;

Patronov, Atanas ;

Bjerrum, Esben Jannik ;

Tyrchan, Christian ;

Reymond, Jean-Louis ;

Chen, Hongming ;

Engkvist, Ola .

JOURNAL OF CHEMINFORMATICS, 2020, 12 (01)

[3]

Bahdanau D, 2016, Arxiv, DOI arXiv:1409.0473

[4]

Chen J., 2020, BIORXIV, DOI [10.1101/2020.06.24.169011, DOI 10.1101/2020.06.24.169011]

[5] Fragment-based approach to drug lead discovery - Overview and advances in various techniques [J].

Fattori, Daniela ;

Squarcia, Antonella ;

Bartoli, Sandra .

DRUGS IN R&D, 2008, 9 (04) :217-227

[6]

Finn C, 2017, PR MACH LEARN RES, V70

[7]

Fu TF, 2020, AAAI CONF ARTIF INTE, V34, P638

[8]

Graves A., 2012, ARXIV3711

[9] Constrained Bayesian optimization for automatic chemical design using variational autoencoders [J].

Griffiths, Ryan-Rhys ;

Hernandez-Lobato, Jose Miguel .

CHEMICAL SCIENCE, 2020, 11 (02) :577-586

[10] DOGS: Reaction-Driven de novo Design of Bioactive Compounds [J].

Hartenfeller, Markus ;

Zettl, Heiko ;

Walter, Miriam ;

Rupp, Matthias ;

Reisen, Felix ;

Proschak, Ewgenij ;

Weggen, Sascha ;

Stark, Holger ;

Schneider, Gisbert .

PLOS COMPUTATIONAL BIOLOGY, 2012, 8 (02)

← 1 2 3 4 →