Few-shot learning via graph embeddings with convolutional networks for low-data molecular property prediction

被引:6
作者
Torres, Luis [1 ]
Arrais, Joel P. [1 ]
Ribeiro, Bernardete [1 ]
机构
[1] Univ Coimbra, Ctr Informat & Syst, Coimbra, Portugal
关键词
Few-shot learning; Convolutional neural networks; Graph neural networks; Molecular property prediction; DRUG; DISCOVERY;
D O I
10.1007/s00521-023-08403-5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Graph neural networks and convolutional architectures have proven to be pivotal in improving the prediction of molecular properties in drug discovery. However, this is fundamentally a low data problem that is incompatible with regular deep learning approaches. Contemporary deep networks require large amounts of training data, which severely limits the prediction of new molecular entities from limited available data. In this paper, we address the challenge of low data in molecular property prediction by: (1) defining a set of deep learning architectures that accept compound chemical structures in the form of molecular graphs, (2) creating a few-shot learning strategy across graph neural networks and convolutional neural networks to leverage the rich information of graph embeddings, and (3) proposing a two-module meta-learning framework to learn from task-transferable knowledge and predict molecular properties on few-shot data. Furthermore, we conduct multiple experiments on two benchmark multiproperty datasets to demonstrate a superior performance over conventional graph-based baselines. ROC-AUC results for 10-shot experiments show an average improvement of +11.37% on Tox21 and +0.53% on SIDER, which are representative small-sized biological datasets for molecular property prediction.
引用
收藏
页码:13167 / 13185
页数:19
相关论文
共 53 条
[1]   Deep Transferable Compound Representation across Domains and Tasks for Low Data Drug Discovery [J].
Abbasi, Karim ;
Poso, Antti ;
Ghasemi, Jahanbakhsh ;
Amanlou, Massoud ;
Masoudi-Nejad, Ali .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2019, 59 (11) :4528-4539
[2]   Low Data Drug Discovery with One-Shot Learning [J].
Altae-Tran, Han ;
Ramsundar, Bharath ;
Pappu, Aneesh S. ;
Pande, Vijay .
ACS CENTRAL SCIENCE, 2017, 3 (04) :283-293
[3]  
Arora R., 2016, P 6 INT C LEARN REPR, DOI 10.48550/arXiv.1611.01491
[4]   Transfer Learning for Drug Discovery [J].
Cai, Chenjing ;
Wang, Shiwei ;
Xu, Youjun ;
Zhang, Weilin ;
Tang, Ke ;
Ouyang, Qi ;
Lai, Luhua ;
Pei, Jianfeng .
JOURNAL OF MEDICINAL CHEMISTRY, 2020, 63 (16) :8683-8694
[5]   Convolutional Embedding of Attributed Molecular Graphs for Physical Property Prediction [J].
Coley, Connor W. ;
Barzilay, Regina ;
Green, William H. ;
Jaakkola, Tommi S. ;
Jensen, Klavs F. .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2017, 57 (08) :1757-1772
[6]  
Defferrard M, 2016, ADV NEUR IN, V29
[7]   META-DDIE: predicting drug-drug interaction events with few-shot learning [J].
Deng, Yifan ;
Qiu, Yang ;
Xu, Xinran ;
Liu, Shichao ;
Zhang, Zhongfei ;
Zhu, Shanfeng ;
Zhang, Wen .
BRIEFINGS IN BIOINFORMATICS, 2022, 23 (01)
[8]   Graph Prototypical Networks for Few-shot Learning on Attributed Networks [J].
Ding, Kaize ;
Wang, Jianling ;
Li, Jundong ;
Shu, Kai ;
Liu, Chenghao ;
Liu, Huan .
CIKM '20: PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, 2020, :295-304
[9]  
Duan Y, 2017, Adv Neural Inf Process Syst., V2017, P1088, DOI 10.48550/arXiv.1703.07326
[10]   Improvement in ADMET Prediction with Multitask Deep Featurization [J].
Feinberg, Evan N. ;
Joshi, Elizabeth ;
Pande, Vijay S. ;
Cheng, Alan C. .
JOURNAL OF MEDICINAL CHEMISTRY, 2020, 63 (16) :8835-8848