Evaluating Self-Supervised Learning for Molecular Graph Embeddings

被引:0
作者
Wang, Hanchen
Kaddour, Jean [1 ]
Liu, Shengchao [2 ,3 ]
Tang, Jian [2 ,4 ,5 ]
Lasenby, Joan
Liu, Qi [6 ]
机构
[1] UCL, London, England
[2] MILA, Montreal, PQ, Canada
[3] UdeM, Montreal, PQ, Canada
[4] HEC, Montreal, PQ, Canada
[5] CIFAR, Toronto, ON, Canada
[6] HKU, Hong Kong, Peoples R China
来源
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023) | 2023年
关键词
MEDICINAL CHEMISTRY; DRUG;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Graph Self-Supervised Learning (GSSL) provides a robust pathway for acquiring embeddings without expert labelling, a capability that carries profound implications for molecular graphs due to the staggering number of potential molecules and the high cost of obtaining labels. However, GSSL methods are designed not for optimisation within a specific domain but rather for transferability across a variety of downstream tasks. This broad applicability complicates their evaluation. Addressing this challenge, we present "Molecular Graph Representation Evaluation" (MOLGRAPHEVAL), generating detailed profiles of molecular graph embeddings with interpretable and diversified attributes. MOLGRAPHEVAL offers a suite of probing tasks grouped into three categories: (i) generic graph, (ii) molecular substructure, and (iii) embedding space properties. By leveraging MOLGRAPHEVAL to benchmark existing GSSL methods against both current downstream datasets and our suite of tasks, we uncover significant inconsistencies between inferences drawn solely from existing datasets and those derived from more nuanced probing. These findings suggest that current evaluation methodologies fail to capture the entirety of the landscape.
引用
收藏
页数:33
相关论文
共 120 条
[1]  
Afonina Valentina A, 2019, J CHEM INFORM MODELI
[2]  
Akhondzadeh Mohammad Sadegh, 2023, AISTATS
[3]   Biomedical Applications of Aromatic Azo Compounds [J].
Ali, Yousaf ;
Abd Hamid, Shafida ;
Rashid, Umer .
MINI-REVIEWS IN MEDICINAL CHEMISTRY, 2018, 18 (18) :1548-1558
[4]  
[Anonymous], 2018, NeurIPS
[5]  
[Anonymous], 2020, AAAI CONF ARTIF INTE
[6]  
[Anonymous], 2017, ICML
[7]  
[Anonymous], CVPR, DOI DOI 10.1038/S41416-021-01597-2
[8]   GEOM, energy-annotated molecular conformations for property prediction and molecular generation [J].
Axelrod, Simon ;
Gomez-Bombarelli, Rafael .
SCIENTIFIC DATA, 2022, 9 (01)
[9]   Catalytic allylic oxidation of internal alkenes to a multifunctional chiral building block [J].
Bayeh, Liela ;
Le, Phong Q. ;
Tambar, Uttam K. .
NATURE, 2017, 547 (7662) :196-+
[10]  
Beachy Michael D, 1997, JACS