Molecular representation contrastive learning via transformer embedding to graph neural networks

被引:0
|
作者
Liu, Yunwu [1 ]
Zhang, Ruisheng [1 ]
Li, Tongfeng [1 ]
Jiang, Jing [1 ]
Ma, Jun [1 ]
Yuan, Yongna [1 ]
Wang, Ping [1 ]
机构
[1] Lanzhou Univ, Sch Informat Sci & Engn, Tianshui Rd, Lanzhou 730000, Peoples R China
基金
中国国家自然科学基金;
关键词
Molecular machine learning; Contrastive learning; Graph neural networks; Augmentation methods; Molecular property prediction; DISCOVERY;
D O I
10.1016/j.asoc.2024.111970
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Molecular property prediction has shown great performance using graph neural networks (GNNs). However, due to the lack of expansion potential and the scarcity of available labeling data, GNNs are unable to generate appropriate molecular representation. In this study, we propose MolFG, a new contrastive learning (CL) pre-training framework for predicting molecular properties. Meanwhile, we also propose FormerGraph, an effective molecular graph representation strategy, aiming to devise an effective method for learning information regarding molecular features. After pre-training on 10 million unlabeled molecules and then fine-tuning multiple types of downstream tasks to predict molecular properties. The encouraging results revealed that MolFG could effectively extract meaningful chemical insights to generate interpretable representations and differentiate chemically plausible molecular similarities. On most molecular benchmark datasets, MolFG rivals or surpasses supervised learning methods with sophisticated feature engineering. Compared to the previous best supervised model, MolFG demonstrates an average 7.5% gain in ROC-AUC on 7 classification tasks and a 1.9% decrease in scaled average error on 6 regression tasks. Numerous experimental outcomes on downstream tasks demonstrate that the MolFG model can significantly enhance its effectiveness in predicting molecular properties.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Molecular contrastive learning of representations via graph neural networks
    Yuyang Wang
    Jianren Wang
    Zhonglin Cao
    Amir Barati Farimani
    Nature Machine Intelligence, 2022, 4 : 279 - 287
  • [2] Molecular contrastive learning of representations via graph neural networks
    Wang, Yuyang
    Wang, Jianren
    Cao, Zhonglin
    Farimani, Amir Barati
    NATURE MACHINE INTELLIGENCE, 2022, 4 (03) : 279 - 287
  • [3] Molecular Representation Learning via Heterogeneous Motif Graph Neural Networks
    Yu, Zhaoning
    Gao, Hongyang
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [4] Transferable Implicit Solvation via Contrastive Learning of Graph Neural Networks
    Airas, Justin
    Ding, Xinqiang
    Zhang, Bin
    ACS CENTRAL SCIENCE, 2023, 9 (12) : 2286 - 2297
  • [5] Inferring Gene Regulatory Networks via Directed Graph Contrastive Representation Learning
    Long, Kaifu
    Qu, Luxuan
    Wang, Weiyiqi
    Wang, Zhiqiong
    Wang, Mingcan
    Xin, Junchang
    KNOWLEDGE-BASED SYSTEMS, 2025, 316
  • [6] Dynamic Representation Learning via Recurrent Graph Neural Networks
    Zhang, Chun-Yang
    Yao, Zhi-Liang
    Yao, Hong-Yu
    Huang, Feng
    Chen, C. L. Philip
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, 53 (02): : 1284 - 1297
  • [7] Contrastive Document Representation Learning with Graph Attention Networks
    Xu, Peng
    Chen, Xinchi
    Ma, Xiaofei
    Huang, Zhiheng
    Xiang, Bing
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2021, 2021, : 3874 - 3884
  • [8] Boosting Patient Representation Learning via Graph Contrastive Learning
    Zhang, Zhenhao
    Liu, Yuxi
    Bian, Jiang
    Yepes, Antonio Jimeno
    Shen, Jun
    Li, Fuyi
    Long, Guodong
    Salim, Flora D.
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES-APPLIED DATA SCIENCE TRACK, PT IX, ECML PKDD 2024, 2024, 14949 : 335 - 350
  • [9] Contrastive learning enhanced by graph neural networks for Universal Multivariate Time Series Representation
    Wang, Xinghao
    Xing, Qiang
    Xiao, Huimin
    Ye, Ming
    INFORMATION SYSTEMS, 2024, 125
  • [10] Graph Embedding Contrastive Multi-Modal Representation Learning for Clustering
    Xia, Wei
    Wang, Tianxiu
    Gao, Quanxue
    Yang, Ming
    Gao, Xinbo
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 1170 - 1183