3D graph contrastive learning for molecular property prediction

被引:16
作者
Moon, Kisung [1 ]
Im, Hyeon-Jin [1 ]
Kwon, Sunyoung [1 ,2 ,3 ]
机构
[1] Pusan Natl Univ, Dept Informat Convergence Engn, Yangsan, South Korea
[2] Pusan Natl Univ, Sch Biomed Convergence Engn, Yangsan 50612, South Korea
[3] Pusan Natl Univ, Ctr Artificial Intelligence Res, Busan 46241, South Korea
基金
新加坡国家研究基金会;
关键词
DATABASE;
D O I
10.1093/bioinformatics/btad371
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation Self-supervised learning (SSL) is a method that learns the data representation by utilizing supervision inherent in the data. This learning method is in the spotlight in the drug field, lacking annotated data due to time-consuming and expensive experiments. SSL using enormous unlabeled data has shown excellent performance for molecular property prediction, but a few issues exist. (i) Existing SSL models are large-scale; there is a limitation to implementing SSL where the computing resource is insufficient. (ii) In most cases, they do not utilize 3D structural information for molecular representation learning. The activity of a drug is closely related to the structure of the drug molecule. Nevertheless, most current models do not use 3D information or use it partially. (iii) Previous models that apply contrastive learning to molecules use the augmentation of permuting atoms and bonds. Therefore, molecules having different characteristics can be in the same positive samples. We propose a novel contrastive learning framework, small-scale 3D Graph Contrastive Learning (3DGCL) for molecular property prediction, to solve the above problems.Results 3DGCL learns the molecular representation by reflecting the molecule's structure through the pretraining process that does not change the semantics of the drug. Using only 1128 samples for pretrain data and 0.5 million model parameters, we achieved state-of-the-art or comparable performance in six benchmark datasets. Extensive experiments demonstrate that 3D structural information based on chemical knowledge is essential to molecular representation learning for property prediction.Availability and implementationData and codes are available in .
引用
收藏
页数:9
相关论文
共 50 条
[21]   Assigning confidence to molecular property prediction [J].
Nigam, AkshatKumar ;
Pollice, Robert ;
Hurley, Matthew F. D. ;
Hickman, Riley J. ;
Aldeghi, Matteo ;
Yoshikawa, Naruki ;
Chithrananda, Seyone ;
Voelz, Vincent A. ;
Aspuru-Guzik, Alan .
EXPERT OPINION ON DRUG DISCOVERY, 2021, 16 (09) :1009-1023
[22]   RNAComposer and RNA 3D structure prediction for nanotechnology [J].
Biesiada, Marcin ;
Pachulska-Wieczorek, Katarzyna ;
Adamiak, Ryszard W. ;
Purzycka, Katarzyna J. .
METHODS, 2016, 103 :120-127
[23]   FP-GNN: a versatile deep learning architecture for enhanced molecular property prediction [J].
Cai, Hanxuan ;
Zhang, Huimin ;
Zhao, Duancheng ;
Wu, Jingxing ;
Wang, Ling .
BRIEFINGS IN BIOINFORMATICS, 2022, 23 (06)
[24]   Molecular pretraining models towards molecular property prediction [J].
Qiao, Jianbo ;
Gao, Wenjia ;
Jin, Junru ;
Wang, Ding ;
Guo, Xu ;
Manavalan, Balachandran ;
Wei, Leyi .
SCIENCE CHINA-INFORMATION SCIENCES, 2025, 68 (07)
[25]   Disentangled Representation Learning for 3D Face Shape [J].
Jiang, Zi-Hang ;
Wu, Qianyi ;
Chen, Keyu ;
Zhang, Juyong .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :11949-11958
[26]   Review: Deep Learning on 3D Point Clouds [J].
Bello, Saifullahi Aminu ;
Yu, Shangshu ;
Wang, Cheng ;
Adam, Jibril Muhmmad ;
Li, Jonathan .
REMOTE SENSING, 2020, 12 (11)
[27]   Multilinear Autoencoder for 3D Face Model Learning [J].
Abrevaya, Victoria Fernandez ;
Wuhrer, Stefanie ;
Boyer, Edmond .
2018 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2018), 2018, :1-9
[28]   Capture, Learning, and Synthesis of 3D Speaking Styles [J].
Cudeiro, Daniel ;
Bolkart, Timo ;
Laidlaw, Cassidy ;
Ranjan, Anurag ;
Black, Michael J. .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :10093-10103
[29]   Positional embeddings and zero-shot learning using BERT for molecular-property prediction [J].
Mswahili, Medard Edmund ;
Hwang, Junha ;
Rajapakse, Jagath C. ;
Jo, Kyuri ;
Jeong, Young-Seob .
JOURNAL OF CHEMINFORMATICS, 2025, 17 (01)
[30]   A multiscale molecular structural neural network for molecular property prediction [J].
Shi, Zhiwei ;
Ma, Miao ;
Ning, Hanyang ;
Yang, Bo ;
Dang, Jingshuang .
MOLECULAR DIVERSITY, 2025, 29 (04) :3273-3292