Self-supervised graph neural networks for polymer property prediction

被引:2
作者
Gao, Qinghe [1 ]
Dukker, Tammo [1 ]
Schweidtmann, Artur M. [1 ]
Weber, Jana M. [2 ]
机构
[1] Delft Univ Technol, Dept Chem Engn, Proc Intelligence Res Grp, Van der Maasweg 9, NL-2629 HZ Delft, Netherlands
[2] Delft Univ Technol, Dept Intelligent Syst, Pattern Recognit & Bioinformat, Van Mourik Broekmanweg 6, NL-2628 XE Delft, Netherlands
关键词
DATABASE;
D O I
10.1039/d4me00088a
中图分类号
O64 [物理化学(理论化学)、化学物理学];
学科分类号
070304 ; 081704 ;
摘要
The estimation of polymer properties is of crucial importance in many domains such as energy, healthcare, and packaging. Recently, graph neural networks (GNNs) have shown promising results for the prediction of polymer properties based on supervised learning. However, the training of GNNs in a supervised learning task demands a huge amount of polymer property data that is time-consuming and computationally/experimentally expensive to obtain. Self-supervised learning offers great potential to reduce this data demand through pre-training the GNNs on polymer structure data only. These pre-trained GNNs can then be fine-tuned on the supervised property prediction task using a much smaller labeled dataset. We propose to leverage self-supervised learning techniques in GNNs for the prediction of polymer properties. We employ a recent polymer graph representation that includes essential features of polymers, such as monomer combinations, stochastic chain architecture, and monomer stoichiometry, and process the polymer graphs through a tailored GNN architecture. We investigate three self-supervised learning setups: (i) node- and edge-level pre-training, (ii) graph-level pre-training, and (iii) ensembled node-, edge- & graph-level pre-training. We additionally explore three different transfer strategies of fully connected layers with the GNN architecture. Our results indicate that the ensemble node-, edge- & graph-level self-supervised learning with all layers transferred depicts the best performance across dataset size. In scarce data scenarios, it decreases the root mean square errors by 28.39% and 19.09% for the prediction of electron affinity and ionization potential compared to supervised learning without the pre-training task. Self-supervised learning for polymer property prediction in scarce data domains.
引用
收藏
页码:1130 / 1143
页数:14
相关论文
共 37 条
[1]   A graph representation of molecular ensembles for polymer property prediction [J].
Aldeghi, Matteo ;
Coley, Connor W. .
CHEMICAL SCIENCE, 2022, 13 (35) :10486-10498
[2]   Data-driven approaches for structure-property relationships in polymer science for prediction and understanding [J].
Amamoto, Yoshifumi .
POLYMER JOURNAL, 2022, 54 (08) :957-967
[3]  
Azizi S., 2021, 2021 IEEECVF INT C C
[4]   Predicting the Temperature Dependence of Surfactant CMCs Using Graph Neural Networks [J].
Brozos, Christoforos ;
Rittig, Jan G. ;
Bhattacharya, Sandip ;
Akanny, Elie ;
Kohlmann, Christina ;
Mitsos, Alexander .
JOURNAL OF CHEMICAL THEORY AND COMPUTATION, 2024, :5695-5707
[5]  
Chithrananda S., 2020, CHEMBERTA LARGE SCAL
[6]  
Gidaris S., 2018, P ICLR
[7]  
Gilmer J, 2017, arXiv
[8]   Polymer Informatics at Scale with Multitask Graph Neural Networks [J].
Gurnani, Rishi ;
Kuenneth, Christopher ;
Toland, Aubrey ;
Ramprasad, Rampi .
CHEMISTRY OF MATERIALS, 2023, 35 (04) :1560-1567
[9]   Some relations between reaction rates and equilibrium constants [J].
Hammett, LP .
CHEMICAL REVIEWS, 1935, 17 (01) :125-136
[10]   Chemprop: A Machine Learning Package for Chemical Property Prediction [J].
Heid, Esther ;
Greenman, Kevin P. ;
Chung, Yunsie ;
Li, Shih-Cheng ;
Graff, David E. ;
Vermeire, Florence H. ;
Wu, Haoyang ;
Green, William H. ;
Mcgill, Charles J. .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2023, 64 (01) :9-17