A graph representation of molecular ensembles for polymer property prediction

被引:71
作者
Aldeghi, Matteo [1 ]
Coley, Connor W. [1 ,2 ]
机构
[1] MIT, Dept Chem Engn, Cambridge, MA 02139 USA
[2] MIT, Dept Elect Engn & Comp Sci, Cambridge, MA 02139 USA
基金
美国国家卫生研究院;
关键词
DRUG-DELIVERY; INFORMATICS; COPOLYMERS; MICELLES; RESOURCE; WEIGHT; DESIGN;
D O I
10.1039/d2sc02839e
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Synthetic polymers are versatile and widely used materials. Similar to small organic molecules, a large chemical space of such materials is hypothetically accessible. Computational property prediction and virtual screening can accelerate polymer design by prioritizing candidates expected to have favorable properties. However, in contrast to organic molecules, polymers are often not well-defined single structures but an ensemble of similar molecules, which poses unique challenges to traditional chemical representations and machine learning approaches. Here, we introduce a graph representation of molecular ensembles and an associated graph neural network architecture that is tailored to polymer property prediction. We demonstrate that this approach captures critical features of polymeric materials, like chain architecture, monomer stoichiometry, and degree of polymerization, and achieves superior accuracy to off-the-shelf cheminformatics methodologies. While doing so, we built a dataset of simulated electron affinity and ionization potential values for >40k polymers with varying monomer composition, stoichiometry, and chain architecture, which may be used in the development of other tailored machine learning approaches. The dataset and machine learning models presented in this work pave the path toward new classes of algorithms for polymer informatics and, more broadly, introduce a framework for the modeling of molecular ensembles.
引用
收藏
页码:10486 / 10498
页数:13
相关论文
共 99 条
[1]  
A Community Resource for Innovation in Polymer Technology, US
[2]   Gradient copolymers - Preparation, properties and practice [J].
Alam, Md Mahbub ;
Jack, Kevin S. ;
Hill, David J. T. ;
Whittaker, Andrew K. ;
Peng, Hui .
EUROPEAN POLYMER JOURNAL, 2019, 116 :394-414
[3]   Random Forest Predictor for Diblock Copolymer Phase Behavior [J].
Arora, Akash ;
Lin, Tzyy-Shyang ;
Rebello, Nathan J. ;
Av-Ron, Sarah H. M. ;
Mochigase, Hidenobu ;
Olsen, Bradley D. .
ACS MACRO LETTERS, 2021, 10 (11) :1339-1345
[4]   Polymer Informatics: Opportunities and Challenges [J].
Audus, Debra J. ;
de Pablo, Juan J. .
ACS MACRO LETTERS, 2017, 6 (10) :1078-1082
[5]   Accelerated Discovery of Organic Polymer Photocatalysts for Hydrogen Evolution from Water through the Integration of Experiment and Theory [J].
Bai, Yang ;
Wilbraham, Liam ;
Slater, Benjamin J. ;
Zwijnenburg, Martijn A. ;
Sprick, Reiner Sebastian ;
Cooper, Andrew I. .
JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, 2019, 141 (22) :9063-9071
[6]  
Bannigan P., 2022, CHEMRXIV, DOI DOI 10.26434/CHEMRXIV-2021-MXRXW-V2
[7]   Machine learning directed drug formulation development [J].
Bannigan, Pauric ;
Aldeghi, Matteo ;
Bao, Zeqing ;
Hase, Florian ;
Aspuru-Guzik, Alan ;
Allen, Christine .
ADVANCED DRUG DELIVERY REVIEWS, 2021, 175
[8]   Extendedtight-bindingquantum chemistry methods [J].
Bannwarth, Christoph ;
Caldeweyher, Eike ;
Ehlert, Sebastian ;
Hansen, Andreas ;
Pracht, Philipp ;
Seibert, Jakob ;
Spicher, Sebastian ;
Grimme, Stefan .
WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL MOLECULAR SCIENCE, 2021, 11 (02)
[9]   Designing exceptional gas-separation polymer membranes using machine learning [J].
Barnett, J. Wesley ;
Bilchak, Connor R. ;
Wang, Yiwen ;
Benicewicz, Brian C. ;
Murdock, Laura A. ;
Bereau, Tristan ;
Kumar, Sanat K. .
SCIENCE ADVANCES, 2020, 6 (20)
[10]   DENSITY-FUNCTIONAL THERMOCHEMISTRY .3. THE ROLE OF EXACT EXCHANGE [J].
BECKE, AD .
JOURNAL OF CHEMICAL PHYSICS, 1993, 98 (07) :5648-5652