Accurate machine learning models based on small dataset of energetic materials through spatial matrix featurization methods

被引:17
作者
Chen, Chao [1 ]
Liu, Danyang [1 ]
Deng, Siyan [1 ]
Zhong, Lixiang [1 ]
Chan, Serene Hay Yee [1 ]
Li, Shuzhou [1 ]
Hng, Huey Hoon [1 ]
机构
[1] Nanyang Technol Univ, Sch Mat Sci & Engn, 50 Nanyang Ave, Singapore 639798, Singapore
来源
JOURNAL OF ENERGY CHEMISTRY | 2021年 / 63卷
关键词
Small database machine learning; Energetic materials screening; Spatial matrix featurization method; Crystal density; Formation enthalpy; n-Body interactions; DENSITY-FUNCTIONAL THEORY; CUBANE DERIVATIVES; PREDICTION; ENTHALPIES; HEATS; APPROXIMATIONS; MOLECULES; ENERGIES; STORAGE; SALTS;
D O I
10.1016/j.jechem.2021.08.031
中图分类号
O69 [应用化学];
学科分类号
081704 ;
摘要
A large database is desired for machine learning (ML) technology to make accurate predictions of materials physicochemical properties based on their molecular structure. When a large database is not available, the development of proper featurization method based on physicochemical nature of target proprieties can improve the predictive power of ML models with a smaller database. In this work, we show that two new featurization methods, volume occupation spatial matrix and heat contribution spatial matrix, can improve the accuracy in predicting energetic materials' crystal density (rho(crystal)) and solid phase enthalpy of formation (H-f;solid) using a database containing 451 energetic molecules. Their mean absolute errors are reduced from 0.048 g/cm(3) and 24.67 kcal/mol to 0.035 g/cm(3) and 9.66 kcal/mol, respectively. By leave-one-out-cross-validation, the newly developed ML models can be used to determine the performance of most kinds of energetic materials except cubanes. Our ML models are applied to predict rho(crystal) and H-f,H- solid of CHON-based molecules of the 150 million sized PubChem database, and screened out 56 candidates with competitive detonation performance and reasonable chemical structures. With further improvement in future, spatial matrices have the potential of becoming multifunctional ML simulation tools that could provide even better predictions in wider fields of materials science. (C) 2021 Science Press and Dalian Institute of Chemical Physics, Chinese Academy of Sciences. Published by ELSEVIER B.V. and Science Press. All rights reserved.
引用
收藏
页码:364 / 375
页数:12
相关论文
共 62 条
[1]  
Akhavan J., 2011, CHEM EXPLOSIVES
[2]   Computational Thermochemistry: Scale Factor Databases and Scale Factors for Vibrational Frequencies Obtained from Electronic Model Chemistries [J].
Alecu, I. M. ;
Zheng, Jingjing ;
Zhao, Yan ;
Truhlar, Donald G. .
JOURNAL OF CHEMICAL THEORY AND COMPUTATION, 2010, 6 (09) :2872-2887
[3]   A statistical method for assessment of the existing correlations of hydrate forming conditions [J].
Amin, Javad Sayyad ;
Nildchah, Somayye ;
Veiskarami, Mehdi .
JOURNAL OF ENERGY CHEMISTRY, 2015, 24 (01) :93-100
[4]   Ab initio solid-state heats of formation of molecular salts from ion packing and crystal modelling:: application to ammonium crystals [J].
Beaucamp, S ;
Bernand-Mantel, A ;
Mathieu, D ;
Agafonov, V .
MOLECULAR PHYSICS, 2004, 102 (03) :253-258
[5]   Cubane: 50 Years Later [J].
Biegasiewicz, Kyle F. ;
Griffiths, Justin R. ;
Savage, G. Paul ;
Tsanaktsidis, John ;
Priefer, Ronny .
CHEMICAL REVIEWS, 2015, 115 (14) :6719-6745
[6]   Chemically intuited, large-scale screening of MOFs by machine learning techniques [J].
Borboudakis, Giorgos ;
Stergiannakos, Taxiarchis ;
Frysali, Maria ;
Klontzas, Emmanuel ;
Tsamardinos, Ioannis ;
Froudakis, George E. .
NPJ COMPUTATIONAL MATERIALS, 2017, 3
[7]   Machine learning for molecular and materials science [J].
Butler, Keith T. ;
Davies, Daniel W. ;
Cartwright, Hugh ;
Isayev, Olexandr ;
Walsh, Aron .
NATURE, 2018, 559 (7715) :547-555
[8]   Improved prediction of heats of formation of energetic materials using quantum mechanical calculations [J].
Byrd, EFC ;
Rice, BM .
JOURNAL OF PHYSICAL CHEMISTRY A, 2006, 110 (03) :1005-1013
[9]   State-of-the-Art Calculations of Sublimation Enthalpies for Selected Molecular Crystals and Their Computational Uncertainty [J].
Cervinka, Ctirad ;
Fulem, Michal .
JOURNAL OF CHEMICAL THEORY AND COMPUTATION, 2017, 13 (06) :2840-2850
[10]   Energy Storage in Cubane Derivatives and Their Real-Time Decomposition: Computational Molecular Dynamics and Thermodynamics [J].
Chaban, Vitaly V. ;
Prezhdo, Oleg V. .
ACS ENERGY LETTERS, 2016, 1 (01) :189-194