Machine-Learning-Based Prediction of the Glass Transition Temperature of Organic Compounds Using Experimental Data

Cited by: 16
Authors
Armeli, Gianluca [1]
Peters, Jan-Hendrik [1]
Koop, Thomas [1]
Affiliations
[1] Bielefeld Univ, Fac Chem, D-33615 Bielefeld, Germany
Keywords
UNIFAC MODEL; VISCOSITY; AEROSOLS; MIXTURES; STATE; WATER; PURE
DOI
10.1021/acsomega.2c08146
Chinese Library Classification number
O6 [Chemistry]
Discipline classification code
0703
Abstract
Knowledge of the glass transition temperature of molecular compounds that occur in atmospheric aerosol particles is important for estimating their viscosity, which directly influences the kinetics of chemical reactions and the particle phase state. While there is a great diversity of organic compounds present in aerosol particles, experimental glass transition temperatures are known for only a minor fraction of them. Therefore, we have developed a machine learning model designed to predict the glass transition temperature of organic molecular compounds based on molecule-derived input variables. The extremely randomized trees (extra trees) procedure was chosen for this purpose. Two approaches using different sets of input variables were followed. The first uses the counts of selected functional groups present in the compound, while the second generates descriptors from a SMILES (Simplified Molecular Input Line Entry System) string. Organic compounds containing carbon, hydrogen, oxygen, nitrogen, and halogen atoms are included. For improved results, both approaches can be combined with the melting temperature of the compound as an additional input variable. Both approaches yield a similar mean absolute error of about 12-13 K, with the SMILES-based predictions performing slightly better. In general, the model shows good predictive power considering the diversity of the experimental input data. Furthermore, we show that its performance exceeds that of previous parameterizations developed for this purpose and that it also performs better than existing machine learning models. To provide user-friendly versions of the model for applications, we have developed a website where interested scientists can run the model via a web-based interface without prior technical knowledge. We also provide Python code of the model. Additionally, all experimental input data are provided in the form of the Bielefeld Molecular Organic Glasses (BIMOG) database. We believe that this model is a powerful tool for many applications in atmospheric aerosol science and material science.
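The functional-group approach described in the abstract can be illustrated with a small from-scratch sketch of extremely randomized trees. This is not the authors' published code. The three count features, the function names (`fit_extra_trees`, `predict`, `mean_absolute_error`), and all numeric values are hypothetical placeholders chosen for illustration and are not taken from the BIMOG database. In practice a library implementation such as scikit-learn's `ExtraTreesRegressor` would normally be preferred.

```python
import random
from statistics import mean

# Hypothetical inputs: counts of [hydroxyl groups, carboxyl groups, carbon atoms].
# A melting temperature could be appended as a fourth column.
# All numbers are made-up placeholders, NOT values from the BIMOG database.
X_train = [[1, 0, 2], [2, 0, 3], [3, 0, 3], [0, 1, 4], [0, 2, 4], [1, 1, 6]]
y_train = [100.0, 150.0, 200.0, 175.0, 250.0, 225.0]  # placeholder Tg values in K


def _sse(values):
    """Sum of squared deviations from the mean (0 for an empty list)."""
    if not values:
        return 0.0
    m = mean(values)
    return sum((v - m) ** 2 for v in values)


def _build_tree(X, y, rng, k):
    """Grow one randomized tree; a leaf is a number, an inner node is a tuple."""
    if len(set(y)) == 1:
        return mean(y)
    # Only features that still vary in this node can be split on.
    varying = [j for j in range(len(X[0])) if len({row[j] for row in X}) > 1]
    best = None
    for j in rng.sample(varying, min(k, len(varying))):
        column = [row[j] for row in X]
        threshold = rng.uniform(min(column), max(column))  # random cut point
        left = [i for i, v in enumerate(column) if v <= threshold]
        right = [i for i, v in enumerate(column) if v > threshold]
        if not right:  # degenerate draw: everything fell on one side
            continue
        score = _sse([y[i] for i in left]) + _sse([y[i] for i in right])
        if best is None or score < best[0]:
            best = (score, j, threshold, left, right)
    if best is None:
        return mean(y)
    _, j, threshold, left, right = best
    return (j, threshold,
            _build_tree([X[i] for i in left], [y[i] for i in left], rng, k),
            _build_tree([X[i] for i in right], [y[i] for i in right], rng, k))


def fit_extra_trees(X, y, n_trees=50, k=2, seed=0):
    """Fit an ensemble; every tree sees the full training set (no bootstrap)."""
    rng = random.Random(seed)
    return [_build_tree(X, y, rng, k) for _ in range(n_trees)]


def predict(trees, x):
    """Average the leaf values reached by x across all trees."""
    def walk(node):
        while isinstance(node, tuple):
            j, threshold, left, right = node
            node = left if x[j] <= threshold else right
        return node
    return mean(walk(tree) for tree in trees)


def mean_absolute_error(predicted, observed):
    """Mean absolute deviation between predictions and observations."""
    return mean(abs(p - o) for p, o in zip(predicted, observed))


trees = fit_extra_trees(X_train, y_train)
print(predict(trees, [2, 1, 5]))  # estimate for a hypothetical compound
```

Each tree draws its split thresholds at random and keeps the best of `k` candidate splits, and the ensemble averages the resulting leaf values. The error metric is the same kind of mean absolute error that the abstract reports in kelvin.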
Pages: 12298-12309
Number of pages: 12