Beware of proper validation of models for ionic liquids!

Cited by: 35
Authors
Makarov, D. M. [1]
Fadeeva, Yu. A. [1]
Shmukler, L. E. [1]
Tetko, I. V. [1, 2, 3]
Affiliations
[1] Russian Acad Sci, GA Krestov Inst Solut Chem, Ivanovo, Russia
[2] Res Ctr Environm Hlth GmbH, Inst Struct Biol, Helmholtz Zentrum Munchen, Ingolstadter Landstr 1, D-85764 Neuherberg, Germany
[3] BIGCHEM GmbH, Valerystr 49, D-85716 Unterschleissheim, Germany
Keywords
Ionic Liquids; Melting point; QSPR; OCHEM; NLP; Transformer-CNN; TRIETHANOLAMINE-BASED SALTS; MELTING-POINTS; IMIDAZOLIUM BROMIDES; THERMAL-PROPERTIES; TEMPERATURE; SOLVENTS; PREDICT; MIXTURES; ACIDS; QSPR;
DOI
10.1016/j.molliq.2021.117722
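Chinese Library Classification number
O64 [Physical chemistry (theoretical chemistry), chemical physics]
Discipline classification code
070304; 081704
Abstract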
The melting point (MP) of an ionic liquid (IL) is one of its key physical properties, as it determines the lower limit of the IL's working temperature range. In this work, we analysed recently published studies on predicting the MP of ILs. While we were able to reproduce the statistical parameters reported by the authors, we found that the performance of the models on new test set data was much lower than the reported values. The discrepancy was due to the validation protocol (a random split of the initial set into training and test subsets), which did not allow a correct estimate of how the contributions of individual ions affect model performance. Using a more rigorous validation protocol, we reached good agreement between the training and test set statistical parameters. We strongly suggest using this protocol to validate models for other properties of ILs, to avoid reporting overoptimistic statistical parameters. We also showed that the Transformer Convolutional Neural Network, which is based on the representation of molecules as text (SMILES), yielded a model with significantly higher prediction accuracy than models developed using the descriptors employed in the previous studies. The RMSE of this model is 44 °C, and the model is applicable to any type of IL. The data and developed models are publicly available online at http://ochem.eu/article/135195. © 2021 Elsevier B.V. All rights reserved.
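The abstract does not spell out the more rigorous validation protocol, so the following is only a minimal sketch of one possible reading: instead of splitting salts at random, hold out whole ions, so that every test salt contains at least one cation or anion the model never saw in training. The function name `ion_aware_split`, the `holdout_fraction` parameter, and the placeholder labels `C1`..`C5` and `A1`..`A4` are assumptions made for illustration and are not taken from the paper; the MP values are dummy zeros, not measurements.

```python
import random


def ion_aware_split(salts, holdout_fraction=0.2, seed=0):
    """Split (cation, anion, mp) records so that held-out ions never occur in training.

    Hypothetical helper: one way to avoid the leakage of a random split,
    not necessarily the protocol used by the authors.
    """
    rng = random.Random(seed)
    cations = sorted({cation for cation, _, _ in salts})
    anions = sorted({anion for _, anion, _ in salts})

    # Hold out a fraction of the distinct ions (at least one of each kind).
    n_cations = max(1, round(holdout_fraction * len(cations)))
    n_anions = max(1, round(holdout_fraction * len(anions)))
    held_cations = set(rng.sample(cations, n_cations))
    held_anions = set(rng.sample(anions, n_anions))

    train, test = [], []
    for record in salts:
        cation, anion, _ = record
        # A salt goes to the test set if either of its ions is held out.
        if cation in held_cations or anion in held_anions:
            test.append(record)
        else:
            train.append(record)
    return train, test


# Placeholder data: every combination of 5 cations and 4 anions, dummy MP of 0.0.
salts = [(f"C{i}", f"A{j}", 0.0) for i in range(1, 6) for j in range(1, 5)]
train, test = ion_aware_split(salts)

train_cations = {cation for cation, _, _ in train}
train_anions = {anion for _, anion, _ in train}
# Each test salt has at least one ion that is absent from the training set.
unseen = [c not in train_cations or a not in train_anions for c, a, _ in test]
print(len(train), len(test), all(unseen))
```

With a random split, the same cation or anion typically appears on both sides, so the test error mostly measures interpolation between known ions; a split of this kind instead estimates performance on salts built from new ions.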
Pages: 10