QSPR for the prediction of critical micelle concentration of different classes of surfactants using machine learning algorithms

被引:8
作者
Boukelkal, Nada [1 ]
Rahal, Soufiane [1 ]
Rebhi, Redha [1 ]
Hamadache, Mabrouk [1 ]
机构
[1] Univ Yahia Fares, Fac Technol, Dept Proc Engn & Environm, Biomat & Transport Phenomena Lab LBMPT, Medea 26000, Algeria
关键词
CMC; QSPR; RFR; Surfactants; SVR; STRUCTURE-PROPERTY RELATIONSHIP; SUGAR-BASED SURFACTANTS; ATOM ETA INDEXES; ANIONIC SURFACTANTS; EXTENDED SURFACTANTS; CHAIN-LENGTH; QSAR MODELS; VALIDATION; CARBOXYLATE; PARAMETERS;
D O I
10.1016/j.jmgm.2024.108757
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The determination of the critical micelle concentration (CMC) is a crucial factor when evaluating surfactants, making it an essential tool in studying the properties of surfactants in various industrial fields. In this present research, we assembled a comprehensive set of 593 different classes of surfactants including, anionic, cationic, nonionic, zwitterionic, and Gemini surfactants to establish a link between their molecular structure and the negative logarithmic value of critical micelle concentration (pCMC) utilizing quantitative structure -property relationship (QSPR) methodologies. Statistical analysis revealed that a set of 14 significant Mordred descriptors (SlogP, GATS6d, nAcid, GATS8dv, GATS4dv, PEOE_VSA11, GATS8d, ATS0p, GATS1d, MATS5p, GATS3d, NdssC, GATS6dv and EState_VSA4), along with temperature, served as appropriate inputs. Different machine learning methods, such as multiple linear regression (MLR), random forest regression (RFR), artificial neural network (ANN), and support vector regression (SVM), were employed in this study to build QSPR models. According to the statistical coefficients of QSPR models, SVR with Dragonfly hyperparameter optimization (SVRDA) was the most accurate in predicting pCMC values, achieving ( R 2 = 0.9740, Q 2 = 0.9739, r 2 m = 0.9627, and Delta r 2 m = 0.0244) for the entire dataset.
引用
收藏
页数:8
相关论文
共 59 条
[1]   Critical micelle concentrations of different classes of surfactants: A Quantitative Structure Property Relationship study [J].
Anoune, N ;
Nouiri, M ;
Berrah, Y ;
Gauvrit, JY ;
Lanteri, P .
JOURNAL OF SURFACTANTS AND DETERGENTS, 2002, 5 (01) :45-53
[2]   Developing an ANFIS-based swarm concept model for estimating the relative viscosity of nanofluids [J].
Baghban, Alireza ;
Jalali, Ali ;
Shafiee, Mojtaba ;
Ahmadi, Mohammad Hossein ;
Chau, Kwok-wing .
ENGINEERING APPLICATIONS OF COMPUTATIONAL FLUID MECHANICS, 2019, 13 (01) :26-39
[3]   On the prediction of critical micelle concentration for sugar-based non-ionic surfactants [J].
Baghban, Alireza ;
Sasanipour, Jafar ;
Sarafbidabad, Mohsen ;
Piri, Amin ;
Razavi, Razieh .
CHEMISTRY AND PHYSICS OF LIPIDS, 2018, 214 :46-57
[4]  
Ben-Hur A, 2010, METHODS MOL BIOL, V609, P223, DOI 10.1007/978-1-60327-241-4_13
[5]   QSAR model for prediction of the therapeutic potency of N-benzylpiperidine derivatives as AChE inhibitors [J].
Bitam, S. ;
Hamadache, M. ;
Hanini, S. .
SAR AND QSAR IN ENVIRONMENTAL RESEARCH, 2017, 28 (06) :471-489
[6]   2D-QSAR, docking, molecular dynamics, studies of PF-07321332 analogues to identify alternative inhibitors against 3CLpro enzyme in SARS-CoV disease [J].
Bitam, Said ;
Hamadache, Mabrouk ;
Hanini, Salah .
JOURNAL OF BIOMOLECULAR STRUCTURE & DYNAMICS, 2023, 41 (14) :6991-7000
[7]  
Breiman L, 1996, MACH LEARN, V24, P123, DOI 10.1023/A:1018054314350
[8]   Advances in surfactants for agrochemicals [J].
Castro, Mariano J. L. ;
Ojeda, Carlos ;
Fernandez Cirelli, Alicia .
ENVIRONMENTAL CHEMISTRY LETTERS, 2014, 12 (01) :85-95
[9]   In Silico Log P Prediction for a Large Data Set with Support Vector Machines, Radial Basis Neural Networks and Multiple Linear Regression [J].
Chen, Hai-Feng .
CHEMICAL BIOLOGY & DRUG DESIGN, 2009, 74 (02) :142-147
[10]   Real External Predictivity of QSAR Models: How To Evaluate It? Comparison of Different Validation Criteria and Proposal of Using the Concordance Correlation Coefficient [J].
Chirico, Nicola ;
Gramatica, Paola .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2011, 51 (09) :2320-2335