Two-step hybrid modeling for variable selection and estimation: An application to quantitative structure activity relationship study

被引:2
作者
Oranye, Henrietta Ebele [1 ,2 ]
Ugwuowo, Fidelis Ifeanyi [1 ]
Arum, Kingsley Chinedu [1 ]
机构
[1] Univ Nigeria, Dept Stat, Nsukka, Nigeria
[2] Univ Nigeria, Dept Stat, Nsukka, Enugu, Nigeria
关键词
cross-validation; jackknife; molecular descriptors; random forest; variable selection; ADAPTIVE LASSO; REGRESSION; QSAR; CLASSIFICATION;
D O I
10.1002/cem.3522
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this study, we developed a simple technique for effective parameter estimation and prediction of the quantitative structure activity relationship studies using a two-step procedure. The first step is to choose the important molecular descriptors using the random forest regression, and the second step is to optimally predict the biological activity of the selected chemical compounds using the following estimators: ridge regression, jackknife ridge, Liu regression, jackknife Liu, Kibria-Lukman, and jackknife Kibria-Lukman. We conducted a simulation study and a real-life analysis with a quantitative structure-activity relationship (QSAR) data with 2540 descriptors after preprocessing. The optimal prediction is determined using the cross-validation error. The estimator with minimum cross-validation error is considered best. It is obvious that performing jackknife estimation after random forest selection is preferred. In this study, we developed a simple technique for effective parameter estimation and prediction of the quantitative structure activity relationship studies (QSAR) using a two-step procedure. We conducted a simulation study and a real-life application with QSAR data with 2540 descriptors after preprocessing. The optimal prediction is determined using the cross-validation error. The performance of the methods is judged using the root mean squared error of prediction. It is obvious that performing jackknife estimation after random forest selection is preferred.
引用
收藏
页数:9
相关论文
共 50 条
[31]   Modeling the Cellular Uptake of Magnetofluorescent Nanoparticles in Pancreatic Cancer Cells: A Quantitative Structure Activity Relationship Study [J].
Ghorbanzadeh, Mehdi ;
Fatemi, Mohammad H. ;
Karimpour, Masoumeh .
INDUSTRIAL & ENGINEERING CHEMISTRY RESEARCH, 2012, 51 (32) :10712-10718
[32]   Quantitative structure-activity relationship study of bitter peptides [J].
Kim, Hyun-Ock ;
Li-Chan, Eunice C. Y. .
JOURNAL OF AGRICULTURAL AND FOOD CHEMISTRY, 2006, 54 (26) :10102-10111
[33]   Quantitative structure-activity relationship study of antitubercular fluoroquinolones [J].
Minovski, Nikola ;
Vracko, Marjan ;
Solmajer, Tom .
MOLECULAR DIVERSITY, 2011, 15 (02) :417-426
[34]   Quantitative Structure - Activity Relationship Study on Saponins as Cytotoxicity Enhancers [J].
Gevrenova, Reneta ;
Weng, Alexander ;
Voutguenne-Nazabadioko, Laurence ;
Thakur, Mayank ;
Doytchinova, Irini .
LETTERS IN DRUG DESIGN & DISCOVERY, 2015, 12 (03) :166-171
[35]   Quantitative Structure-Activity Relationship Study on Pyrrolotriazine Derivatives as Met Kinase Inhibitors [J].
Sharma, B. K. ;
Yashwant ;
Srivastava, B. .
ASIAN JOURNAL OF CHEMISTRY, 2010, 22 (10) :8231-8245
[36]   Investigations on Inhibitors of Hedgehog Signal Pathway: A Quantitative Structure-Activity Relationship Study [J].
Zhu, Ruixin ;
Liu, Qi ;
Tang, Jian ;
Li, Huiliang ;
Cao, Zhiwei .
INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2011, 12 (05) :3018-3033
[37]   Pro-apoptotic properties of parthenin analogs: a quantitative structure–activity relationship study [J].
Rukmankesh Mehra ;
Amit Nargotra ;
Bhahwal A. Shah ;
Subhash C. Taneja ;
Ram A. Vishwakarma ;
Surrinder Koul .
Medicinal Chemistry Research, 2013, 22 :2303-2311
[38]   Quantitative structure-activity relationship study on the inhibitors of fatty acid amide hydrolase [J].
Lu, Peng ;
Zhang, Ruisheng ;
Yuan, Yongna ;
Gong, Zhiguo .
JOURNAL OF CHEMOMETRICS, 2010, 24 (9-10) :565-573
[39]   Quantitative Structure Activity Relationship Modeling for Predicting Radiosensitization Effectiveness of Nitroimidazole Compounds [J].
Long, Wei ;
Liu, Peixun .
JOURNAL OF RADIATION RESEARCH, 2010, 51 (05) :563-572
[40]   Quantitative structure-activity relationship modeling of bioconcentration factors of polychlorinated biphenyls [J].
Katritzky A.R. ;
Radzvilovits M. ;
Slavov S. ;
Kasemets K. ;
Tamm K. ;
Karelson M. .
Toxicological and Environmental Chemistry, 2010, 92 (07) :1233-1247