Two-step hybrid modeling for variable selection and estimation: An application to quantitative structure activity relationship study

被引:2
作者
Oranye, Henrietta Ebele [1 ,2 ]
Ugwuowo, Fidelis Ifeanyi [1 ]
Arum, Kingsley Chinedu [1 ]
机构
[1] Univ Nigeria, Dept Stat, Nsukka, Nigeria
[2] Univ Nigeria, Dept Stat, Nsukka, Enugu, Nigeria
关键词
cross-validation; jackknife; molecular descriptors; random forest; variable selection; ADAPTIVE LASSO; REGRESSION; QSAR; CLASSIFICATION;
D O I
10.1002/cem.3522
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this study, we developed a simple technique for effective parameter estimation and prediction of the quantitative structure activity relationship studies using a two-step procedure. The first step is to choose the important molecular descriptors using the random forest regression, and the second step is to optimally predict the biological activity of the selected chemical compounds using the following estimators: ridge regression, jackknife ridge, Liu regression, jackknife Liu, Kibria-Lukman, and jackknife Kibria-Lukman. We conducted a simulation study and a real-life analysis with a quantitative structure-activity relationship (QSAR) data with 2540 descriptors after preprocessing. The optimal prediction is determined using the cross-validation error. The estimator with minimum cross-validation error is considered best. It is obvious that performing jackknife estimation after random forest selection is preferred. In this study, we developed a simple technique for effective parameter estimation and prediction of the quantitative structure activity relationship studies (QSAR) using a two-step procedure. We conducted a simulation study and a real-life application with QSAR data with 2540 descriptors after preprocessing. The optimal prediction is determined using the cross-validation error. The performance of the methods is judged using the root mean squared error of prediction. It is obvious that performing jackknife estimation after random forest selection is preferred.
引用
收藏
页数:9
相关论文
共 50 条
[41]   Quantitative Structure-Activity Relationship (QSAR) modeling to predict the transfer of environmental chemicals across the placenta [J].
Leveque, Laura ;
Tahiri, Nadia ;
Goldsmith, Michael-Rock ;
Verner, Marc-Andre .
COMPUTATIONAL TOXICOLOGY, 2022, 21
[42]   Structural Similarity Based Kriging for Quantitative Structure Activity and Property Relationship Modeling [J].
Teixeira, Ana L. ;
Falcao, Andre O. .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2014, 54 (07) :1833-1849
[43]   Application of Hansch's model to guaianolide ester derivatives:: A quantitative structure-activity relationship study [J].
Macías, FA ;
Velasco, RF ;
Castellano, D ;
Galindo, JCG .
JOURNAL OF AGRICULTURAL AND FOOD CHEMISTRY, 2005, 53 (09) :3530-3539
[44]   A Quantitative Structure-Activity Relationship (QSAR) study of the antioxidant activity of flavonoids [J].
Rasulev, BF ;
Abdullaev, ND ;
Syrov, VN ;
Leszczynski, J .
QSAR & COMBINATORIAL SCIENCE, 2005, 24 (09) :1056-1065
[45]   Structure Activity Relationship and Quantitative Structure-Activity Relationships Modeling of Antitrypanosomal Activities of Alkyldiamine Cryptolepine Derivatives [J].
Belaidi, Salah ;
Salah, Toufik ;
Melkemi, Nadjib ;
Sinha, Leena ;
Prasad, Onkar .
JOURNAL OF COMPUTATIONAL AND THEORETICAL NANOSCIENCE, 2015, 12 (09) :2421-2427
[46]   Quantitative Structure-Activity Relationship Study of Aromatic Inhibitors Against Rat Lens Aldose Reductase Activity Using Variable Selections [J].
Jung, Mankil ;
Lee, Yongnam ;
Shim, Minjoo ;
Lim, Eunyoung ;
Lee, Eun Jig ;
Lee, Hyun Chul .
MEDICINAL CHEMISTRY, 2013, 9 (03) :410-419
[47]   Structure-based quantitative structure-activity relationship modeling of estrogen receptor β-ligands [J].
Dong, Xialan ;
Hilliard, Solomon G. ;
Zheng, Weifan .
FUTURE MEDICINAL CHEMISTRY, 2011, 3 (08) :933-945
[48]   Two-step approach for assessing the health effects of environmental chemical mixtures: application to simulated datasets and real data from the Navajo Birth Cohort Study [J].
Luo, Li ;
Hudson, Laurie G. ;
Lewis, Johnnye ;
Lee, Ji-Hyun .
ENVIRONMENTAL HEALTH, 2019, 18
[49]   A Quantitative Structure Activity Relationship for acute oral toxicity of pesticides on rats: Validation, domain of application and prediction [J].
Hamadache, Mabrouk ;
Benkortbi, Othmane ;
Hanini, Salah ;
Amrane, Abdeltif ;
Khaouane, Latifa ;
Moussa, Cherif Si .
JOURNAL OF HAZARDOUS MATERIALS, 2016, 303 :28-40
[50]   Prediction of the relationship between the structural features of andrographolide derivatives and α-glucosidase inhibitory activity: A quantitative structure-activity relationship (QSAR) Study [J].
Moorthy, N. S. Hari Narayana ;
Ramos, Maria J. ;
Fernandes, Pedro A. .
JOURNAL OF ENZYME INHIBITION AND MEDICINAL CHEMISTRY, 2011, 26 (01) :78-87