Least absolute deviation estimator-bridge variable selection and estimation for quantitative structure-activity relationship model

被引:7
作者
Al-Dabbagh, Zainab Tawfeeq [1 ]
Algamal, Zakariya Yahya [2 ]
机构
[1] Univ Mosul, Dept Operat Res & Artificial Intelligence, Mosul, Iraq
[2] Univ Mosul, Dept Stat & Informat, Mosul, Iraq
关键词
bridge penalty; heavy-tailed distribution; LAD; penalized method; QSAR; NONCONCAVE PENALIZED LIKELIHOOD; REGRESSION SHRINKAGE; ADAPTIVE LASSO;
D O I
10.1002/cem.3139
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Regression models are frequently encountered in many scientific fields, especially in quantitative structure-activity relationship (QSAR) modeling. The traditional estimation of regression model parameters is based on the normal assumption of the response variable, and, therefore, it is sensitive to outliers or heavy-tailed distributions. Robust penalized regression methods have been given considerable attention because they combine the robust estimation method with penalty terms to perform parameter estimation and variable selection simultaneously. In this paper, based on the bridge penalty, a robust variable selection and parameter estimation is proposed as a method that is resistant to the existence of outliers or heavy-tailed errors. The basic idea is to combine the least absolute deviation estimator (LAD) and the bridge penalty together to produce the LAD-bridge method. The effectiveness of the proposed method is examined through simulation studies and application to real chemometrics data. The obtained results confirm that the LAD-bridge can significantly reduce the prediction error compared with other existing methods.
引用
收藏
页数:8
相关论文
共 25 条
[1]   A sparse QSRR model for predicting retention indices of essential oils based on robust screening approach [J].
Al-Fakih, A. M. ;
Algamal, Z. Y. ;
Lee, M. H. ;
Aziz, M. .
SAR AND QSAR IN ENVIRONMENTAL RESEARCH, 2017, 28 (08) :691-703
[2]   High-dimensional QSAR modelling using penalized linear regression model with L1/2-norm [J].
Algamal, Z. Y. ;
Lee, M. H. ;
Al-Fakih, A. M. ;
Aziz, M. .
SAR AND QSAR IN ENVIRONMENTAL RESEARCH, 2016, 27 (09) :703-719
[3]   High-dimensional quantitative structure-activity relationship modeling of influenza neuraminidase a/PR/8/34 (H1N1) inhibitors based on a two-stage adaptive penalized rank regression [J].
Algamal, Zakariya Yahya ;
Lee, Muhammad Hisyam ;
Al-Fakih, Abdo Mohammed .
JOURNAL OF CHEMOMETRICS, 2016, 30 (02) :50-57
[4]   High-dimensional QSAR prediction of anticancer potency of imidazo[4,5-b]pyridine derivatives using adjusted adaptive LASSO [J].
Algamal, Zakariya Yahya ;
Lee, Muhammad Hisyam ;
Al-Fakih, Abdo M. ;
Aziz, Madzlan .
JOURNAL OF CHEMOMETRICS, 2015, 29 (10) :547-556
[5]   Weighted LAD-LASSO method for robust parameter estimation and variable selection in regression [J].
Arslan, Olcay .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2012, 56 (06) :1952-1965
[6]   Variable selection via nonconcave penalized likelihood and its oracle properties [J].
Fan, JQ ;
Li, RZ .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2001, 96 (456) :1348-1360
[7]   A STATISTICAL VIEW OF SOME CHEMOMETRICS REGRESSION TOOLS [J].
FRANK, IE ;
FRIEDMAN, JH .
TECHNOMETRICS, 1993, 35 (02) :109-135
[8]   Penalized regressions: The bridge versus the lasso [J].
Fu, WJJ .
JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 1998, 7 (03) :397-416
[9]  
Gao XL, 2010, STAT SINICA, V20, P1485
[10]   Asymptotic properties of bridge estimators in sparse high-dimensional regression models [J].
Huang, Jian ;
Horowitz, Joel L. ;
Ma, Shuangge .
ANNALS OF STATISTICS, 2008, 36 (02) :587-613