Robust nonnegative garrote variable selection in linear regression

被引:18
作者
Gijbels, I. [1 ]
Vrinssen, I.
机构
[1] Katholieke Univ Leuven, Dept Math, B-3001 Leuven, Belgium
关键词
Multiple linear regression; MM-estimation; Nonnegative garrote; S-estimation; Variable selection; HIGH BREAKDOWN-POINT; SHRINKAGE;
D O I
10.1016/j.csda.2014.11.009
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Robust selection of variables in a linear regression model is investigated. Many variable selection methods are available, but very few methods are designed to avoid sensitivity to vertical outliers as well as to leverage points. The nonnegative garrote method is a powerful variable selection method, developed originally for linear regression but recently successfully extended to more complex regression models. The method has good performances and its theoretical properties have been established. The aim is to robustify the nonnegative garrote method for linear regression as to make it robust to vertical outliers and leverage points. Several approaches are discussed, and recommendations towards a final good performing robust nonnegative garrote method are given. The proposed method is evaluated via a simulation study that also includes a comparison with existing methods. The method performs very well, and often outperforms existing methods. A real data application illustrates the use of the method in practice. (C) 2014 Elsevier B.V. All rights reserved.
引用
收藏
页码:1 / 22
页数:22
相关论文
共 32 条
[1]   Robust Model Selection with LARS Based on S-estimators [J].
Agostinelli, Claudio ;
Salibian-Barrera, Matias .
COMPSTAT'2010: 19TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL STATISTICS, 2010, :69-78
[2]   SPARSE LEAST TRIMMED SQUARES REGRESSION FOR ANALYZING HIGH-DIMENSIONAL LARGE DATA SETS [J].
Alfons, Andreas ;
Croux, Christophe ;
Gelper, Sarah .
ANNALS OF APPLIED STATISTICS, 2013, 7 (01) :226-248
[3]  
[Anonymous], 2006, TECHNICAL REPORT
[4]  
[Anonymous], 2006, Wiley Series in Probability and Statistics, DOI DOI 10.1002/0470010940
[5]   Variable Selection in Additive Models Using P-Splines [J].
Antoniadis, Anestis ;
Gijbels, Irene ;
Verhasselt, Anneleen .
TECHNOMETRICS, 2012, 54 (04) :425-438
[6]   Variable Selection in Varying-Coefficient Models Using P-Splines [J].
Antoniadis, Anestis ;
Gijbels, Irene ;
Verhasselt, Anneleen .
JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2012, 21 (03) :638-661
[7]   Weighted LAD-LASSO method for robust parameter estimation and variable selection in regression [J].
Arslan, Olcay .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2012, 56 (06) :1952-1965
[8]   Efficient determination of multiple regularization parameters in a generalized L-curve framework [J].
Belge, M ;
Kilmer, ME ;
Miller, EL .
INVERSE PROBLEMS, 2002, 18 (04) :1161-1183
[9]   BETTER SUBSET REGRESSION USING THE NONNEGATIVE GARROTE [J].
BREIMAN, L .
TECHNOMETRICS, 1995, 37 (04) :373-384
[10]   Breakdown and groups - Rejoinder [J].
Davies, PL ;
Gather, U .
ANNALS OF STATISTICS, 2005, 33 (03) :1016-1035