Post selection shrinkage estimation for high-dimensional data analysis

被引:26
作者
Gao, Xiaoli [1 ]
Ahmed, S. E. [2 ]
Feng, Yang [3 ]
机构
[1] Univ North Carolina Greensboro, Dept Math & Stat, Greensboro, NC 27412 USA
[2] Brock Univ, Dept Math, St Catharines, ON, Canada
[3] Columbia Univ, Dept Stat, New York, NY USA
基金
美国国家科学基金会;
关键词
asymptotic risk; lasso; ridge regression; (positive) shrinkage estimation; post selection; sparse model; NONCONCAVE PENALIZED LIKELIHOOD; VARIABLE SELECTION; ADAPTIVE LASSO; REGRESSION; SPARSITY;
D O I
10.1002/asmb.2193
中图分类号
C93 [管理学]; O22 [运筹学];
学科分类号
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
摘要
In high-dimensional data settings where p >> n, many penalized regularization approaches were studied for simultaneous variable selection and estimation. However, with the existence of covariates with weak effect, many existing variable selection methods, including Lasso and its generations, cannot distinguish covariates with weak and no contribution. Thus, prediction based on a subset model of selected covariates only can be inefficient. In this paper, we propose a post selection shrinkage estimation strategy to improve the prediction performance of a selected subset model. Such a post selection shrinkage estimator (PSE) is data adaptive and constructed by shrinking a post selection weighted ridge estimator in the direction of a selected candidate subset. Under an asymptotic distributional quadratic risk criterion, its prediction performance is explored analytically. We show that the proposed post selection PSE performs better than the post selection weighted ridge estimator. More importantly, it improves the prediction performance of any candidate subset model selected from most existing Lasso-type variable selection methods significantly. The relative performance of the post selection PSE is demonstrated by both simulation studies and real-data analysis. (C) 2016 John Wiley & Sons, Ltd.
引用
收藏
页码:97 / 120
页数:24
相关论文
共 33 条
[1]   Shrinkage, pretest and absolute penalty estimators in partially linear models [J].
Ahmed, S. Ejaz ;
Doksum, Kjell A. ;
Hossain, S. ;
You, Jinhong .
AUSTRALIAN & NEW ZEALAND JOURNAL OF STATISTICS, 2007, 49 (04) :435-454
[2]   Shrinkage estimation strategy in quasi-likelihood models [J].
Ahmed, S. Ejaz ;
Fallahpour, Saber .
STATISTICS & PROBABILITY LETTERS, 2012, 82 (12) :2170-2179
[3]   LASSO and shrinkage estimation in Weibull censored regression models [J].
Ahmed, S. Ejaz ;
Hossain, Shakhawat ;
Doksum, Kjell A. .
JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2012, 142 (06) :1273-1284
[4]  
Ahmed SE., 2014, PENALTY SHRINKAGE PR
[5]  
[Anonymous], 1995, Economic growth, 1995
[6]  
Barro R. J., 1994, DATA SET PANEL 139 C
[7]   Least squares after model selection in high-dimensional sparse models [J].
Belloni, Alexandre ;
Chernozhukov, Victor .
BERNOULLI, 2013, 19 (02) :521-547
[8]   SIMULTANEOUS ANALYSIS OF LASSO AND DANTZIG SELECTOR [J].
Bickel, Peter J. ;
Ritov, Ya'acov ;
Tsybakov, Alexandre B. .
ANNALS OF STATISTICS, 2009, 37 (04) :1705-1732
[9]   Atomic decomposition by basis pursuit [J].
Chen, SSB ;
Donoho, DL ;
Saunders, MA .
SIAM JOURNAL ON SCIENTIFIC COMPUTING, 1998, 20 (01) :33-61
[10]  
Durlauf SN, 2005, HANDB ECON, V22, P555