On constrained and regularized high-dimensional regression

被引:55
作者
Shen, Xiaotong [1 ]
Pan, Wei [2 ]
Zhu, Yunzhang [1 ]
Zhou, Hui [2 ]
机构
[1] Univ Minnesota, Sch Stat, Minneapolis, MN 55455 USA
[2] Univ Minnesota, Div Biostat, Minneapolis, MN 55455 USA
关键词
Constrained regression; Parameter and nonparametric models; Nonconvex regularization; Difference convex programming; (p; n) versus fixed p-asymptotics; NONCONCAVE PENALIZED LIKELIHOOD; MODEL SELECTION; VARIABLE SELECTION; LASSO;
D O I
10.1007/s10463-012-0396-3
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
High-dimensional feature selection has become increasingly crucial for seeking parsimonious models in estimation. For selection consistency, we derive one necessary and sufficient condition formulated on the notion of degree of separation. The minimal degree of separation is necessary for any method to be selection consistent. At a level slightly higher than the minimal degree of separation, selection consistency is achieved by a constrained -method and its computational surrogate-the constrained truncated -method. This permits up to exponentially many features in the sample size. In other words, these methods are optimal in feature selection against any selection method. In contrast, their regularization counterparts-the -regularization and truncated -regularization methods enable so under slightly stronger assumptions. More importantly, sharper parameter estimation/prediction is realized through such selection, leading to minimax parameter estimation. This, otherwise, is impossible in the absence of a good selection method for high-dimensional analysis.
引用
收藏
页码:807 / 832
页数:26
相关论文
共 25 条
[1]  
Akaike H., 1973, 2 INT S INFORM THEOR, P267
[2]   Extended Bayesian information criteria for model selection with large model spaces [J].
Chen, Jiahua ;
Chen, Zehua .
BIOMETRIKA, 2008, 95 (03) :759-771
[3]  
Chen SSB, 2001, SIAM REV, V43, P129, DOI [10.1137/S003614450037906X, 10.1137/S1064827596304010]
[4]   Variable selection via nonconcave penalized likelihood and its oracle properties [J].
Fan, JQ ;
Li, RZ .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2001, 96 (456) :1348-1360
[5]  
Gu C, 1998, STAT SINICA, V8, P607
[6]  
Ibragimov IA, 1981, Statistical Estimation
[7]  
Judge GG., 1978, STAT IMPLICATIONS PR
[8]   Smoothly Clipped Absolute Deviation on High Dimensions [J].
Kim, Yongdai ;
Choi, Hosik ;
Oh, Hee-Seok .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2008, 103 (484) :1665-1673
[9]  
Liu W., 2010, CONSISTENCY BI UNPUB
[10]   A UNIFIED APPROACH TO MODEL SELECTION AND SPARSE RECOVERY USING REGULARIZED LEAST SQUARES [J].
Lv, Jinchi ;
Fan, Yingying .
ANNALS OF STATISTICS, 2009, 37 (6A) :3498-3528