CALIBRATING NONCONVEX PENALIZED REGRESSION IN ULTRA-HIGH DIMENSION

被引:127
作者
Wang, Lan [1 ]
Kim, Yongdai [2 ]
Li, Runze [3 ,4 ]
机构
[1] Univ Minnesota, Sch Stat, Minneapolis, MN 55455 USA
[2] Seoul Natl Univ, Dept Stat, Seoul, South Korea
[3] Penn State Univ, Dept Stat, University Pk, PA 16802 USA
[4] Penn State Univ, Methodol Ctr, University Pk, PA 16802 USA
基金
中国国家自然科学基金; 美国国家科学基金会; 新加坡国家研究基金会;
关键词
High-dimensional regression; LASSO; MCP; SCAD; variable selection; penalized least squares; CLIPPED ABSOLUTE DEVIATION; VARIABLE SELECTION; MODEL SELECTION; DIVERGING NUMBER; ADAPTIVE LASSO; LIKELIHOOD; REGULARIZATION; SHRINKAGE; CRITERIA;
D O I
10.1214/13-AOS1159
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We investigate high-dimensional nonconvex penalized regression, where the number of covariates may grow at an exponential rate. Although recent asymptotic theory established that there exists a local minimum possessing the oracle property under general conditions, it is still largely an open problem how to identify the oracle estimator among potentially multiple local minima. There are two main obstacles: (1) due to the presence of multiple minima, the solution path is nonunique and is not guaranteed to contain the oracle estimator; (2) even if a solution path is known to contain the oracle estimator, the optimal tuning parameter depends on many unknown factors and is hard to estimate. To address these two challenging issues, we first prove that an easy-to-calculate calibrated CCCP algorithm produces a consistent solution path which contains the oracle estimator with probability approaching one. Furthermore, we propose a high-dimensional BIC criterion and show that it can be applied to the solution path to select the optimal tuning parameter which asymptotically identifies the oracle estimator. The theory for a general class of nonconvex penalties in the ultra-high dimensional setup is established when the random errors follow the sub-Gaussian distribution. Monte Carlo studies confirm that the calibrated CCCP algorithm combined with the proposed high-dimensional BIC has desirable performance in identifying the underlying sparsity pattern for high-dimensional data analysis.
引用
收藏
页码:2505 / 2536
页数:32
相关论文
共 39 条
[1]  
[Anonymous], 1997, ACTA MATH VIETNAM
[2]  
[Anonymous], 1999, Athena scientific Belmont
[3]   SIMULTANEOUS ANALYSIS OF LASSO AND DANTZIG SELECTOR [J].
Bickel, Peter J. ;
Ritov, Ya'acov ;
Tsybakov, Alexandre B. .
ANNALS OF STATISTICS, 2009, 37 (04) :1705-1732
[4]  
Bühlmann P, 2011, SPRINGER SER STAT, P1, DOI 10.1007/978-3-642-20192-9
[5]   Extended Bayesian information criteria for model selection with large model spaces [J].
Chen, Jiahua ;
Chen, Zehua .
BIOMETRIKA, 2008, 95 (03) :759-771
[6]   Nonconcave Penalized Likelihood With NP-Dimensionality [J].
Fan, Jianqing ;
Lv, Jinchi .
IEEE TRANSACTIONS ON INFORMATION THEORY, 2011, 57 (08) :5467-5484
[7]   Nonconcave penalized likelihood with a diverging number of parameters [J].
Fan, JQ ;
Peng, H .
ANNALS OF STATISTICS, 2004, 32 (03) :928-961
[8]   Variable selection via nonconcave penalized likelihood and its oracle properties [J].
Fan, JQ ;
Li, RZ .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2001, 96 (456) :1348-1360
[9]   A STATISTICAL VIEW OF SOME CHEMOMETRICS REGRESSION TOOLS [J].
FRANK, IE ;
FRIEDMAN, JH .
TECHNOMETRICS, 1993, 35 (02) :109-135
[10]   Angiotensin AT1 Receptor Antagonism Ameliorates Murine Retinal Proteome Changes Induced by Diabetes [J].
Gao, Ben-Bo ;
Phipps, Joanna A. ;
Bursell, Dahlia ;
Clermont, Allen C. ;
Feener, Edward P. .
JOURNAL OF PROTEOME RESEARCH, 2009, 8 (12) :5541-5549