Smoothly Clipped Absolute Deviation on High Dimensions

被引:201
作者
Kim, Yongdai [1 ]
Choi, Hosik [2 ]
Oh, Hee-Seok [1 ]
机构
[1] Seoul Natl Univ, Dept Stat, Seoul, South Korea
[2] Hoseo Univ, Dept Informat Stat, Asan, Chungnam, South Korea
关键词
High dimension; Oracle property; Regression; Regularization; Smoothly clipped absolutely deviation penalty;
D O I
10.1198/016214508000001066
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
The smoothly clipped absolute deviation (SCAD) estimator, proposed by Fan and Li, has many desirable properties, including continuity, sparsity, and unbiasedness. The SCAD estimator also has the (asymptotically) oracle property when the dimension of covariates is fixed or diverges more slowly than the sample size. In this article we study the SCAD estimator in high-dimensional settings where the dimension of covariates can be much larger than the sample size. First, we develop and efficient optimization algorithm that is fast and always converges to a local minimum. Second, we prove that the SCAD estimator still has the oracle property on high-dimensional problems. We perform numerical studies to compare the SCAD estimator with the LASSO and SIS-SCAD estimators in terms of prediction accuracy and variable selectivity when the true model is sparse. Through the simulation, we show that the variance estimator of Fan and Li still works well for some limited high-dimensional cases where the true nonzero coefficients are not too small and the sample size is moderately large. We apply the proposed algorithm to analyze a high-dimensional microarray data set.
引用
收藏
页码:1665 / 1673
页数:9
相关论文
共 25 条
[1]  
An LTH, 1997, J GLOBAL OPTIM, V11, P253
[2]   Regularization of wavelet approximations - Rejoinder [J].
Antoniadis, A ;
Fan, J .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2001, 96 (455) :964-967
[3]  
Bertsekas D., 1999, NONLINEAR PROGRAMMIN
[4]  
Breiman L, 1996, ANN STAT, V24, P2350
[5]  
Collins P, 2000, NAT REV NEUROSCI, V1, P7, DOI 10.1038/35036178
[6]   Sure independence screening for ultrahigh dimensional feature space [J].
Fan, Jianqing ;
Lv, Jinchi .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2008, 70 :849-883
[7]   Nonconcave penalized likelihood with a diverging number of parameters [J].
Fan, JQ ;
Peng, H .
ANNALS OF STATISTICS, 2004, 32 (03) :928-961
[8]   Variable selection via nonconcave penalized likelihood and its oracle properties [J].
Fan, JQ ;
Li, RZ .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2001, 96 (456) :1348-1360
[9]   A STATISTICAL VIEW OF SOME CHEMOMETRICS REGRESSION TOOLS [J].
FRANK, IE ;
FRIEDMAN, JH .
TECHNOMETRICS, 1993, 35 (02) :109-135
[10]  
Friedman JH, 2004, GRADIENT DIRECTED RE