LARGE SAMPLE PROPERTIES OF THE SCAD-PENALIZED MAXIMUM LIKELIHOOD ESTIMATION ON HIGH DIMENSIONS

被引:32
作者
Kwon, Sunghoon [1 ]
Kim, Yongdai [2 ]
机构
[1] Univ Minnesota, Sch Stat, Minneapolis, MN 55455 USA
[2] Seoul Natl Univ, Dept Stat, Seoul 151742, South Korea
基金
新加坡国家研究基金会;
关键词
High dimension; maximum likelihood estimator; MLE; oracle property; SCAD; smoothly clipped absolute deviation; variable selection; VARIABLE SELECTION; LASSO;
D O I
10.5705/ss.2010.027
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
In this paper, we study large sample properties of smoothly clipped absolute deviation (SCAD) penalized maximum likelihood estimation for high-dimensional parameters. First, we prove that the oracle maximum likelihood estimator (MLE) asymptotically becomes a local maximizer of the SCAD-penalized log-likelihood, even when the number of parameters is much larger than the sample size; the oracle MLE is an ideal non-penalized MLE obtained by deleting all irrelevant parameters in advance. Second, we prove that if the log-likelihood is strictly concave, the oracle MLE asymptotically becomes the global maximizer of the SCAD-penalized log-likelihood with a diverging number of parameters that is less than the sample size. Various numerical experiments on simulated data sets are presented to verify the theoretical results, and two data examples are analyzed.
引用
收藏
页码:629 / 653
页数:25
相关论文
共 26 条
[1]   Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays [J].
Alon, U ;
Barkai, N ;
Notterman, DA ;
Gish, K ;
Ybarra, S ;
Mack, D ;
Levine, AJ .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1999, 96 (12) :6745-6750
[2]   Regularization of wavelet approximations - Rejoinder [J].
Antoniadis, A ;
Fan, J .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2001, 96 (455) :964-967
[3]  
Breiman L, 1996, ANN STAT, V24, P2350
[4]   Estimation and comparison of multiple change-point models [J].
Chib, S .
JOURNAL OF ECONOMETRICS, 1998, 86 (02) :221-241
[5]   Comparison of discrimination methods for the classification of tumors using gene expression data [J].
Dudoit, S ;
Fridlyand, J ;
Speed, TP .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2002, 97 (457) :77-87
[6]   Sure independence screening for ultrahigh dimensional feature space [J].
Fan, Jianqing ;
Lv, Jinchi .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2008, 70 :849-883
[7]  
Fan JQ, 2010, STAT SINICA, V20, P101
[8]   Nonconcave penalized likelihood with a diverging number of parameters [J].
Fan, JQ ;
Peng, H .
ANNALS OF STATISTICS, 2004, 32 (03) :928-961
[9]   Variable selection via nonconcave penalized likelihood and its oracle properties [J].
Fan, JQ ;
Li, RZ .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2001, 96 (456) :1348-1360
[10]   A STATISTICAL VIEW OF SOME CHEMOMETRICS REGRESSION TOOLS [J].
FRANK, IE ;
FRIEDMAN, JH .
TECHNOMETRICS, 1993, 35 (02) :109-135