Ultra-High Dimensional Single-Index Quantile Regression

被引:1
作者
Zhang, Yuankun [1 ]
Lian, Heng [2 ]
Yu, Yan [3 ]
机构
[1] Univ Cincinnati, Dept Math Sci, Cincinnati, OH 45221 USA
[2] City Univ Hong Kong, Dept Math, Hong Kong, Peoples R China
[3] Univ Cincinnati, Dept Operat Business Analyt & Informat Syst, Cincinnati, OH 45221 USA
关键词
High-dimensional data; nonparametric; oracle property; variable selection; SCAD; NONCONCAVE PENALIZED LIKELIHOOD; VARIABLE SELECTION; MODELS; SCORE;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We consider a flexible semiparametric single-index quantile regression model where the number of covariates may be ultra-high dimensional, and the number of the relevant covariates is potentially diverging. The approach is particularly appealing to uncover the complex heterogeneity in high-dimensional data, incorporate nonlinearity and potential interaction, avoid the curse of dimensionality, and allow different variables to be included at different quantile levels. We estimate the unknown function via polynomial splines nonparametrically and adopt a nonconvex penalty function to identify the sparse variable set. We further extend it to partially linear single-index quantile model where both the single-index components in the nonparametric term and the partially linear components can be in ultra-high dimension. However, a number of major challenges arise in developing both theory and computation: (a) The model is highly nonlinear in single-index coefficients because the high-dimensional single-index covariates are embedded inside the unknown flexible function. (b) The data are ultra-high dimensional where the dimension of the single-index covariates (p(n)) is diverging or even in the exponential order of sample size n. (c) The objective function is non-smooth for quantile regression. (d) Nonconvex variable selection such as SCAD is adopted for regularization. (e) The extended partially linear single-index quantile models may include both ultra-high dimensional (p(n)) single-index covariates and ultra-high dimensional (q(n)) partially linear covariates. We develop a novel approach using empirical process techniques in establishing the theoretical properties of the nonconvex penalized estimators for partially linear single-index quantile models and show those estimators indeed possess the oracle property in ultra-high dimensional setting. We propose an efficient algorithm to circumvent the computational challenges. The results of Monte Carlo simulations and an application to gene expression data demonstrate the effectiveness of the proposed models and estimation method.
引用
收藏
页数:25
相关论文
共 63 条
[1]   FAST GLOBAL CONVERGENCE OF GRADIENT METHODS FOR HIGH-DIMENSIONAL STATISTICAL RECOVERY [J].
Agarwal, Alekh ;
Negahban, Sahand ;
Wainwright, Martin J. .
ANNALS OF STATISTICS, 2012, 40 (05) :2452-2482
[2]  
Alquier P, 2013, J MACH LEARN RES, V14, P243
[3]  
[Anonymous], 2014, ADV NEURAL INFORM PR
[4]   l1-PENALIZED QUANTILE REGRESSION IN HIGH-DIMENSIONAL SPARSE MODELS [J].
Belloni, Alexandre ;
Chernozhukov, Victor .
ANNALS OF STATISTICS, 2011, 39 (01) :82-130
[5]  
Bhatia K., 2015, ADV NEURAL INFORM PR
[6]   Generalized partially linear single-index models [J].
Carroll, RJ ;
Fan, JQ ;
Gijbels, I ;
Wand, MP .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1997, 92 (438) :477-489
[7]  
Chen H, 2019, J MACH LEARN RES, V20
[8]   THE EFM APPROACH FOR SINGLE-INDEX MODELS [J].
Cui, Xia ;
Haerdle, Wolfgang Karl ;
Zhu, Lixing .
ANNALS OF STATISTICS, 2011, 39 (03) :1658-1688
[9]   SLICING REGRESSION - A LINK-FREE REGRESSION METHOD [J].
DUAN, N ;
LI, KC .
ANNALS OF STATISTICS, 1991, 19 (02) :505-530
[10]   Sure independence screening for ultrahigh dimensional feature space [J].
Fan, Jianqing ;
Lv, Jinchi .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2008, 70 :849-883