Nonparametric Independence Screening in Sparse Ultra-High-Dimensional Varying Coefficient Models

被引:183
作者
Fan, Jianqing [1 ]
Ma, Yunbei [3 ]
Dai, Wei [2 ]
机构
[1] Chinese Acad Sci, Acad Math & Syst Sci, Ctr Stat Res, Beijing 100080, Peoples R China
[2] Princeton Univ, Dept Operat Res & Financial Engn, Princeton, NJ 08544 USA
[3] Southwestern Univ Finance & Econ, Sch Stat, Chengdu 611130, Sichuan, Peoples R China
基金
美国国家科学基金会; 美国国家卫生研究院; 中国国家自然科学基金;
关键词
False positive rates; Sure independence screening; Conditional permutation; Sparsity; Variable selection; NONCONCAVE PENALIZED LIKELIHOOD; VARIABLE SELECTION; ORACLE PROPERTIES; REGRESSION; REGULARIZATION; LASSO;
D O I
10.1080/01621459.2013.879828
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
The varying coefficient model is an important class of nonparametric statistical model, which allows us to examine how the effects of covariates vary with exposure variables. When the number of covariates is large, the issue of variable selection arises. In this article, we propose and investigate marginal nonparametric screening methods to screen variables in sparse ultra-high-dimensional varying coefficient models. The proposed nonparametric independence screening (NIS) selects variables by ranking a measure of the nonparametric marginal contributions of each covariate given the exposure variable. The sure independent screening property is established under some mild technical conditions when the dimensionality is of nonpolynomial order, and the dimensionality reduction of NIS is quantified. To enhance the practical utility and finite sample performance, two data-driven iterative NIS (INIS) methods are proposed for selecting thresholding parameters and variables: conditional permutation and greedy methods, resulting in conditional-INIS and greedy-INIS. The effectiveness and flexibility of the proposed methods are further illustrated by simulation studies and real data applications.
引用
收藏
页码:1270 / 1284
页数:15
相关论文
共 26 条
[1]  
[Anonymous], 1978, A Practical Guide to Splines
[2]  
[Anonymous], 2006, Journal of the Royal Statistical Society, Series B
[3]   Regularization of wavelet approximations - Rejoinder [J].
Antoniadis, A ;
Fan, J .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2001, 96 (455) :964-967
[4]  
Candes E, 2007, ANN STAT, V35, P2313, DOI 10.1214/009053606000001523
[5]   Sure independence screening for ultrahigh dimensional feature space [J].
Fan, Jianqing ;
Lv, Jinchi .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2008, 70 :849-883
[6]   Nonparametric Independence Screening in Sparse Ultra-High-Dimensional Additive Models [J].
Fan, Jianqing ;
Feng, Yang ;
Song, Rui .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2011, 106 (494) :544-557
[7]   SURE INDEPENDENCE SCREENING IN GENERALIZED LINEAR MODELS WITH NP-DIMENSIONALITY [J].
Fan, Jianqing ;
Song, Rui .
ANNALS OF STATISTICS, 2010, 38 (06) :3567-3604
[8]  
Fan JQ, 2008, STAT INTERFACE, V1, P179
[9]   Variable selection via nonconcave penalized likelihood and its oracle properties [J].
Fan, JQ ;
Li, RZ .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2001, 96 (456) :1348-1360
[10]   Generalized likelihood ratio statistics and Wilks phenomenon [J].
Fan, JQ ;
Zhang, CM ;
Zhang, J .
ANNALS OF STATISTICS, 2001, 29 (01) :153-193