Nonparametric Independence Screening in Sparse Ultra-High-Dimensional Varying Coefficient Models

被引:183
作者
Fan, Jianqing [1 ]
Ma, Yunbei [3 ]
Dai, Wei [2 ]
机构
[1] Chinese Acad Sci, Acad Math & Syst Sci, Ctr Stat Res, Beijing 100080, Peoples R China
[2] Princeton Univ, Dept Operat Res & Financial Engn, Princeton, NJ 08544 USA
[3] Southwestern Univ Finance & Econ, Sch Stat, Chengdu 611130, Sichuan, Peoples R China
基金
美国国家科学基金会; 美国国家卫生研究院; 中国国家自然科学基金;
关键词
False positive rates; Sure independence screening; Conditional permutation; Sparsity; Variable selection; NONCONCAVE PENALIZED LIKELIHOOD; VARIABLE SELECTION; ORACLE PROPERTIES; REGRESSION; REGULARIZATION; LASSO;
D O I
10.1080/01621459.2013.879828
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
The varying coefficient model is an important class of nonparametric statistical model, which allows us to examine how the effects of covariates vary with exposure variables. When the number of covariates is large, the issue of variable selection arises. In this article, we propose and investigate marginal nonparametric screening methods to screen variables in sparse ultra-high-dimensional varying coefficient models. The proposed nonparametric independence screening (NIS) selects variables by ranking a measure of the nonparametric marginal contributions of each covariate given the exposure variable. The sure independent screening property is established under some mild technical conditions when the dimensionality is of nonpolynomial order, and the dimensionality reduction of NIS is quantified. To enhance the practical utility and finite sample performance, two data-driven iterative NIS (INIS) methods are proposed for selecting thresholding parameters and variables: conditional permutation and greedy methods, resulting in conditional-INIS and greedy-INIS. The effectiveness and flexibility of the proposed methods are further illustrated by simulation studies and real data applications.
引用
收藏
页码:1270 / 1284
页数:15
相关论文
共 26 条
[21]   Variable Selection in Nonparametric Varying-Coefficient Models for Analysis of Repeated Measurements [J].
Wang, Lifeng ;
Li, Hongzhe ;
Huang, Jianhua Z. .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2008, 103 (484) :1556-1569
[22]   NEARLY UNBIASED VARIABLE SELECTION UNDER MINIMAX CONCAVE PENALTY [J].
Zhang, Cun-Hui .
ANNALS OF STATISTICS, 2010, 38 (02) :894-942
[23]   Principled sure independence screening for Cox models with ultra-high-dimensional covariates [J].
Zhao, Sihai Dave ;
Li, Yi .
JOURNAL OF MULTIVARIATE ANALYSIS, 2012, 105 (01) :397-411
[24]   Regularization and variable selection via the elastic net [J].
Zou, H ;
Hastie, T .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2005, 67 :301-320
[25]   One-step sparse estimates in nonconcave penalized likelihood models [J].
Zou, Hui ;
Li, Runze .
ANNALS OF STATISTICS, 2008, 36 (04) :1509-1533
[26]   The adaptive lasso and its oracle properties [J].
Zou, Hui .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2006, 101 (476) :1418-1429