From screening to variable selection by an iterative nonparametric procedure based on derivatives

被引:0
作者
Giordano, Francesco [1 ]
Milito, Sara [1 ]
Parrella, Maria Lucia [1 ]
机构
[1] Univ Salerno, Dept Econ & Stat, Via Giovanni Paolo II 132, I-84084 Fisciano, SA, Italy
关键词
High dimension; Variable selection; Nonparametric regression models; Variable screening; Iterative procedure; REGRESSION; MODELS;
D O I
10.1007/s00362-025-01700-2
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
To identify the true relevant predictors in a nonparametric regression model with ultra-high-dimensional data, we propose a general iterative procedure that is able to transform a screening procedure into a variable selection one, without requiring the number of true relevant variables to be finite. This fully nonparametric procedure is based on a novel combination of DELSIS (a model-free variable screening method based on empirical likelihood and derivative estimation) and PenGAM (a penalised variable selection method based on splines). The main advantage of the new proposal is its robustness to the presence of correlation among the predictors and its capability to correctly identify the whole set of relevant covariates, including those marginally uncorrelated but jointly related to the response ("hidden covariates") and those with low signal ("weak covariates"), much better than the main alternative approaches. From a theoretical point of view, we show the consistency of the proposal and its faster estimation rate with respect to the classical penalised approaches. Finally, we illustrate the performance of the new procedure through some simulations and an empirical analysis.
引用
收藏
页数:35
相关论文
共 13 条
[1]  
Benson J, 2021, V1, DOI [10.17632/zzg4kfj785.1, 10.17632/zzg4kfj785.1, DOI 10.17632/ZZG4KFJ785.1]
[2]   LOCAL INDEPENDENCE FEATURE SCREENING FOR NONPARAMETRIC AND SEMIPARAMETRIC MODELS BY MARGINAL EMPIRICAL LIKELIHOOD [J].
Chang, Jinyuan ;
Tang, Cheng Yong ;
Wu, Yichao .
ANNALS OF STATISTICS, 2016, 44 (02) :515-539
[3]  
Fan J., 1996, Local Polynomial Modelling and Its Applications, DOI DOI 10.1201/9780203748725
[4]   Sure independence screening for ultrahigh dimensional feature space [J].
Fan, Jianqing ;
Lv, Jinchi .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2008, 70 :849-883
[5]   Nonparametric Independence Screening in Sparse Ultra-High-Dimensional Additive Models [J].
Fan, Jianqing ;
Feng, Yang ;
Song, Rui .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2011, 106 (494) :544-557
[6]   Variable selection via nonconcave penalized likelihood and its oracle properties [J].
Fan, JQ ;
Li, RZ .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2001, 96 (456) :1348-1360
[7]   A nonparametric procedure for linear and nonlinear variable screening [J].
Giordano, F. ;
Milito, S. ;
Parrella, M. L. .
JOURNAL OF NONPARAMETRIC STATISTICS, 2022, 34 (04) :859-894
[8]   Component selection and smoothing in multivariate nonparametric regression [J].
Lin, Yi ;
Zhang, Hao Helen .
ANNALS OF STATISTICS, 2006, 34 (05) :2272-2297
[9]   HIGH-DIMENSIONAL ADDITIVE MODELING [J].
Meier, Lukas ;
van de Geer, Sara ;
Buehlmann, Peter .
ANNALS OF STATISTICS, 2009, 37 (6B) :3779-3821
[10]  
OWEN A., 2001, EMPIRICAL LIKELIHOOD