Structural identification and variable selection in high-dimensional varying-coefficient models

被引:6
作者
Chen, Yuping [1 ]
Bai, Yang [1 ,2 ]
Fung, Wingkam [3 ]
机构
[1] Shanghai Univ Finance & Econ, Sch Stat & Management, Shanghai, Peoples R China
[2] SUFE, Minist Educ China, Key Lab Math Econ, Shanghai, Peoples R China
[3] Univ Hong Kong, Dept Stat & Actuarial Sci, Hong Kong, Hong Kong, Peoples R China
关键词
Varying-coefficient models; structural selection; modified basis expansion; group selection; coordinate descent; NONCONCAVE PENALIZED LIKELIHOOD; EFFICIENT ESTIMATION; SPLINE ESTIMATION; REGRESSION; INFERENCE; LASSO;
D O I
10.1080/10485252.2017.1303057
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Varying-coefficient models have been widely used to investigate the possible time-dependent effects of covariates when the response variable comes from normal distribution. Much progress has been made for inference and variable selection in the framework of such models. However, the identification of model structure, that is how to identify which covariates have time-varying effects and which have fixed effects, remains a challenging and unsolved problem especially when the dimension of covariates is much larger than the sample size. In this article, we consider the structural identification and variable selection problems in varying-coefficient models for high-dimensional data. Using a modified basis expansion approach and group variable selection methods, we propose a unified procedure to simultaneously identify the model structure, select important variables and estimate the coefficient curves. The unique feature of the proposed approach is that we do not have to specify the model structure in advance, therefore, it is more realistic and appropriate for real data analysis. Asymptotic properties of the proposed estimators have been derived under regular conditions. Furthermore, we evaluate the finite sample performance of the proposed methods with Monte Carlo simulation studies and a real data analysis.
引用
收藏
页码:258 / 279
页数:22
相关论文
共 53 条
[1]   NEW LOOK AT STATISTICAL-MODEL IDENTIFICATION [J].
AKAIKE, H .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1974, AC19 (06) :716-723
[2]  
[Anonymous], 2006, Journal of the Royal Statistical Society, Series B
[3]  
Antoniadis A., 2013, STAT PAP, V55, P1
[4]   Group descent algorithms for nonconvex penalized linear and logistic regression models with grouped predictors [J].
Breheny, Patrick ;
Huang, Jian .
STATISTICS AND COMPUTING, 2015, 25 (02) :173-187
[5]   COORDINATE DESCENT ALGORITHMS FOR NONCONVEX PENALIZED REGRESSION, WITH APPLICATIONS TO BIOLOGICAL FEATURE SELECTION [J].
Breheny, Patrick ;
Huang, Jian .
ANNALS OF APPLIED STATISTICS, 2011, 5 (01) :232-253
[6]   Efficient estimation and inferences for varying-coefficient models [J].
Cai, ZW ;
Fan, JQ ;
Li, RZ .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2000, 95 (451) :888-902
[7]   Extended Bayesian information criteria for model selection with large model spaces [J].
Chen, Jiahua ;
Chen, Zehua .
BIOMETRIKA, 2008, 95 (03) :759-771
[8]   Forward Variable Selection for Sparse Ultra-High Dimensional Varying Coefficient Models [J].
Cheng, Ming-Yen ;
Honda, Toshio ;
Zhang, Jin-Ting .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2016, 111 (515) :1209-1221
[9]   NONPARAMETRIC INDEPENDENCE SCREENING AND STRUCTURE IDENTIFICATION FOR ULTRA-HIGH DIMENSIONAL LONGITUDINAL DATA [J].
Cheng, Ming-Yen ;
Honda, Toshio ;
Li, Jialiang ;
Peng, Heng .
ANNALS OF STATISTICS, 2014, 42 (05) :1819-1849
[10]   Smoothing spline estimation for varying coefficient models with repeatedly measured dependent variables [J].
Chiang, CT ;
Rice, JA ;
Wu, CO .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2001, 96 (454) :605-619