Model selection for generalized linear models with weak factors

被引:0
作者
Zhou, Xin [1 ]
Dong, Yan [1 ]
Yu, Qin [1 ]
Zheng, Zemin [1 ]
机构
[1] Univ Sci & Technol China, Int Inst Finance, Sch Management, Hefei 230026, Peoples R China
关键词
correlated covariates; generalized linear models; high dimensionality; model selection consistency; weak factors; VARIABLE SELECTION; REGRESSION; NUMBER;
D O I
10.1002/sta4.697
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
The literature has witnessed an upsurge of interest in model selection in diverse fields and optimization applications. Despite the substantial progress, model selection remains a significant challenge when covariates are highly correlated, particularly within economic and financial datasets that exhibit cross-sectional and serial dependency. In this paper, we introduce a novel methodology named factor augmented regularized model selection with weak factors (WeakFARM) for generalized linear models in the presence of correlated covariates with weak latent factor structure. By identifying weak latent factors and idiosyncratic components and employing them as predictors, WeakFARM converts the challenge from model selection with highly correlated covariates to that with weakly correlated ones. Furthermore, we develop a variable screening method based on the proposed WeakFARM method. Comprehensive theoretical guarantees including estimation consistency, model selection consistency and sure screening property are also provided. We demonstrate the effectiveness of our approach by extensive simulation studies and a real data application in economic forecasting.
引用
收藏
页数:14
相关论文
共 31 条
[1]   Determining the number of factors in approximate factor models [J].
Bai, JS ;
Ng, S .
ECONOMETRICA, 2002, 70 (01) :191-221
[2]   Approximate factor models with weaker loadings [J].
Bai, Jushan ;
Ng, Serena .
JOURNAL OF ECONOMETRICS, 2023, 235 (02) :1893-1916
[3]   Principal components estimation and identification of static factors [J].
Bai, Jushan ;
Ng, Serena .
JOURNAL OF ECONOMETRICS, 2013, 176 (01) :18-29
[4]   High dimensional stochastic regression with latent factors, endogeneity and nonlinearity [J].
Chang, Jinyuan ;
Guo, Bin ;
Yao, Qiwei .
JOURNAL OF ECONOMETRICS, 2015, 189 (02) :297-312
[5]  
Chen LK, 2018, J MACH LEARN RES, V18
[6]  
Chi CM, 2025, Arxiv, DOI arXiv:2112.09851
[7]  
Dobson A.J., 2002, INTRO GEN LINEAR MOD, V2nd
[8]   Sure independence screening for ultrahigh dimensional feature space [J].
Fan, Jianqing ;
Lv, Jinchi .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2008, 70 :849-883
[9]   Factor-adjusted regularized model selection [J].
Fan, Jianqing ;
Ke, Yuan ;
Wang, Kaizheng .
JOURNAL OF ECONOMETRICS, 2020, 216 (01) :71-85
[10]   Large covariance estimation by thresholding principal orthogonal complements [J].
Fan, Jianqing ;
Liao, Yuan ;
Mincheva, Martina .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2013, 75 (04) :603-680