High-dimensional linear mixed model selection by partial correlation

被引:3
|
作者
Alabiso, Audry [1 ]
Shang, Junfeng [2 ]
机构
[1] Progress Casualty Insurance, Private Passenger Auto, Mayfield, OH USA
[2] Bowling Green State Univ, Dept Math & Stat, Bowling Green, OH 43403 USA
关键词
Mixed model selection; partial correlation; linear mixed models; VARIABLE SELECTION;
D O I
10.1080/03610926.2022.2028838
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We wish to perform variable selection in high-dimensional linear mixed models where the number of the potential covariates is much larger than the sample size and where the random effects are utilized to describe correlated observations. We propose a variable selection procedure based on the Thresholded Partial Correlation (TPC) algorithm (Li, Liu, and Lou 2017) to conduct variable selection using the partial correlation between the covariates and the response variable conditional on the random effects, and this procedure is called the conditional Thresholded Partial Correlation, denoted by TPCc. This TPCc approach is able to select the fixed effects in high-dimensional data when the covariates are highly correlated. We investigate the performance of the proposed method (TPCc) in a variety of simulated high-dimensional data sets. The simulation results show that the TPCc outperforms the TPC in selecting the most appropriate model among the candidate pool in the mixed modeling setting. We also apply the proposed method to a real high-dimensional data set in the production of riboflavin.
引用
收藏
页码:6355 / 6380
页数:26
相关论文
共 50 条
  • [31] Fixed Effects Testing in High-Dimensional Linear Mixed Models
    Bradic, Jelena
    Claeskens, Gerda
    Gueuning, Thomas
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2020, 115 (532) : 1835 - 1850
  • [32] Variational Bayesian Inference in High-Dimensional Linear Mixed Models
    Yi, Jieyi
    Tang, Niansheng
    MATHEMATICS, 2022, 10 (03)
  • [33] Persistence in high-dimensional linear predictor selection and the virtue of overparametrization
    Greenshtein, E
    Ritov, Y
    BERNOULLI, 2004, 10 (06) : 971 - 988
  • [34] Scalable Algorithms for Learning High-Dimensional Linear Mixed Models
    Tan, Zilong
    Roche, Kimberly
    Zhou, Xiang
    Mukherjee, Sayan
    UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, 2018, : 259 - 268
  • [35] The sparsity and bias of the lasso selection in high-dimensional linear regression
    Zhang, Cun-Hui
    Huang, Jian
    ANNALS OF STATISTICS, 2008, 36 (04): : 1567 - 1594
  • [36] Variable selection in high-dimensional double generalized linear models
    Xu, Dengke
    Zhang, Zhongzhan
    Wu, Liucang
    STATISTICAL PAPERS, 2014, 55 (02) : 327 - 347
  • [37] Variable selection in high-dimensional partly linear additive models
    Lian, Heng
    JOURNAL OF NONPARAMETRIC STATISTICS, 2012, 24 (04) : 825 - 839
  • [38] Variable selection in high-dimensional double generalized linear models
    Dengke Xu
    Zhongzhan Zhang
    Liucang Wu
    Statistical Papers, 2014, 55 : 327 - 347
  • [39] Group selection in high-dimensional partially linear additive models
    Wei, Fengrong
    BRAZILIAN JOURNAL OF PROBABILITY AND STATISTICS, 2012, 26 (03) : 219 - 243
  • [40] Doubly regularized estimation and selection in linear mixed-effects models for high-dimensional longitudinal data
    Li, Yun
    Wang, Sijian
    Song, Peter X-K
    Wang, Naisyin
    Zhou, Ling
    Zhu, Ji
    STATISTICS AND ITS INTERFACE, 2018, 11 (04) : 721 - 737