An Approximated Collapsed Variational Bayes Approach to Variable Selection in Linear Regression

被引:1
|
作者
You, Chong [1 ,2 ]
Ormerod, John T. [3 ]
Li, Xiangyang [4 ,5 ]
Pang, Cheng Heng [6 ]
Zhou, Xiao-Hua [1 ,2 ,7 ]
机构
[1] Peking Univ, Beijing Int Ctr Math Res, Beijing, Peoples R China
[2] Peking Univ, Chongqing Big Data Res Inst, Chongqing, Peoples R China
[3] Univ Sydney, Sch Math & Stat, Sydney, Australia
[4] Peking Univ, Sch Math Sci, Beijing, Peoples R China
[5] Peking Univ, Ctr Stat Sci, Beijing, Peoples R China
[6] Univ Nottingham Ningbo China, Fac Sci & Engn, Ningbo, Peoples R China
[7] Peking Univ, Sch Publ Hlth, Dept Biostat, Beijing, Peoples R China
基金
澳大利亚研究理事会;
关键词
Collapsed Gibbs sampling; Consistency; Markov chain Monte Carlo; NONCONCAVE PENALIZED LIKELIHOOD; STOCHASTIC SEARCH; PRIORS;
D O I
10.1080/10618600.2022.2149539
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
In this work, we propose a novel approximated collapsed variational Bayes approach to model selection in linear regression. The approximated collapsed variational Bayes algorithm offers improvements over mean field variational Bayes by marginalizing over a subset of parameters and using mean field variational Bayes over the remaining parameters in an analogous fashion to collapsed Gibbs sampling. We have shown that the proposed algorithm, under typical regularity assumptions, (a) includes variables in the true underlying model at an exponential rate in the sample size, or (b) excludes the variables at least at the first order rate in the sample size if the variables are not in the true model. Simulation studies show that the performance of the proposed method is close to that of a particular Markov chain Monte Carlo sampler and a path search based variational Bayes algorithm, but requires an order of magnitude less time. The proposed method is also highly competitive with penalized methods, expectation propagation, stepwise AIC/BIC, BMS, and EMVS under various settings. for the article are available online.
引用
收藏
页码:782 / 792
页数:11
相关论文
共 50 条
  • [41] Variable selection for linear regression models with random covariates
    Nkiet, GM
    COMPTES RENDUS DE L ACADEMIE DES SCIENCES SERIE I-MATHEMATIQUE, 2001, 333 (12): : 1105 - 1110
  • [42] Improved variable selection procedure for multivariate linear regression
    Walmsley, AD
    ANALYTICA CHIMICA ACTA, 1997, 354 (1-3) : 225 - 232
  • [43] A robust and efficient variable selection method for linear regression
    Yang, Zhuoran
    Fu, Liya
    Wang, You-Gan
    Dong, Zhixiong
    Jiang, Yunlu
    JOURNAL OF APPLIED STATISTICS, 2022, 49 (14) : 3677 - 3692
  • [44] Robust Bayesian nonparametric variable selection for linear regression
    Cabezas, Alberto
    Battiston, Marco
    Nemeth, Christopher
    STAT, 2024, 13 (02):
  • [45] Bayesian structured variable selection in linear regression models
    Min Wang
    Xiaoqian Sun
    Tao Lu
    Computational Statistics, 2015, 30 : 205 - 229
  • [46] Variable selection in semiparametric linear regression with censored data
    Johnson, Brent A.
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2008, 70 : 351 - 370
  • [47] An RKHS model for variable selection in functional linear regression
    Berrendero, Jose R.
    Bueno-Larraz, Beatriz
    Cuevas, Antonio
    JOURNAL OF MULTIVARIATE ANALYSIS, 2019, 170 : 25 - 45
  • [48] Bootstrapping multiple linear regression after variable selection
    Lasanthi C. R. Pelawa Watagoda
    David J. Olive
    Statistical Papers, 2021, 62 : 681 - 700
  • [49] Variable selection in linear regression based on ridge estimator
    Dorugade, A. V.
    Kashid, D. N.
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2010, 80 (11) : 1211 - 1224
  • [50] Non-linear variable selection in a regression context
    Hill, Simon I.
    PROCEEDINGS OF THE 5TH INTERNATIONAL SYMPOSIUM ON IMAGE AND SIGNAL PROCESSING AND ANALYSIS, 2007, : 441 - 445