Variational Bayes for High-Dimensional Linear Regression With Sparse Priors

被引:48
作者
Ray, Kolyan [1 ]
Szabo, Botond [2 ]
机构
[1] Imperial Coll London, Dept Math, London, England
[2] Vrije Univ Amsterdam, Dept Math, De Boelelaan 1105, NL-1081 HV Amsterdam, Netherlands
关键词
Model selection; Oracle inequalities; Sparsity; Spike-and-slab prior; Variational Bayes; EMPIRICAL BAYES; VARIABLE SELECTION; POSTERIOR CONCENTRATION; CONVERGENCE-RATES; INFERENCE; NEEDLES; STRAW; SPIKE;
D O I
10.1080/01621459.2020.1847121
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We study a mean-field spike and slab variational Bayes (VB) approximation to Bayesian model selection priors in sparse high-dimensional linear regression. Under compatibility conditions on the design matrix, oracle inequalities are derived for the mean-field VB approximation, implying that it converges to the sparse truth at the optimal rate and gives optimal prediction of the response vector. The empirical performance of our algorithm is studied, showing that it works comparably well as other state-of-the-art Bayesian variable selection methods. We also numerically demonstrate that the widely used coordinate-ascent variational inference algorithm can be highly sensitive to the parameter updating order, leading to potentially poor performance. To mitigate this, we propose a novel prioritized updating scheme that uses a data-driven updating order and performs better in simulations. The variational algorithm is implemented in the R package sparsevb. for this article are available online.
引用
收藏
页码:1270 / 1281
页数:12
相关论文
共 46 条
[1]   CONCENTRATION OF TEMPERED POSTERIORS AND OF THEIR VARIATIONAL APPROXIMATIONS [J].
Alquier, Pierre ;
Ridgway, James .
ANNALS OF STATISTICS, 2020, 48 (03) :1475-1497
[2]  
[Anonymous], 2016, ARXIV160207640
[3]  
Banerjee S., 2020, SURVEY PAPER BAYESIA
[4]   EMPIRICAL BAYES ORACLE UNCERTAINTY QUANTIFICATION FOR REGRESSION [J].
Belitser, Eduard ;
Ghosal, Subhashis .
ANNALS OF STATISTICS, 2020, 48 (06) :3113-3137
[5]   Needles and straw in a haystack: Robust confidence for possibly sparse sequences [J].
Belitser, Eduard ;
Nurushev, Nurzhan .
BERNOULLI, 2020, 26 (01) :191-225
[6]   Variational Inference: A Review for Statisticians [J].
Blei, David M. ;
Kucukelbir, Alp ;
McAuliffe, Jon D. .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2017, 112 (518) :859-877
[7]   Latent Dirichlet allocation [J].
Blei, DM ;
Ng, AY ;
Jordan, MI .
JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) :993-1022
[8]  
Boucheron Stephane, 2013, Concentration inequalities: A nonasymptotic theory of independence, DOI 10.1093/acprof:oso/9780199535255.001.0001
[9]  
Bühlmann P, 2011, SPRINGER SER STAT, P1, DOI 10.1007/978-3-642-20192-9
[10]   Scalable Variational Inference for Bayesian Variable Selection in Regression, and Its Accuracy in Genetic Association Studies [J].
Carbonetto, Peter ;
Stephens, Matthew .
BAYESIAN ANALYSIS, 2012, 7 (01) :73-107