Coupling-based convergence assessment of some Gibbs samplers for high-dimensional Bayesian regression with shrinkage priors

Cited by: 5

Authors:

Biswas, Niloy [1]

Bhattacharya, Anirban [2]

Jacob, Pierre E. [3]

Johndrow, James E. [4]

Affiliations:

[1] Harvard Univ, Cambridge, MA 02138 USA

[2] Texas A&M Univ, College Stn, TX USA

[3] ESSEC Business Sch, Cergy Pontoise, France

[4] Univ Penn, Wharton Sch, Philadelphia, PA 19104 USA

Source:

JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY | 2022, Volume 84, Issue 03

Funding:

US National Science Foundation;

Keywords:

Bayesian inference; couplings; Gibbs sampling; Horseshoe prior; parallel computation; CHAIN MONTE-CARLO; GEOMETRIC ERGODICITY; HORSESHOE ESTIMATOR; PRIOR DISTRIBUTIONS; VARIABLE SELECTION; REGULARIZATION; COMPLEXITY; RATES;

DOI:

10.1111/rssb.12495

Chinese Library Classification:

O21 [Probability and Mathematical Statistics]; C8 [Statistics];

Discipline classification codes:

020208 ; 070103 ; 0714 ;

Abstract:

We consider Markov chain Monte Carlo (MCMC) algorithms for Bayesian high-dimensional regression with continuous shrinkage priors. A common challenge with these algorithms is the choice of the number of iterations to perform. This is critical when each iteration is expensive, as is the case when dealing with modern data sets, such as genome-wide association studies with thousands of rows and up to hundreds of thousands of columns. We develop coupling techniques tailored to the setting of high-dimensional regression with shrinkage priors, which enable practical, non-asymptotic diagnostics of convergence without relying on traceplots or long-run asymptotics. By establishing geometric drift and minorization conditions for the algorithm under consideration, we prove that the proposed couplings have finite expected meeting time. Focusing on a class of shrinkage priors which includes the 'Horseshoe', we empirically demonstrate the scalability of the proposed couplings. A highlight of our findings is that less than 1000 iterations can be enough for a Gibbs sampler to reach stationarity in a regression on 100,000 covariates. The numerical results also illustrate the impact of the prior on the computational efficiency of the coupling, and suggest the use of priors where the local precisions are Half-t distributed with degree of freedom larger than one.

Pages: 973-996

Page count: 24
