Coupling-based convergence assessment of some Gibbs samplers for high-dimensional Bayesian regression with shrinkage priors

被引:5
作者
Biswas, Niloy [1 ]
Bhattacharya, Anirban [2 ]
Jacob, Pierre E. [3 ]
Johndrow, James E. [4 ]
机构
[1] Harvard Univ, Cambridge, MA 02138 USA
[2] Texas A&M Univ, College Stn, TX USA
[3] ESSEC Business Sch, Cergy Pontoise, France
[4] Univ Penn, Wharton Sch, Philadelphia, PA 19104 USA
基金
美国国家科学基金会;
关键词
Bayesian inference; couplings; Gibbs sampling; Horseshoe prior; parallel computation; CHAIN MONTE-CARLO; GEOMETRIC ERGODICITY; HORSESHOE ESTIMATOR; PRIOR DISTRIBUTIONS; VARIABLE SELECTION; REGULARIZATION; COMPLEXITY; RATES;
D O I
10.1111/rssb.12495
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We consider Markov chain Monte Carlo (MCMC) algorithms for Bayesian high-dimensional regression with continuous shrinkage priors. A common challenge with these algorithms is the choice of the number of iterations to perform. This is critical when each iteration is expensive, as is the case when dealing with modern data sets, such as genome-wide association studies with thousands of rows and up to hundreds of thousands of columns. We develop coupling techniques tailored to the setting of high-dimensional regression with shrinkage priors, which enable practical, non-asymptotic diagnostics of convergence without relying on traceplots or long-run asymptotics. By establishing geometric drift and minorization conditions for the algorithm under consideration, we prove that the proposed couplings have finite expected meeting time. Focusing on a class of shrinkage priors which includes the 'Horseshoe', we empirically demonstrate the scalability of the proposed couplings. A highlight of our findings is that less than 1000 iterations can be enough for a Gibbs sampler to reach stationarity in a regression on 100,000 covariates. The numerical results also illustrate the impact of the prior on the computational efficiency of the coupling, and suggest the use of priors where the local precisions are Half-t distributed with degree of freedom larger than one.
引用
收藏
页码:973 / 996
页数:24
相关论文
共 67 条
  • [1] BAYESIAN-ANALYSIS OF BINARY AND POLYCHOTOMOUS RESPONSE DATA
    ALBERT, JH
    CHIB, S
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1993, 88 (422) : 669 - 679
  • [2] Approximate Spectral Gaps for Markov Chain Mixing Times in High Dimensions
    Atchade, Yves F.
    [J]. SIAM JOURNAL ON MATHEMATICS OF DATA SCIENCE, 2021, 3 (03): : 854 - 872
  • [3] One-shot CFTP; Application to a class of truncated Gaussian densities
    Beskos, A
    Roberts, G
    [J]. METHODOLOGY AND COMPUTING IN APPLIED PROBABILITY, 2005, 7 (04) : 407 - 437
  • [4] Lasso Meets Horseshoe: A Survey
    Bhadra, Anindya
    Datta, Jyotishka
    Polson, Nicholas G.
    Willard, Brandon
    [J]. STATISTICAL SCIENCE, 2019, 34 (03) : 405 - 427
  • [5] Bhattacharya A., 2021, WILEY STATSREF STAT, V1, P1, DOI [10.1002/9781118445112.stat08292, DOI 10.1002/9781118445112.STAT08292]
  • [6] Fast sampling with Gaussian scale mixture priors in high-dimensional regression
    Bhattacharya, Anirban
    Chakraborty, Antik
    Mallick, Bani K.
    [J]. BIOMETRIKA, 2016, 103 (04) : 985 - 991
  • [7] Dirichlet-Laplace Priors for Optimal Shrinkage
    Bhattacharya, Anirban
    Pati, Debdeep
    Pillai, Natesh S.
    Dunson, David B.
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2015, 110 (512) : 1479 - 1490
  • [8] Geometric ergodicity of Gibbs samplers for the Horseshoe and its regularized variants
    Bhattacharya, Suman
    Khare, Kshitij
    Pal, Subhadip
    [J]. ELECTRONIC JOURNAL OF STATISTICS, 2022, 16 (01): : 1 - 57
  • [9] Biswas N., 2019, ADV NEURAL INFORM PR, P7389
  • [10] Two-scale coupling for preconditioned Hamiltonian Monte Carlo in infinite dimensions
    Bou-Rabee, Nawaf
    Eberle, Andreas
    [J]. STOCHASTICS AND PARTIAL DIFFERENTIAL EQUATIONS-ANALYSIS AND COMPUTATIONS, 2021, 9 (01): : 207 - 242