A subsampling approach for Bayesian model selection

被引:1
作者
Lachmann, Jon [1 ]
Storvik, Geir [2 ]
Frommlet, Florian [3 ]
Hubin, Aliaksandr [2 ,4 ]
机构
[1] Stockholm Univ, Dept Stat, Stockholm, Sweden
[2] Univ Oslo, Dept Math, Oslo, Norway
[3] Med Univ Vienna, CEMSIIS, Vienna, Austria
[4] Univ Oslo, Moltke Moes Vei 35, N-0851 Oslo, Norway
关键词
Bayesian model selection; Bayesian model averaging; Subsampling; MCMC; Tall data; CONVERGENCE; LIKELIHOOD; OPTIMIZATION; INFERENCE; MCMC;
D O I
10.1016/j.ijar.2022.08.018
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
It is common practice to use Laplace approximations to decrease the computational burden when computing the marginal likelihoods in Bayesian versions of generalised linear models (GLM). Marginal likelihoods combined with model priors are then used in different search algorithms to compute the posterior marginal probabilities of models and individual covariates. This allows performing Bayesian model selection and model averaging. For large sample sizes, even the Laplace approximation becomes computationally challenging because the optimisation routine involved needs to evaluate the likelihood on the full dataset in multiple iterations. As a consequence, the algorithm is not scalable for large datasets. To address this problem, we suggest using stochastic optimisation approaches, which only use a subsample of the data for each iteration. We combine stochastic optimisation with Markov chain Monte Carlo (MCMC) based methods for Bayesian model selection and provide some theoretical results on the convergence of the estimates for the resulting time-inhomogeneous MCMC. Finally, we report results from experiments illustrating the performance of the proposed algorithm.(c) 2022 The Author(s). Published by Elsevier Inc. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
引用
收藏
页码:33 / 63
页数:31
相关论文
共 47 条
  • [1] Agarwal N, 2017, J MACH LEARN RES, V18
  • [2] Ando T., 2010, BAYESIAN MODEL SELEC
  • [3] THE PSEUDO-MARGINAL APPROACH FOR EFFICIENT MONTE CARLO COMPUTATIONS
    Andrieu, Christophe
    Roberts, Gareth O.
    [J]. ANNALS OF STATISTICS, 2009, 37 (02) : 697 - 725
  • [4] [Anonymous], 2008, Advances in neural information processing systems
  • [5] Optimal predictive model selection
    Barbieri, MM
    Berger, JO
    [J]. ANNALS OF STATISTICS, 2004, 32 (03) : 870 - 897
  • [6] FITTING OF POWER-SERIES, MEANING POLYNOMIALS, ILLUSTRATED ON BAND-SPECTROSCOPIC DATA
    BEATON, AE
    TUKEY, JW
    [J]. TECHNOMETRICS, 1974, 16 (02) : 147 - 185
  • [7] Metaheuristics in combinatorial optimization: Overview and conceptual comparison
    Blum, C
    Roli, A
    [J]. ACM COMPUTING SURVEYS, 2003, 35 (03) : 268 - 308
  • [8] Optimization Methods for Large-Scale Machine Learning
    Bottou, Leon
    Curtis, Frank E.
    Nocedal, Jorge
    [J]. SIAM REVIEW, 2018, 60 (02) : 223 - 311
  • [9] A STOCHASTIC QUASI-NEWTON METHOD FOR LARGE-SCALE OPTIMIZATION
    Byrd, R. H.
    Hansen, S. L.
    Nocedal, Jorge
    Singer, Y.
    [J]. SIAM JOURNAL ON OPTIMIZATION, 2016, 26 (02) : 1008 - 1031
  • [10] Cauchy A., 1847, C.R. Hebd. Seances Sci. Paris, V25, P536