A subsampling approach for Bayesian model selection

被引：1

作者：

Lachmann, Jon ^{[1
]}

Storvik, Geir ^{[2
]}

Frommlet, Florian ^{[3
]}

Hubin, Aliaksandr ^{[2
,4
]}

机构：

[1] Stockholm Univ, Dept Stat, Stockholm, Sweden

[2] Univ Oslo, Dept Math, Oslo, Norway

[3] Med Univ Vienna, CEMSIIS, Vienna, Austria

[4] Univ Oslo, Moltke Moes Vei 35, N-0851 Oslo, Norway

来源：

INTERNATIONAL JOURNAL OF APPROXIMATE REASONING | 2022年 / 151卷

关键词：

Bayesian model selection; Bayesian model averaging; Subsampling; MCMC; Tall data; CONVERGENCE; LIKELIHOOD; OPTIMIZATION; INFERENCE; MCMC;

D O I：

10.1016/j.ijar.2022.08.018

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

It is common practice to use Laplace approximations to decrease the computational burden when computing the marginal likelihoods in Bayesian versions of generalised linear models (GLM). Marginal likelihoods combined with model priors are then used in different search algorithms to compute the posterior marginal probabilities of models and individual covariates. This allows performing Bayesian model selection and model averaging. For large sample sizes, even the Laplace approximation becomes computationally challenging because the optimisation routine involved needs to evaluate the likelihood on the full dataset in multiple iterations. As a consequence, the algorithm is not scalable for large datasets. To address this problem, we suggest using stochastic optimisation approaches, which only use a subsample of the data for each iteration. We combine stochastic optimisation with Markov chain Monte Carlo (MCMC) based methods for Bayesian model selection and provide some theoretical results on the convergence of the estimates for the resulting time-inhomogeneous MCMC. Finally, we report results from experiments illustrating the performance of the proposed algorithm.(c) 2022 The Author(s). Published by Elsevier Inc. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).

引用

页码：33 / 63

页数：31

共 47 条

[1] Agarwal N, 2017, J MACH LEARN RES, V18
[2] Ando T., 2010, BAYESIAN MODEL SELEC
[3] THE PSEUDO-MARGINAL APPROACH FOR EFFICIENT MONTE CARLO COMPUTATIONS
Andrieu, Christophe
Roberts, Gareth O.
[J]. ANNALS OF STATISTICS, 2009, 37 (02) : 697 - 725
[4] [Anonymous], 2008, Advances in neural information processing systems
[5] Optimal predictive model selection
Barbieri, MM
Berger, JO
[J]. ANNALS OF STATISTICS, 2004, 32 (03) : 870 - 897
[6] FITTING OF POWER-SERIES, MEANING POLYNOMIALS, ILLUSTRATED ON BAND-SPECTROSCOPIC DATA
BEATON, AE
TUKEY, JW
[J]. TECHNOMETRICS, 1974, 16 (02) : 147 - 185
[7] Metaheuristics in combinatorial optimization: Overview and conceptual comparison
Blum, C
Roli, A
[J]. ACM COMPUTING SURVEYS, 2003, 35 (03) : 268 - 308
[8] Optimization Methods for Large-Scale Machine Learning
Bottou, Leon
Curtis, Frank E.
Nocedal, Jorge
[J]. SIAM REVIEW, 2018, 60 (02) : 223 - 311
[9] A STOCHASTIC QUASI-NEWTON METHOD FOR LARGE-SCALE OPTIMIZATION
Byrd, R. H.
Hansen, S. L.
Nocedal, Jorge
Singer, Y.
[J]. SIAM JOURNAL ON OPTIMIZATION, 2016, 26 (02) : 1008 - 1031
[10] Cauchy A., 1847, C.R. Hebd. Seances Sci. Paris, V25, P536

← 1 2 3 4 5 →