Stochastic gradient Hamiltonian Monte Carlo with variance reduction for Bayesian inference

被引:0
作者
Zhize Li
Tianyi Zhang
Shuyu Cheng
Jun Zhu
Jian Li
机构
[1] Tsinghua University,
来源
Machine Learning | 2019年 / 108卷
关键词
Hamiltonian Monte Carlo; Variance reduction; Bayesian inference;
D O I
暂无
中图分类号
学科分类号
摘要
Gradient-based Monte Carlo sampling algorithms, like Langevin dynamics and Hamiltonian Monte Carlo, are important methods for Bayesian inference. In large-scale settings, full-gradients are not affordable and thus stochastic gradients evaluated on mini-batches are used as a replacement. In order to reduce the high variance of noisy stochastic gradients, Dubey et al. (in: Advances in neural information processing systems, pp 1154–1162, 2016) applied the standard variance reduction technique on stochastic gradient Langevin dynamics and obtained both theoretical and experimental improvements. In this paper, we apply the variance reduction tricks on Hamiltonian Monte Carlo and achieve better theoretical convergence results compared with the variance-reduced Langevin dynamics. Moreover, we apply the symmetric splitting scheme in our variance-reduced Hamiltonian Monte Carlo algorithms to further improve the theoretical results. The experimental results are also consistent with the theoretical results. As our experiment shows, variance-reduced Hamiltonian Monte Carlo demonstrates better performance than variance-reduced Langevin dynamics in Bayesian regression and classification tasks on real-world datasets.
引用
收藏
页码:1701 / 1727
页数:26
相关论文
共 11 条
[1]  
Byrne S(2013)Geodesic Monte Carlo on embedded manifolds Scandinavian Journal of Statistics 40 825-845
[2]  
Girolami M(1987)Hybrid Monte Carlo Physics Letters B 195 216-222
[3]  
Duane S(2011)Riemann manifold langevin and Hamiltonian Monte Carlo methods Journal of the Royal Statistical Society: Series B (Statistical Methodology) 73 123-214
[4]  
Kennedy AD(2016)Adaptive thermostats for noisy gradient systems SIAM Journal on Scientific Computing 38 A712-A736
[5]  
Pendleton BJ(2011)MCMC using hamiltonian dynamics Handbook of Markov Chain Monte Carlo 2 139-188
[6]  
Roweth D(undefined)undefined undefined undefined undefined-undefined
[7]  
Girolami M(undefined)undefined undefined undefined undefined-undefined
[8]  
Calderhead B(undefined)undefined undefined undefined undefined-undefined
[9]  
Leimkuhler B(undefined)undefined undefined undefined undefined-undefined
[10]  
Shang X(undefined)undefined undefined undefined undefined-undefined