Variational Hybrid Monte Carlo for Efficient Multi-Modal Data Sampling

被引:3
作者
Sun, Shiliang [1 ]
Zhao, Jing [1 ]
Gu, Minghao [1 ]
Wang, Shanhu [1 ]
机构
[1] East China Normal Univ, Sch Comp Sci & Technol, Shanghai 200062, Peoples R China
关键词
Markov chain Monte Carlo; Hamiltonian Monte Carlo; Langevin dynamics; multi-modal sampling; variational distribution;
D O I
10.3390/e25040560
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
The Hamiltonian Monte Carlo (HMC) sampling algorithm exploits Hamiltonian dynamics to construct efficient Markov Chain Monte Carlo (MCMC), which has become increasingly popular in machine learning and statistics. Since HMC uses the gradient information of the target distribution, it can explore the state space much more efficiently than random-walk proposals, but may suffer from high autocorrelation. In this paper, we propose Langevin Hamiltonian Monte Carlo (LHMC) to reduce the autocorrelation of the samples. Probabilistic inference involving multi-modal distributions is very difficult for dynamics-based MCMC samplers, which is easily trapped in the mode far away from other modes. To tackle this issue, we further propose a variational hybrid Monte Carlo (VHMC) which uses a variational distribution to explore the phase space and find new modes, and it is capable of sampling from multi-modal distributions effectively. A formal proof is provided that shows that the proposed method can converge to target distributions. Both synthetic and real datasets are used to evaluate its properties and performance. The experimental results verify the theory and show superior performance in multi-modal sampling.
引用
收藏
页数:21
相关论文
共 38 条
  • [1] Ahn S., 2013, P 16 INT C ART INT S, P108
  • [2] Smart darting Monte Carlo
    Andricioaei, I
    Straub, JE
    Voter, AF
    [J]. JOURNAL OF CHEMICAL PHYSICS, 2001, 114 (16) : 6994 - 7000
  • [3] Asuncion A., 2007, UCI MACHINE LEARNING
  • [4] Variational Inference: A Review for Statisticians
    Blei, David M.
    Kucukelbir, Alp
    McAuliffe, Jon D.
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2017, 112 (518) : 859 - 877
  • [5] Integrating structured biological data by Kernel Maximum Mean Discrepancy
    Borgwardt, Karsten M.
    Gretton, Arthur
    Rasch, Malte J.
    Kriegel, Hans-Peter
    Schoelkopf, Bernhard
    Smola, Alex J.
    [J]. BIOINFORMATICS, 2006, 22 (14) : E49 - E57
  • [6] Brooks S, 2011, CH CRC HANDB MOD STA, pXIX
  • [7] STOCHASTIC BOUNDARY-CONDITIONS FOR MOLECULAR-DYNAMICS SIMULATIONS OF ST2 WATER
    BRUNGER, A
    BROOKS, CL
    KARPLUS, M
    [J]. CHEMICAL PHYSICS LETTERS, 1984, 105 (05) : 495 - 500
  • [8] ACCURATE STATIONARY DENSITIES WITH PARTITIONED NUMERICAL METHODS FOR STOCHASTIC DIFFERENTIAL EQUATIONS
    Burrage, Kevin
    Lythe, Grant
    [J]. SIAM JOURNAL ON NUMERICAL ANALYSIS, 2009, 47 (03) : 1601 - 1618
  • [9] Computational and inferential difficulties with mixture posterior distributions.
    Celeux, G
    Hurn, M
    Robert, CP
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2000, 95 (451) : 957 - 970
  • [10] HYBRID MONTE-CARLO
    DUANE, S
    KENNEDY, AD
    PENDLETON, BJ
    ROWETH, D
    [J]. PHYSICS LETTERS B, 1987, 195 (02) : 216 - 222