Arbitrary conditional inference in variational autoencoders via fast prior network training

被引:0
作者
Ga Wu
Justin Domke
Scott Sanner
机构
[1] University of Toronto,Department of Mechanical and Industrial Engineering
[2] University of Massachusetts,College of Computing and Information Sciences
来源
Machine Learning | 2022年 / 111卷
关键词
Variational autoencoder; Conditional inference; Prior network;
D O I
暂无
中图分类号
学科分类号
摘要
Variational Autoencoders (VAEs) are a popular generative model, but one in which conditional inference can be challenging. If the decomposition into query and evidence variables is fixed, conditionally trained VAEs provide an attractive solution. However, to efficiently support arbitrary queries over pre-trained VAEs when the query and evidence are not known in advance, one is generally reduced to MCMC sampling methods that can suffer from long mixing times. In this paper, we propose an idea of efficiently training small conditional prior networks to approximate the latent distribution of the VAE after conditioning on an evidence assignment; this permits generating query samples without retraining the full VAE. We experimentally evaluate three variations of conditional prior networks showing that (i) they can be quickly optimized for different decompositions of evidence and query and (ii) they quantitatively and qualitatively outperform existing state-of-the-art methods for conditional inference in pre-trained VAEs.
引用
收藏
页码:2537 / 2559
页数:22
相关论文
共 14 条
  • [1] Girolami M(2011)Riemann manifold langevin and hamiltonian monte carlo methods Journal of the Royal Statistical Society: Series B (Statistical Methodology) 73 123-214
  • [2] Calderhead B(2017)Globally and locally consistent image completion ACM Transactions on Graphics (ToG) 36 1-14
  • [3] Iizuka S(1989)On the limited memory bfgs method for large scale optimization Mathematical Programming 45 503-528
  • [4] Simo-Serra E(1996)Mean field theory for sigmoid belief networks Journal of Artificial Intelligence Research 4 61-76
  • [5] Ishikawa H(2004)Image quality assessment: From error visibility to structural similarity IEEE Transactions on Image Processing 13 600-612
  • [6] Liu DC(undefined)undefined undefined undefined undefined-undefined
  • [7] Nocedal J(undefined)undefined undefined undefined undefined-undefined
  • [8] Saul LK(undefined)undefined undefined undefined undefined-undefined
  • [9] Jaakkola TS(undefined)undefined undefined undefined undefined-undefined
  • [10] Jordan MI(undefined)undefined undefined undefined undefined-undefined