Adaptation of the Independent Metropolis-Hastings Sampler with Normalizing Flow Proposals
被引:0
|
作者:
Brofos, James A.
论文数: 0引用数: 0
h-index: 0
机构:
Yale Univ, New Haven, CT 06520 USAYale Univ, New Haven, CT 06520 USA
Brofos, James A.
[1
]
Gabrie, Marylou
论文数: 0引用数: 0
h-index: 0
机构:
NYU, CDS, New York, NY 10003 USA
Flatiron Inst, CCM, New York, NY USAYale Univ, New Haven, CT 06520 USA
Gabrie, Marylou
[2
,3
]
Brubaker, Marcus A.
论文数: 0引用数: 0
h-index: 0
机构:
York Univ, N York, ON, Canada
Vector Inst, Toronto, ON, CanadaYale Univ, New Haven, CT 06520 USA
Brubaker, Marcus A.
[4
,5
]
Lederman, Roy R.
论文数: 0引用数: 0
h-index: 0
机构:
Yale Univ, New Haven, CT 06520 USAYale Univ, New Haven, CT 06520 USA
Lederman, Roy R.
[1
]
机构:
[1] Yale Univ, New Haven, CT 06520 USA
[2] NYU, CDS, New York, NY 10003 USA
[3] Flatiron Inst, CCM, New York, NY USA
[4] York Univ, N York, ON, Canada
[5] Vector Inst, Toronto, ON, Canada
来源:
INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 151
|
2022年
/
151卷
基金:
加拿大自然科学与工程研究理事会;
美国国家科学基金会;
关键词:
ERGODICITY;
D O I:
暂无
中图分类号:
TP18 [人工智能理论];
学科分类号:
081104 ;
0812 ;
0835 ;
1405 ;
摘要:
Markov Chain Monte Carlo (MCMC) methods are a powerful tool for computation with complex probability distributions. However the performance of such methods is critically dependent on properly tuned parameters, most of which are difficult if not impossible to know a priori for a given target distribution. Adaptive MCMC methods aim to address this by allowing the parameters to be updated during sampling based on previous samples from the chain at the expense of requiring a new theoretical analysis to ensure convergence. In this work we extend the convergence theory of adaptive MCMC methods to a new class of methods built on a powerful class of parametric density estimators known as normalizing flows. In particular, we consider an independent Metropolis-Hastings sampler where the proposal distribution is represented by a normalizing flow whose parameters are updated using stochastic gradient descent. We explore the practical performance of this procedure on both synthetic settings and in the analysis of a physical field system, and compare it against both adaptive and non-adaptive MCMC methods.