Evaluating the Variance of Likelihood-Ratio Gradient Estimators

被引:0
作者
Tokui, Seiya [1 ,2 ]
Sato, Issei [2 ,3 ]
机构
[1] Preferred Networks, Tokyo, Japan
[2] Univ Tokyo, Tokyo, Japan
[3] RIKEN, Tokyo, Japan
来源
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70 | 2017年 / 70卷
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The likelihood-ratio method is often used to estimate gradients of stochastic computations, for which baselines are required to reduce the estimation variance. Many types of baselines have been proposed, although their degree of optimality is not well understood. In this study, we establish a novel framework of gradient estimation that includes most of the common gradient estimators as special cases. The framework gives a natural derivation of the optimal estimator that can be interpreted as a special case of the likelihood-ratio method so that we can evaluate the optimal degree of practical techniques with it. It bridges the likelihood-ratio method and the reparameterization trick while still supporting discrete variables. It is derived from the exchange property of the differentiation and integration. To be more specific, it is derived by the reparameterization trick and local marginalization analogous to the local expectation gradient. We evaluate various baselines and the optimal estimator for variational learning and show that the performance of the modern estimators is close to the optimal estimator.
引用
收藏
页数:10
相关论文
共 34 条
[1]  
[Anonymous], 2016, ARXIV161100712
[2]  
[Anonymous], 2016, ARXIV161102247
[3]  
[Anonymous], 2016, P 32 C UNC ART INT
[4]  
[Anonymous], 2013, CoRR abs/1308.3432
[5]  
[Anonymous], 2001, Proceedings of the 17th Conference on Uncertainty in Artificial Intelligence
[6]  
[Anonymous], 2010, Advances in Neural Information Processing Systems
[7]  
Burda Yuri, 2015, ICLR
[8]  
GAL Y, 2017, THESIS
[9]  
Glorot X., 2010, P 13 INT C ART INT S, P249, DOI DOI 10.1109/LGRS.2016.2565705
[10]   OPTIMIZATION OF STOCHASTIC-SYSTEMS VIA SIMULATION [J].
GLYNN, PW .
1989 WINTER SIMULATION CONFERENCE PROCEEDINGS, 1989, :90-105