Black-box Coreset Variational Inference

被引:0
作者
Manousakas, Dionysis [1 ]
Ritter, Hippolyt [1 ]
Karaletsos, Theofanis [2 ]
机构
[1] Meta, New York, NY 10011 USA
[2] Insitro, San Francisco, CA USA
来源
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022) | 2022年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent advances in coreset methods have shown that a selection of representative datapoints can replace massive volumes of data for Bayesian inference, preserving the relevant statistical information and significantly accelerating subsequent downstream tasks. Existing variational coreset constructions rely on either selecting subsets of the observed datapoints, or jointly performing approximate inference and optimizing pseudodata in the observed space akin to inducing points methods in Gaussian Processes. So far, both approaches are limited by complexities in evaluating their objectives for general purpose models, and require generating samples from a typically intractable posterior over the coreset throughout inference and testing. In this work, we present a black-box variational inference framework for coresets that overcomes these constraints and enables principled application of variational coresets to intractable models, such as Bayesian neural networks. We apply our techniques to supervised learning problems, and compare them with existing approaches in the literature for data summarization and inference.
引用
收藏
页数:13
相关论文
共 58 条
  • [1] Agarwal P. K., 2005, Combinatorial and computational geometry
  • [2] Bachem O., 2017, PRACTICAL CORESET CO
  • [3] Bachem Olivier., 2015, ICML
  • [4] Balles L., 2021, GRADIENT MATCHING CO
  • [5] Bingham E., 2019, JMLR
  • [6] Borsos Z., 2021, ICASSP
  • [7] Borsos Zalan, 2020, NeurIPS
  • [8] Boski M, 2017, 2017 10TH INTERNATIONAL WORKSHOP ON MULTIDIMENSIONAL (ND) SYSTEMS (NDS)
  • [9] Large-Scale Machine Learning with Stochastic Gradient Descent
    Bottou, Leon
    [J]. COMPSTAT'2010: 19TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL STATISTICS, 2010, : 177 - 186
  • [10] Burda Y., 2016, ICLR