Posterior Consistency for Missing Data in Variational Autoencoders

被引:0
作者
Sudak, Timur [1 ]
Tschiatschek, Sebastian [1 ,2 ]
机构
[1] Univ Vienna, Fac Comp Sci, Vienna, Austria
[2] Univ Vienna, Res Network Data Sci, Vienna, Austria
来源
MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: RESEARCH TRACK, ECML PKDD 2023, PT II | 2023年 / 14170卷
关键词
Variational Autoencoders; Missing Data;
D O I
10.1007/978-3-031-43415-0_30
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We consider the problem of learning Variational Autoencoders (VAEs), i.e., a type of deep generative model, from data with missing values. Such data is omnipresent in real-world applications of machine learning because complete data is often impossible or too costly to obtain. We particularly focus on improving a VAE's amortized posterior inference, i.e., the encoder, which in the case of missing data can be susceptible to learning inconsistent posterior distributions regarding the missingness. To this end, we provide a formal definition of posterior consistency and propose an approach for regularizing an encoder's posterior distribution which promotes this consistency. We observe that the proposed regularization suggests a different training objective than that typically considered in the literature when facing missing values. Furthermore, we empirically demonstrate that our regularization leads to improved performance in missing value settings in terms of reconstruction quality and downstream tasks utilizing uncertainty in the latent space. This improved performance can be observed for many classes of VAEs including VAEs equipped with normalizing flows.
引用
收藏
页码:508 / 524
页数:17
相关论文
共 44 条
  • [11] Goodfellow I, 2017, Arxiv, DOI arXiv:1701.00160
  • [12] Generative Adversarial Networks
    Goodfellow, Ian
    Pouget-Abadie, Jean
    Mirza, Mehdi
    Xu, Bing
    Warde-Farley, David
    Ozair, Sherjil
    Courville, Aaron
    Bengio, Yoshua
    [J]. COMMUNICATIONS OF THE ACM, 2020, 63 (11) : 139 - 144
  • [13] Ipsen N.B., 2021, INT C LEARNING REPRE
  • [14] Kingma D. P., 2014, arXiv
  • [15] Kingma D. P., 2015, PROC INT C LEARN REP
  • [16] Generative Face Completion
    Li, Yijun
    Liu, Sifei
    Yang, Jimei
    Yang, Ming-Hsuan
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 5892 - 5900
  • [17] Lim DK, 2022, Arxiv, DOI [arXiv:2101.07357, DOI 10.48550/ARXIV.2101.07357,ARXIV.ORG/ABS/2101.07357, 10.48550/ARXIV.2101.07357,arxiv.org/abs/2101.07357]
  • [18] Little RJ, 2019, STAT ANAL MISSING DA, V793
  • [19] Little RJA, 2019, Wiley Series in Probability and Statistics, VThird, P1
  • [20] Liu Y., 2020, Towards consistent variational auto-encoding, P13869