Comparing the latent space of generative models

被引:12
作者
Asperti, Andrea [1 ]
Tonelli, Valerio [1 ]
机构
[1] Univ Bologna, Dept Informat Sci & Engn DISI, Mura Anteo Zamboni 7, I-40126 Bologna, Italy
关键词
Generative models; Latent space; Representation learning; Generative adversarial networks; Variational autoencoders;
D O I
10.1007/s00521-022-07890-2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Different encodings of datapoints in the latent space of latent-vector generative models may result in more or less effective and disentangled characterizations of the different explanatory factors of variation behind the data. Many works have been recently devoted to the exploration of the latent space of specific models, mostly focused on the study of how features are disentangled and of how trajectories producing desired alterations of data in the visible space can be found. In this work we address the more general problem of comparing the latent spaces of different models, looking for transformations between them. We confined the investigation to the familiar and largely investigated case of generative models for the data manifold of human faces. The surprising, preliminary result reported in this article is that (provided models have not been taught or explicitly conceived to act differently) a simple linear mapping is enough to pass from a latent space to another while preserving most of the information. This is full of consequences for representation learning, potentially paving the way to the transformation of editing trajectories from one space to another, or the adaptation of disentanglement techniques between different generative domains.
引用
收藏
页码:3155 / 3172
页数:18
相关论文
共 80 条
  • [1] Abdal R, 2020, PROC CVPR IEEE, P8293, DOI 10.1109/CVPR42600.2020.00832
  • [2] Image2StyleGAN: How to Embed Images Into the StyleGAN Latent Space?
    Abdal, Rameen
    Qin, Yipeng
    Wonka, Peter
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 4431 - 4440
  • [3] Alaluf Y., 2022, P IEEE CVF C COMP VI, P18511
  • [4] Alemi AA, 2018, PR MACH LEARN RES, V80
  • [5] MimicGAN: Robust Projection onto Image Manifolds with Corruption Mimicking
    Anirudh, Rushil
    Thiagarajan, Jayaraman J.
    Kailkhura, Bhavya
    Bremer, Peer-Timo
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2020, 128 (10-11) : 2459 - 2477
  • [6] Arjovsky M, 2017, PR MACH LEARN RES, V70
  • [7] Asperti A., 2021, SN Comput Sci, V2, P1, DOI [10.1007/s42979-021-00702-9, DOI 10.1007/S42979-021-00702-9]
  • [8] Asperti A, 2019, P 1 INT C ADV SIGNAL
  • [9] Enhancing Variational Generation Through Self-Decomposition
    Asperti, Andrea
    Bugo, Laura
    Filippini, Daniele
    [J]. IEEE ACCESS, 2022, 10 : 67510 - 67520
  • [10] Balancing Reconstruction Error and Kullback-Leibler Divergence in Variational Autoencoders
    Asperti, Andrea
    Trentin, Matteo
    [J]. IEEE ACCESS, 2020, 8 : 199440 - 199448