Comparing the latent space of generative models

Cited by: 14

Authors:

Asperti, Andrea [1]

Tonelli, Valerio [1]

Affiliations:

[1] Univ Bologna, Dept Informat Sci & Engn DISI, Mura Anteo Zamboni 7, I-40126 Bologna, Italy

Source:

NEURAL COMPUTING & APPLICATIONS | 2023, Vol. 35, No. 4

Keywords:

Generative models; Latent space; Representation learning; Generative adversarial networks; Variational autoencoders

DOI:

10.1007/s00521-022-07890-2

Chinese Library Classification:

TP18 [Artificial intelligence theory]

Subject classification codes:

081104; 0812; 0835; 1405

Abstract:

Different encodings of datapoints in the latent space of latent-vector generative models may result in more or less effective and disentangled characterizations of the different explanatory factors of variation behind the data. Many works have been recently devoted to the exploration of the latent space of specific models, mostly focused on the study of how features are disentangled and of how trajectories producing desired alterations of data in the visible space can be found. In this work we address the more general problem of comparing the latent spaces of different models, looking for transformations between them. We confined the investigation to the familiar and largely investigated case of generative models for the data manifold of human faces. The surprising, preliminary result reported in this article is that (provided models have not been taught or explicitly conceived to act differently) a simple linear mapping is enough to pass from a latent space to another while preserving most of the information. This is full of consequences for representation learning, potentially paving the way to the transformation of editing trajectories from one space to another, or the adaptation of disentanglement techniques between different generative domains.
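To make the phrase "a simple linear mapping" concrete, the following is a minimal sketch, not the authors' procedure. The abstract does not say how the mapping is fitted; ordinary least squares with a bias term is an assumption here, as are the function names `fit_linear_map` and `apply_linear_map` and the synthetic latent codes standing in for paired encodings of the same images under two models.

```python
import numpy as np

def fit_linear_map(z_src, z_dst):
    """Fit an affine map z_dst ~ [z_src, 1] @ w by ordinary least squares.

    z_src: (n, d_src) latent codes from the first model (hypothetical data).
    z_dst: (n, d_dst) latent codes of the same samples in the second model.
    Returns w with shape (d_src + 1, d_dst); the last row is the bias.
    """
    ones = np.ones((z_src.shape[0], 1))
    design = np.hstack([z_src, ones])
    w, *_ = np.linalg.lstsq(design, z_dst, rcond=None)
    return w

def apply_linear_map(w, z_src):
    """Map source latent codes into the target latent space."""
    ones = np.ones((z_src.shape[0], 1))
    return np.hstack([z_src, ones]) @ w

# Synthetic stand-in for paired latent codes (assumption: no real models here).
rng = np.random.default_rng(0)
z_a = rng.normal(size=(500, 8))           # codes in latent space A
true_w = rng.normal(size=(8, 6))
true_b = rng.normal(size=(1, 6))
z_b = z_a @ true_w + true_b               # codes in latent space B

w = fit_linear_map(z_a, z_b)
mse = float(np.mean((apply_linear_map(w, z_a) - z_b) ** 2))
print(f"mapping shape: {w.shape}, reconstruction MSE: {mse:.3e}")
```

With real models, the reconstruction error of such a fit on held-out pairs would be one rough indicator of how much information a linear map preserves.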


Pages: 3155-3172

Page count: 18
