MaskAAE: Latent space optimization for Adversarial Auto-Encoders

Cited by: 0

Authors:

Mondal, Arnab Kumar [1]

Chowdhury, Sankalan Pal [1]

Jayendran, Aravind [1,2]

Singla, Parag [1]

Asnani, Himanshu [3]

Prathosh, A. P. [1]

Affiliations:

[1] IIT Delhi, Delhi, India

[2] Flipkart Internet Pvt Ltd, Bengaluru, Karnataka, India

[3] TIFR, Mumbai, Maharashtra, India

Source:

CONFERENCE ON UNCERTAINTY IN ARTIFICIAL INTELLIGENCE (UAI 2020) | 2020, Vol. 124

Keywords:

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

The field of neural generative models is dominated by the highly successful Generative Adversarial Networks (GANs), despite challenges such as training instability and mode collapse. Auto-Encoders (AEs) with a regularized latent space provide an alternative framework for generative models, although their performance has not yet reached that of GANs. In this work, we hypothesise that the dimensionality of the AE model's latent space has a critical effect on the quality of the generated data. Under the assumption that nature generates data by sampling from a "true" generative latent space and then applying a deterministic function, we show that optimal performance is obtained when the dimensionality of the AE model's latent space matches that of the "true" generative latent space. Further, we propose an algorithm called the Mask Adversarial Auto-Encoder (MaskAAE), in which the dimensionality of the latent space of an adversarial auto-encoder is brought closer to that of the "true" generative latent space by masking the spurious latent dimensions. We demonstrate through experiments on synthetic and several real-world datasets that the proposed formulation improves generation quality.
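
The abstract mentions masking spurious latent dimensions but does not describe the procedure itself. As a rough illustration only, and not the authors' algorithm, the sketch below assumes a fixed, hand-written mask vector that is multiplied elementwise into a latent code, so that masked dimensions contribute nothing downstream. The names `apply_mask` and `effective_dim`, the threshold, and the example values are all hypothetical.

```python
from typing import List


def apply_mask(z: List[float], mask: List[float]) -> List[float]:
    """Elementwise product of a latent code and a mask of the same length."""
    if len(z) != len(mask):
        raise ValueError("latent code and mask must have the same length")
    return [zi * mi for zi, mi in zip(z, mask)]


def effective_dim(mask: List[float], threshold: float = 0.5) -> int:
    """Count the latent dimensions whose mask entry exceeds the threshold."""
    return sum(1 for m in mask if m > threshold)


# Hypothetical 5-dimensional latent code; the mask is an assumption made
# for this example and switches off the last two dimensions.
z = [0.7, -1.2, 0.3, 2.0, -0.5]
mask = [1.0, 1.0, 1.0, 0.0, 0.0]

masked_z = apply_mask(z, mask)
active = effective_dim(mask)
```

In this toy setting the nominal latent size stays at 5 while only the unmasked dimensions carry information, which is one simple way to picture a model's effective latent dimensionality being reduced without changing the architecture.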

Pages: 689-698

Page count: 10

