MaskAAE: Latent space optimization for Adversarial Auto-Encoders

Authors
Mondal, Arnab Kumar [1]
Chowdhury, Sankalan Pal [1]
Jayendran, Aravind [1, 2]
Singla, Parag [1]
Asnani, Himanshu [3]
Prathosh, A. P. [1]
Affiliations
[1] IIT Delhi, Delhi, India
[2] Flipkart Internet Pvt Ltd, Bengaluru, Karnataka, India
[3] TIFR, Mumbai, Maharashtra, India
Source
CONFERENCE ON UNCERTAINTY IN ARTIFICIAL INTELLIGENCE (UAI 2020) | 2020 / Vol. 124
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The field of neural generative models is dominated by the highly successful Generative Adversarial Networks (GANs), despite challenges such as training instability and mode collapse. Auto-Encoders (AEs) with a regularized latent space provide an alternative framework for generative models, although their performance has not yet reached that of GANs. In this work, we hypothesise that the dimensionality of the AE model's latent space has a critical effect on the quality of the generated data. Under the assumption that nature generates data by sampling from a "true" generative latent space and then applying a deterministic function, we show that optimal performance is obtained when the dimensionality of the AE model's latent space matches that of the "true" generative latent space. Further, we propose an algorithm called the Mask Adversarial Auto-Encoder (MaskAAE), in which the dimensionality of the latent space of an adversarial auto-encoder is brought closer to that of the "true" generative latent space via a procedure that masks the spurious latent dimensions. We demonstrate through experiments on synthetic and several real-world datasets that the proposed formulation improves generation quality.
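The idea of masking spurious latent dimensions can be illustrated with a small sketch. This is not the paper's algorithm: the abstract does not say how the mask is parameterized or learned, so the sigmoid-gated logits, the 0.5 threshold, and the function names below are all assumptions made for illustration only.

```python
import math


def sigmoid(x: float) -> float:
    """Logistic function mapping a real-valued logit to (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def keep_flags(mask_logits, threshold=0.5):
    """Hypothetical rule: keep a dimension if its gate value reaches the threshold."""
    return [sigmoid(m) >= threshold for m in mask_logits]


def apply_mask(z, mask_logits, threshold=0.5):
    """Zero out the latent coordinates that the mask marks as spurious."""
    if len(z) != len(mask_logits):
        raise ValueError("latent vector and mask must have the same length")
    flags = keep_flags(mask_logits, threshold)
    return [zi if keep else 0.0 for zi, keep in zip(z, flags)]


def effective_dim(mask_logits, threshold=0.5):
    """Number of latent dimensions that survive the mask."""
    return sum(keep_flags(mask_logits, threshold))


# A 4-dimensional latent code in which two dimensions are assumed spurious.
z = [0.7, -1.2, 0.3, 2.0]
mask_logits = [4.0, -4.0, 3.0, -5.0]  # assumed values, as if already learned

masked_z = apply_mask(z, mask_logits)
print(masked_z)                    # [0.7, 0.0, 0.3, 0.0]
print(effective_dim(mask_logits))  # 2
```

In this toy setting the nominal latent dimensionality is 4, while only 2 coordinates carry information after masking, which is the sense in which a mask can bring the effective dimensionality closer to a smaller "true" one.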
Pages: 689-698
Number of pages: 10