Multimodal image-to-image translation between domains with high internal variability

被引:6
|
作者
Wang, Jian [1 ]
Lv, Jiancheng [1 ]
Yang, Xue [1 ]
Tang, Chenwei [1 ]
Peng, Xi [1 ]
机构
[1] Sichuan Univ, Coll Comp Sci, Chengdu 610065, Peoples R China
基金
美国国家科学基金会; 国家重点研发计划;
关键词
GANs; Image translation; High internal variability; Catastrophic forgetting; Generator regulating;
D O I
10.1007/s00500-020-05073-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multimodal image-to-image translation based on generative adversarial networks (GANs) shows suboptimal performance in the visual domains with high internal variability, e.g., translation from multiple breeds of cats to multiple breeds of dogs. To alleviate this problem, we recast the training procedure as modeling distinct distributions which are observed sequentially, for example, when different classes are encountered over time. As a result, the discriminator may forget about the previous target distributions, known as catastrophic forgetting, leading to non-/slow convergence. Through experimental observation, we found that the discriminator does not always forget the previously learned distributions during training. Therefore, we propose a novel generator regulating GAN (GR-GAN). The proposed method encourages the discriminator to teach the generator more effectively when it remembers more of the previously learned distributions, while discouraging the discriminator to guide the generator when catastrophic forgetting happens on the discriminator. Both qualitative and quantitative results show that the proposed method is significantly superior to the state-of-the-art methods in handling the image data that are with high variability.
引用
收藏
页码:18173 / 18184
页数:12
相关论文
共 50 条
  • [1] Multimodal image-to-image translation between domains with high internal variability
    Jian Wang
    Jiancheng Lv
    Xue Yang
    Chenwei Tang
    Xi Peng
    Soft Computing, 2020, 24 : 18173 - 18184
  • [2] Multimodal Unsupervised Image-to-Image Translation
    Huang, Xun
    Liu, Ming-Yu
    Belongie, Serge
    Kautz, Jan
    COMPUTER VISION - ECCV 2018, PT III, 2018, 11207 : 179 - 196
  • [3] Image-to-image translation for wavefront and PSF estimation
    Smith, Jeffrey
    Cranney, Jesse
    Gretton, Charles
    Gratadour, Damien
    ADAPTIVE OPTICS SYSTEMS VIII, 2022, 12185
  • [4] VecGAN: Image-to-Image Translation with Interpretable Latent Directions
    Dalva, Yusuf
    Altindis, Said Fahri
    Dundar, Aysegul
    COMPUTER VISION - ECCV 2022, PT XVI, 2022, 13676 : 153 - 169
  • [5] Unpaired Image-to-Image Translation with Diffusion Adversarial Network
    Tu, Hangyao
    Wang, Zheng
    Zhao, Yanwei
    MATHEMATICS, 2024, 12 (20)
  • [6] A Diffusion Model Translator for Efficient Image-to-Image Translation
    Xia, Mengfei
    Zhou, Yu
    Yi, Ran
    Liu, Yong-Jin
    Wang, Wenping
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (12) : 10272 - 10283
  • [7] Facial Feature Based Image-to-Image Translation Method
    Kang, Shinjin
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2020, 14 (12): : 4835 - 4848
  • [8] Improving Shape Deformation in Unsupervised Image-to-Image Translation
    Gokaslan, Aaron
    Ramanujan, Vivek
    Ritchie, Daniel
    Kim, Kwang In
    Tompkin, James
    COMPUTER VISION - ECCV 2018, PT XII, 2018, 11216 : 662 - 678
  • [9] Unified Generative Adversarial Networks for Controllable Image-to-Image Translation
    Tang, Hao
    Liu, Hong
    Sebe, Nicu
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 8916 - 8929
  • [10] Literature Review of Generative models for Image-to-Image translation problems
    Kamil, Anwar
    Shaikh, Talal
    PROCEEDINGS OF 2019 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND KNOWLEDGE ECONOMY (ICCIKE' 2019), 2019, : 341 - 346