Multimodal image-to-image translation between domains with high internal variability

被引:6
|
作者
Wang, Jian [1 ]
Lv, Jiancheng [1 ]
Yang, Xue [1 ]
Tang, Chenwei [1 ]
Peng, Xi [1 ]
机构
[1] Sichuan Univ, Coll Comp Sci, Chengdu 610065, Peoples R China
基金
美国国家科学基金会; 国家重点研发计划;
关键词
GANs; Image translation; High internal variability; Catastrophic forgetting; Generator regulating;
D O I
10.1007/s00500-020-05073-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multimodal image-to-image translation based on generative adversarial networks (GANs) shows suboptimal performance in the visual domains with high internal variability, e.g., translation from multiple breeds of cats to multiple breeds of dogs. To alleviate this problem, we recast the training procedure as modeling distinct distributions which are observed sequentially, for example, when different classes are encountered over time. As a result, the discriminator may forget about the previous target distributions, known as catastrophic forgetting, leading to non-/slow convergence. Through experimental observation, we found that the discriminator does not always forget the previously learned distributions during training. Therefore, we propose a novel generator regulating GAN (GR-GAN). The proposed method encourages the discriminator to teach the generator more effectively when it remembers more of the previously learned distributions, while discouraging the discriminator to guide the generator when catastrophic forgetting happens on the discriminator. Both qualitative and quantitative results show that the proposed method is significantly superior to the state-of-the-art methods in handling the image data that are with high variability.
引用
收藏
页码:18173 / 18184
页数:12
相关论文
共 50 条
  • [31] InstaFormer plus plus : Multi-Domain Instance-Aware Image-to-Image Translation with Transformer
    Kim, Soohyun
    Baek, Jongbeom
    Park, Jihye
    Ha, Eunjae
    Jung, Homin
    Lee, Taeyoung
    Kim, Seungryong
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, 132 (04) : 1167 - 1186
  • [32] Fine-grained facial image-to-image translation with an attention based pipeline generative adversarial framework
    Yan Zhao
    Ziqiang Zheng
    Chao Wang
    Zhaorui Gu
    Min Fu
    Zhibin Yu
    Haiyong Zheng
    Nan Wang
    Bing Zheng
    Multimedia Tools and Applications, 2020, 79 : 14981 - 15000
  • [33] SingleGAN: Image-to-Image Translation by a Single-Generator Network Using Multiple Generative Adversarial Learning
    Yu, Xiaoming
    Cai, Xing
    Ying, Zhenqiang
    Li, Thomas
    Li, Ge
    COMPUTER VISION - ACCV 2018, PT V, 2019, 11365 : 341 - 356
  • [34] Fine-grained facial image-to-image translation with an attention based pipeline generative adversarial framework
    Zhao, Yan
    Zheng, Ziqiang
    Wang, Chao
    Gu, Zhaorui
    Fu, Min
    Yu, Zhibin
    Zheng, Haiyong
    Wang, Nan
    Zheng, Bing
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (21-22) : 14981 - 15000
  • [35] Adaptive Multi-scale Information Fusion Based on Dynamic Receptive Field for Image-to-image Translation
    Yin Mengxiao
    Lin Zhenfeng
    Yang Feng
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2021, 43 (08) : 2386 - 2394
  • [36] Cross-Modality LGE-CMR Segmentation Using Image-to-Image Translation Based Data Augmentation
    Wang, Wei
    Yu, Xinhua
    Fang, Bo
    Zhao, Yue
    Chen, Yongyong
    Wei, Wei
    Chen, Junxin
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2023, 20 (04) : 2367 - 2375
  • [37] SEMI2I: SEMANTICALLY CONSISTENT IMAGE-TO-IMAGE TRANSLATION FOR DOMAIN ADAPTATION OF REMOTE SENSING DATA
    Tasar, Onur
    Happy, S. L.
    Tarabalka, Yuliya
    Alliez, Pierre
    IGARSS 2020 - 2020 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2020, : 1837 - 1840
  • [38] W2GAN: Importance Weight and Wavelet feature guided Image-to-Image translation under limited data
    Yang, Qiuxia
    Pu, Yuanyuan
    Zhao, Zhengpeng
    Xu, Dan
    Li, Siqi
    COMPUTERS & GRAPHICS-UK, 2023, 116 : 115 - 127
  • [39] OSAGGAN: one-shot unsupervised image-to-image translation using attention-guided generative adversarial networks
    Xiaofei Huo
    Bin Jiang
    Haotian Hu
    Xinjiao Zhou
    Bolin Zhang
    International Journal of Machine Learning and Cybernetics, 2023, 14 : 3471 - 3482
  • [40] Unsupervised Object-Level Image-to-Image Translation Using Positional Attention Bi-Flow Generative Network
    Yuan, Liuchun
    Chen, Dihu
    Hu, Haifeng
    IEEE ACCESS, 2019, 7 : 30637 - 30647