Multimodal image-to-image translation between domains with high internal variability

被引:6
|
作者
Wang, Jian [1 ]
Lv, Jiancheng [1 ]
Yang, Xue [1 ]
Tang, Chenwei [1 ]
Peng, Xi [1 ]
机构
[1] Sichuan Univ, Coll Comp Sci, Chengdu 610065, Peoples R China
基金
美国国家科学基金会; 国家重点研发计划;
关键词
GANs; Image translation; High internal variability; Catastrophic forgetting; Generator regulating;
D O I
10.1007/s00500-020-05073-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multimodal image-to-image translation based on generative adversarial networks (GANs) shows suboptimal performance in the visual domains with high internal variability, e.g., translation from multiple breeds of cats to multiple breeds of dogs. To alleviate this problem, we recast the training procedure as modeling distinct distributions which are observed sequentially, for example, when different classes are encountered over time. As a result, the discriminator may forget about the previous target distributions, known as catastrophic forgetting, leading to non-/slow convergence. Through experimental observation, we found that the discriminator does not always forget the previously learned distributions during training. Therefore, we propose a novel generator regulating GAN (GR-GAN). The proposed method encourages the discriminator to teach the generator more effectively when it remembers more of the previously learned distributions, while discouraging the discriminator to guide the generator when catastrophic forgetting happens on the discriminator. Both qualitative and quantitative results show that the proposed method is significantly superior to the state-of-the-art methods in handling the image data that are with high variability.
引用
收藏
页码:18173 / 18184
页数:12
相关论文
共 50 条
  • [21] Attention-Guided Generative Adversarial Networks for Unsupervised Image-to-Image Translation
    Tang, Hao
    Xu, Dan
    Sebel, Nicu
    Yan, Yan
    2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [22] Multi-Channel Attention Selection GANs for Guided Image-to-Image Translation
    Tang, Hao
    Torr, Philip H. S.
    Sebe, Nicu
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (05) : 6055 - 6071
  • [23] Cross-Granularity Learning for Multi-Domain Image-to-Image Translation
    Fu, Huiyuan
    Yu, Ting
    Wang, Xin
    Ma, Huadong
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 3099 - 3107
  • [24] Multi-Domain Image-to-Image Translation via a Unified Circular Framework
    Wang, Yuxi
    Zhang, Zhaoxiang
    Hao, Wangli
    Song, Chunfeng
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 670 - 684
  • [25] Cycle consistent twin energy-based models for image-to-image translation
    Tiwary, Piyush
    Bhattacharyya, Kinjawl
    Prathosh, A. P.
    MEDICAL IMAGE ANALYSIS, 2024, 91
  • [26] Self-attention StarGAN for Multi-domain Image-to-Image Translation
    He, Ziliang
    Yang, Zhenguo
    Mao, Xudong
    Lv, Jianming
    Li, Qing
    Liu, Wenyin
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2019: IMAGE PROCESSING, PT III, 2019, 11729 : 537 - 549
  • [27] DuCaGAN: Unified Dual Capsule Generative Adversarial Network for Unsupervised Image-to-Image Translation
    Shao, Guifang
    Huang, Meng
    Gao, Fengqiang
    Liu, Tundong
    Li, Liduan
    IEEE ACCESS, 2020, 8 : 154691 - 154707
  • [28] Generating Large Labeled Data Sets for Laparoscopic Image Processing Tasks Using Unpaired Image-to-Image Translation
    Pfeiffer, Micha
    Funke, Isabel
    Robu, Maria R.
    Bodenstedt, Sebastian
    Strenger, Leon
    Engelhardt, Sandy
    Ross, Tobias
    Clarkson, Matthew J.
    Gurusamy, Kurinchi
    Davidson, Brian R.
    Maier-Hein, Lena
    Riediger, Carina
    Welsch, Thilo
    Weitz, Juergen
    Speidel, Stefanie
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2019, PT V, 2019, 11768 : 119 - 127
  • [29] Image-to-Image Translation Between Tau Pathology and Neuronal Metabolism PET in Alzheimer Disease with Multi-domain Contrastive Learning
    Duong, Michael Tran
    Das, Sandhitsu R.
    Khandelwal, Pulkit
    Lyu, Xueying
    Xie, Long
    Yushkevich, Paul A.
    Wolk, David A.
    Nasrallah, Ilya M.
    MACHINE LEARNING IN CLINICAL NEUROIMAGING, MLCN 2023, 2023, 14312 : 3 - 13
  • [30] InstaFormer++: Multi-Domain Instance-Aware Image-to-Image Translation with Transformer
    Soohyun Kim
    Jongbeom Baek
    Jihye Park
    Eunjae Ha
    Homin Jung
    Taeyoung Lee
    Seungryong Kim
    International Journal of Computer Vision, 2024, 132 : 1167 - 1186