Multimodal image-to-image translation between domains with high internal variability

Cited by: 6
Authors
Wang, Jian [1]
Lv, Jiancheng [1]
Yang, Xue [1]
Tang, Chenwei [1]
Peng, Xi [1]
Affiliation
[1] Sichuan Univ, Coll Comp Sci, Chengdu 610065, Peoples R China
Funding
National Science Foundation (USA); National Key Research and Development Program;
Keywords
GANs; Image translation; High internal variability; Catastrophic forgetting; Generator regulating;
DOI
10.1007/s00500-020-05073-6
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Multimodal image-to-image translation based on generative adversarial networks (GANs) performs suboptimally in visual domains with high internal variability, e.g., translation from multiple breeds of cats to multiple breeds of dogs. To address this problem, we recast the training procedure as modeling distinct distributions that are observed sequentially, for example, when different classes are encountered over time. Under this view, the discriminator may forget previously seen target distributions, a phenomenon known as catastrophic forgetting, which leads to slow convergence or non-convergence. Through experimental observation, we found that the discriminator does not always forget the previously learned distributions during training. We therefore propose a novel generator-regulating GAN (GR-GAN). The proposed method encourages the discriminator to teach the generator more effectively when it remembers more of the previously learned distributions, and discourages the discriminator from guiding the generator when the discriminator suffers catastrophic forgetting. Both qualitative and quantitative results show that the proposed method significantly outperforms state-of-the-art methods on image data with high variability.
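The abstract does not say how GR-GAN measures what the discriminator remembers or how it regulates the generator, so the following is only an illustrative sketch of the general idea, not the authors' method. It assumes a hypothetical "memory score" (the fraction of previously seen real samples that the discriminator still rates as real) and uses it to scale a standard non-saturating generator loss. The names `memory_score` and `regulated_generator_loss` are invented for this example.

```python
import math


def memory_score(d_scores_on_past_real):
    """Hypothetical memory measure: fraction of previously seen real
    samples that the discriminator still scores above 0.5 (i.e. as real)."""
    if not d_scores_on_past_real:
        return 1.0  # nothing to forget yet
    kept = sum(1 for s in d_scores_on_past_real if s > 0.5)
    return kept / len(d_scores_on_past_real)


def regulated_generator_loss(d_scores_on_fake, d_scores_on_past_real, eps=1e-8):
    """Non-saturating generator loss, scaled by the memory score.

    Assumption for illustration: a discriminator that has forgotten past
    distributions (low score) contributes a weaker signal to the generator.
    Returns (scaled_loss, weight).
    """
    base = -sum(math.log(s + eps) for s in d_scores_on_fake) / len(d_scores_on_fake)
    weight = memory_score(d_scores_on_past_real)
    return weight * base, weight


# Discriminator outputs in (0, 1); values here are made up.
loss, weight = regulated_generator_loss(
    d_scores_on_fake=[0.3, 0.4],
    d_scores_on_past_real=[0.9, 0.8, 0.2, 0.6],
)
print(f"weight={weight:.2f}, loss={loss:.4f}")
```

With this kind of scaling, a discriminator that still recognizes earlier real samples passes its full signal to the generator, while one that has forgotten them is down-weighted; the actual criterion and weighting used in the paper may differ.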
Pages: 18173-18184
Number of pages: 12
Related papers
50 in total
  • [41] OSAGGAN: one-shot unsupervised image-to-image translation using attention-guided generative adversarial networks
    Huo, Xiaofei
    Jiang, Bin
    Hu, Haotian
    Zhou, Xinjiao
    Zhang, Bolin
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2023, 14 (10) : 3471 - 3482
  • [42] Unsupervised Domain Adaptation for the Semantic Segmentation of Remote Sensing Images via One-Shot Image-to-Image Translation
    Ismael, Sarmad F.
    Kayabol, Koray
    Aptoula, Erchan
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20
  • [43] SEMI-SUPERVISED MULTIMODAL IMAGE TRANSLATION FOR MISSING MODALITY IMPUTATION
    Sun, Wangbin
    Ma, Fei
    Li, Yang
    Huang, Shao-Lun
    Ni, Shiguang
    Zhang, Lin
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 4320 - 4324
  • [44] Multimodal Image Translation Algorithm Based on Singular Squeeze-and-Excitation Network
    Tu, Hangyao
    Wang, Zheng
    Zhao, Yanwei
    MATHEMATICS, 2025, 13 (01)
  • [45] Polarized Image Translation From Nonpolarized Cameras for Multimodal Face Anti-Spoofing
    Tian, Yu
    Huang, Yalin
    Zhang, Kunbo
    Liu, Yue
    Sun, Zhenan
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2023, 18 : 5651 - 5664
  • [46] Multitask learning for image translation and salient object detection from multimodal remote sensing images
    Lian, Yuanfeng
    Shi, Xu
    Shen, ShaoChen
    Hua, Jing
    VISUAL COMPUTER, 2024, 40 (03) : 1395 - 1414
  • [47] Multitask learning for image translation and salient object detection from multimodal remote sensing images
    Yuanfeng Lian
    Xu Shi
    ShaoChen Shen
    Jing Hua
    The Visual Computer, 2024, 40 : 1395 - 1414
  • [48] IMAGE TRANSLATION BETWEEN SAR AND OPTICAL IMAGERY WITH GENERATIVE ADVERSARIAL NETS
    Enomoto, Kenji
    Sakurada, Ken
    Wang, Weiming
    Kawaguchi, Nobuo
    Matsuoka, Masashi
    Nakamura, Ryosuke
    IGARSS 2018 - 2018 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2018, : 1752 - 1755
  • [49] CONVOLUTIONAL NEURAL NETWORK-BASED FRACTAL CODING METHOD FOR IMAGE TRANSLATION IN MULTIMODAL CHANGE DETECTION
    Radoi, Anamaria
    Unsalan, Melisa
    2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022, : 1063 - 1066
  • [50] SAR2EO: A High-Resolution Image Translation Framework with Denoising Enhancement
    Du, Shenshen
    Yu, Jun
    Xie, Guochen
    Lu, Renjie
    Li, Pengwei
    Cai, Zhongpeng
    Lu, Keda
    ADVANCES IN ARTIFICIAL INTELLIGENCE, AI 2023, PT I, 2024, 14471 : 91 - 102