Multimodal image-to-image translation between domains with high internal variability

被引：6

作者：

Wang, Jian ^{[1
]}

Lv, Jiancheng ^{[1
]}

Yang, Xue ^{[1
]}

Tang, Chenwei ^{[1
]}

Peng, Xi ^{[1
]}

机构：

[1] Sichuan Univ, Coll Comp Sci, Chengdu 610065, Peoples R China

来源：

SOFT COMPUTING | 2020年 / 24卷 / 23期

基金：

美国国家科学基金会; 国家重点研发计划;

关键词：

GANs; Image translation; High internal variability; Catastrophic forgetting; Generator regulating;

D O I：

10.1007/s00500-020-05073-6

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Multimodal image-to-image translation based on generative adversarial networks (GANs) shows suboptimal performance in the visual domains with high internal variability, e.g., translation from multiple breeds of cats to multiple breeds of dogs. To alleviate this problem, we recast the training procedure as modeling distinct distributions which are observed sequentially, for example, when different classes are encountered over time. As a result, the discriminator may forget about the previous target distributions, known as catastrophic forgetting, leading to non-/slow convergence. Through experimental observation, we found that the discriminator does not always forget the previously learned distributions during training. Therefore, we propose a novel generator regulating GAN (GR-GAN). The proposed method encourages the discriminator to teach the generator more effectively when it remembers more of the previously learned distributions, while discouraging the discriminator to guide the generator when catastrophic forgetting happens on the discriminator. Both qualitative and quantitative results show that the proposed method is significantly superior to the state-of-the-art methods in handling the image data that are with high variability.

引用

页码：18173 / 18184

页数：12

共 50 条

[41] OSAGGAN: one-shot unsupervised image-to-image translation using attention-guided generative adversarial networks
Huo, Xiaofei
Jiang, Bin
Hu, Haotian
Zhou, Xinjiao
Zhang, Bolin
INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2023, 14 (10) : 3471 - 3482
[42] Unsupervised Domain Adaptation for the Semantic Segmentation of Remote Sensing Images via One-Shot Image-to-Image Translation
Ismael, Sarmad F.
Kayabol, Koray
Aptoula, Erchan
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20
[43] SEMI-SUPERVISED MULTIMODAL IMAGE TRANSLATION FOR MISSING MODALITY IMPUTATION
Sun, Wangbin
Ma, Fei
Li, Yang
Huang, Shao-Lun
Ni, Shiguang
Zhang, Lin
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 4320 - 4324
[44] Multimodal Image Translation Algorithm Based on Singular Squeeze-and-Excitation Network
Tu, Hangyao
Wang, Zheng
Zhao, Yanwei
MATHEMATICS, 2025, 13 (01)
[45] Polarized Image Translation From Nonpolarized Cameras for Multimodal Face Anti-Spoofing
Tian, Yu
Huang, Yalin
Zhang, Kunbo
Liu, Yue
Sun, Zhenan
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2023, 18 : 5651 - 5664
[46] Multitask learning for image translation and salient object detection from multimodal remote sensing images
Lian, Yuanfeng
Shi, Xu
Shen, ShaoChen
Hua, Jing
VISUAL COMPUTER, 2024, 40 (03) : 1395 - 1414
[47] Multitask learning for image translation and salient object detection from multimodal remote sensing images
Yuanfeng Lian
Xu Shi
ShaoChen Shen
Jing Hua
The Visual Computer, 2024, 40 : 1395 - 1414
[48] IMAGE TRANSLATION BETWEEN SAR AND OPTICAL IMAGERY WITH GENERATIVE ADVERSARIAL NETS
Enomoto, Kenji
Sakurada, Ken
Wang, Weiming
Kawaguchi, Nobuo
Matsuoka, Masashi
Nakamura, Ryosuke
IGARSS 2018 - 2018 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2018, : 1752 - 1755
[49] CONVOLUTIONAL NEURAL NETWORK-BASED FRACTAL CODING METHOD FOR IMAGE TRANSLATION IN MULTIMODAL CHANGE DETECTION
Radoi, Anamaria
Unsalan, Melisa
2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022, : 1063 - 1066
[50] SAR2EO: A High-Resolution Image Translation Framework with Denoising Enhancement
Du, Shenshen
Yu, Jun
Xie, Guochen
Lu, Renjie
Li, Pengwei
Cai, Zhongpeng
Lu, Keda
ADVANCES IN ARTIFICIAL INTELLIGENCE, AI 2023, PT I, 2024, 14471 : 91 - 102

← 1 2 3 4 5 →