Multimodal image-to-image translation between domains with high internal variability

被引：6

作者：

Wang, Jian ^{[1
]}

Lv, Jiancheng ^{[1
]}

Yang, Xue ^{[1
]}

Tang, Chenwei ^{[1
]}

Peng, Xi ^{[1
]}

机构：

[1] Sichuan Univ, Coll Comp Sci, Chengdu 610065, Peoples R China

来源：

SOFT COMPUTING | 2020年 / 24卷 / 23期

基金：

美国国家科学基金会; 国家重点研发计划;

关键词：

GANs; Image translation; High internal variability; Catastrophic forgetting; Generator regulating;

D O I：

10.1007/s00500-020-05073-6

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Multimodal image-to-image translation based on generative adversarial networks (GANs) shows suboptimal performance in the visual domains with high internal variability, e.g., translation from multiple breeds of cats to multiple breeds of dogs. To alleviate this problem, we recast the training procedure as modeling distinct distributions which are observed sequentially, for example, when different classes are encountered over time. As a result, the discriminator may forget about the previous target distributions, known as catastrophic forgetting, leading to non-/slow convergence. Through experimental observation, we found that the discriminator does not always forget the previously learned distributions during training. Therefore, we propose a novel generator regulating GAN (GR-GAN). The proposed method encourages the discriminator to teach the generator more effectively when it remembers more of the previously learned distributions, while discouraging the discriminator to guide the generator when catastrophic forgetting happens on the discriminator. Both qualitative and quantitative results show that the proposed method is significantly superior to the state-of-the-art methods in handling the image data that are with high variability.

引用

页码：18173 / 18184

页数：12

共 50 条

[31] InstaFormer plus plus : Multi-Domain Instance-Aware Image-to-Image Translation with Transformer
Kim, Soohyun
Baek, Jongbeom
Park, Jihye
Ha, Eunjae
Jung, Homin
Lee, Taeyoung
Kim, Seungryong
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, 132 (04) : 1167 - 1186
[32] Fine-grained facial image-to-image translation with an attention based pipeline generative adversarial framework
Yan Zhao
Ziqiang Zheng
Chao Wang
Zhaorui Gu
Min Fu
Zhibin Yu
Haiyong Zheng
Nan Wang
Bing Zheng
Multimedia Tools and Applications, 2020, 79 : 14981 - 15000
[33] SingleGAN: Image-to-Image Translation by a Single-Generator Network Using Multiple Generative Adversarial Learning
Yu, Xiaoming
Cai, Xing
Ying, Zhenqiang
Li, Thomas
Li, Ge
COMPUTER VISION - ACCV 2018, PT V, 2019, 11365 : 341 - 356
[34] Fine-grained facial image-to-image translation with an attention based pipeline generative adversarial framework
Zhao, Yan
Zheng, Ziqiang
Wang, Chao
Gu, Zhaorui
Fu, Min
Yu, Zhibin
Zheng, Haiyong
Wang, Nan
Zheng, Bing
MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (21-22) : 14981 - 15000
[35] Adaptive Multi-scale Information Fusion Based on Dynamic Receptive Field for Image-to-image Translation
Yin Mengxiao
Lin Zhenfeng
Yang Feng
JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2021, 43 (08) : 2386 - 2394
[36] Cross-Modality LGE-CMR Segmentation Using Image-to-Image Translation Based Data Augmentation
Wang, Wei
Yu, Xinhua
Fang, Bo
Zhao, Yue
Chen, Yongyong
Wei, Wei
Chen, Junxin
IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2023, 20 (04) : 2367 - 2375
[37] SEMI2I: SEMANTICALLY CONSISTENT IMAGE-TO-IMAGE TRANSLATION FOR DOMAIN ADAPTATION OF REMOTE SENSING DATA
Tasar, Onur
Happy, S. L.
Tarabalka, Yuliya
Alliez, Pierre
IGARSS 2020 - 2020 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2020, : 1837 - 1840
[38] W2GAN: Importance Weight and Wavelet feature guided Image-to-Image translation under limited data
Yang, Qiuxia
Pu, Yuanyuan
Zhao, Zhengpeng
Xu, Dan
Li, Siqi
COMPUTERS & GRAPHICS-UK, 2023, 116 : 115 - 127
[39] OSAGGAN: one-shot unsupervised image-to-image translation using attention-guided generative adversarial networks
Xiaofei Huo
Bin Jiang
Haotian Hu
Xinjiao Zhou
Bolin Zhang
International Journal of Machine Learning and Cybernetics, 2023, 14 : 3471 - 3482
[40] Unsupervised Object-Level Image-to-Image Translation Using Positional Attention Bi-Flow Generative Network
Yuan, Liuchun
Chen, Dihu
Hu, Haifeng
IEEE ACCESS, 2019, 7 : 30637 - 30647

← 1 2 3 4 5 →