Multimodal image-to-image translation between domains with high internal variability

被引：6

作者：

Wang, Jian ^{[1
]}

Lv, Jiancheng ^{[1
]}

Yang, Xue ^{[1
]}

Tang, Chenwei ^{[1
]}

Peng, Xi ^{[1
]}

机构：

[1] Sichuan Univ, Coll Comp Sci, Chengdu 610065, Peoples R China

来源：

SOFT COMPUTING | 2020年 / 24卷 / 23期

基金：

美国国家科学基金会; 国家重点研发计划;

关键词：

GANs; Image translation; High internal variability; Catastrophic forgetting; Generator regulating;

D O I：

10.1007/s00500-020-05073-6

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Multimodal image-to-image translation based on generative adversarial networks (GANs) shows suboptimal performance in the visual domains with high internal variability, e.g., translation from multiple breeds of cats to multiple breeds of dogs. To alleviate this problem, we recast the training procedure as modeling distinct distributions which are observed sequentially, for example, when different classes are encountered over time. As a result, the discriminator may forget about the previous target distributions, known as catastrophic forgetting, leading to non-/slow convergence. Through experimental observation, we found that the discriminator does not always forget the previously learned distributions during training. Therefore, we propose a novel generator regulating GAN (GR-GAN). The proposed method encourages the discriminator to teach the generator more effectively when it remembers more of the previously learned distributions, while discouraging the discriminator to guide the generator when catastrophic forgetting happens on the discriminator. Both qualitative and quantitative results show that the proposed method is significantly superior to the state-of-the-art methods in handling the image data that are with high variability.

引用

页码：18173 / 18184

页数：12

共 50 条

[1] Multimodal image-to-image translation between domains with high internal variability
Jian Wang
Jiancheng Lv
Xue Yang
Chenwei Tang
Xi Peng
Soft Computing, 2020, 24 : 18173 - 18184
[2] Multimodal Unsupervised Image-to-Image Translation
Huang, Xun
Liu, Ming-Yu
Belongie, Serge
Kautz, Jan
COMPUTER VISION - ECCV 2018, PT III, 2018, 11207 : 179 - 196
[3] Image-to-image translation for wavefront and PSF estimation
Smith, Jeffrey
Cranney, Jesse
Gretton, Charles
Gratadour, Damien
ADAPTIVE OPTICS SYSTEMS VIII, 2022, 12185
[4] VecGAN: Image-to-Image Translation with Interpretable Latent Directions
Dalva, Yusuf
Altindis, Said Fahri
Dundar, Aysegul
COMPUTER VISION - ECCV 2022, PT XVI, 2022, 13676 : 153 - 169
[5] Unpaired Image-to-Image Translation with Diffusion Adversarial Network
Tu, Hangyao
Wang, Zheng
Zhao, Yanwei
MATHEMATICS, 2024, 12 (20)
[6] A Diffusion Model Translator for Efficient Image-to-Image Translation
Xia, Mengfei
Zhou, Yu
Yi, Ran
Liu, Yong-Jin
Wang, Wenping
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (12) : 10272 - 10283
[7] Facial Feature Based Image-to-Image Translation Method
Kang, Shinjin
KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2020, 14 (12): : 4835 - 4848
[8] Improving Shape Deformation in Unsupervised Image-to-Image Translation
Gokaslan, Aaron
Ramanujan, Vivek
Ritchie, Daniel
Kim, Kwang In
Tompkin, James
COMPUTER VISION - ECCV 2018, PT XII, 2018, 11216 : 662 - 678
[9] Unified Generative Adversarial Networks for Controllable Image-to-Image Translation
Tang, Hao
Liu, Hong
Sebe, Nicu
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 8916 - 8929
[10] Literature Review of Generative models for Image-to-Image translation problems
Kamil, Anwar
Shaikh, Talal
PROCEEDINGS OF 2019 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND KNOWLEDGE ECONOMY (ICCIKE' 2019), 2019, : 341 - 346

← 1 2 3 4 5 →