Multimodal image-to-image translation between domains with high internal variability

被引：6

作者：

Wang, Jian ^{[1
]}

Lv, Jiancheng ^{[1
]}

Yang, Xue ^{[1
]}

Tang, Chenwei ^{[1
]}

Peng, Xi ^{[1
]}

机构：

[1] Sichuan Univ, Coll Comp Sci, Chengdu 610065, Peoples R China

来源：

SOFT COMPUTING | 2020年 / 24卷 / 23期

基金：

美国国家科学基金会; 国家重点研发计划;

关键词：

GANs; Image translation; High internal variability; Catastrophic forgetting; Generator regulating;

D O I：

10.1007/s00500-020-05073-6

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Multimodal image-to-image translation based on generative adversarial networks (GANs) shows suboptimal performance in the visual domains with high internal variability, e.g., translation from multiple breeds of cats to multiple breeds of dogs. To alleviate this problem, we recast the training procedure as modeling distinct distributions which are observed sequentially, for example, when different classes are encountered over time. As a result, the discriminator may forget about the previous target distributions, known as catastrophic forgetting, leading to non-/slow convergence. Through experimental observation, we found that the discriminator does not always forget the previously learned distributions during training. Therefore, we propose a novel generator regulating GAN (GR-GAN). The proposed method encourages the discriminator to teach the generator more effectively when it remembers more of the previously learned distributions, while discouraging the discriminator to guide the generator when catastrophic forgetting happens on the discriminator. Both qualitative and quantitative results show that the proposed method is significantly superior to the state-of-the-art methods in handling the image data that are with high variability.

引用

页码：18173 / 18184

页数：12

共 50 条

[21] Attention-Guided Generative Adversarial Networks for Unsupervised Image-to-Image Translation
Tang, Hao
Xu, Dan
Sebel, Nicu
Yan, Yan
2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
[22] Multi-Channel Attention Selection GANs for Guided Image-to-Image Translation
Tang, Hao
Torr, Philip H. S.
Sebe, Nicu
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (05) : 6055 - 6071
[23] Cross-Granularity Learning for Multi-Domain Image-to-Image Translation
Fu, Huiyuan
Yu, Ting
Wang, Xin
Ma, Huadong
MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 3099 - 3107
[24] Multi-Domain Image-to-Image Translation via a Unified Circular Framework
Wang, Yuxi
Zhang, Zhaoxiang
Hao, Wangli
Song, Chunfeng
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 670 - 684
[25] Cycle consistent twin energy-based models for image-to-image translation
Tiwary, Piyush
Bhattacharyya, Kinjawl
Prathosh, A. P.
MEDICAL IMAGE ANALYSIS, 2024, 91
[26] Self-attention StarGAN for Multi-domain Image-to-Image Translation
He, Ziliang
Yang, Zhenguo
Mao, Xudong
Lv, Jianming
Li, Qing
Liu, Wenyin
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2019: IMAGE PROCESSING, PT III, 2019, 11729 : 537 - 549
[27] DuCaGAN: Unified Dual Capsule Generative Adversarial Network for Unsupervised Image-to-Image Translation
Shao, Guifang
Huang, Meng
Gao, Fengqiang
Liu, Tundong
Li, Liduan
IEEE ACCESS, 2020, 8 : 154691 - 154707
[28] Generating Large Labeled Data Sets for Laparoscopic Image Processing Tasks Using Unpaired Image-to-Image Translation
Pfeiffer, Micha
Funke, Isabel
Robu, Maria R.
Bodenstedt, Sebastian
Strenger, Leon
Engelhardt, Sandy
Ross, Tobias
Clarkson, Matthew J.
Gurusamy, Kurinchi
Davidson, Brian R.
Maier-Hein, Lena
Riediger, Carina
Welsch, Thilo
Weitz, Juergen
Speidel, Stefanie
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2019, PT V, 2019, 11768 : 119 - 127
[29] Image-to-Image Translation Between Tau Pathology and Neuronal Metabolism PET in Alzheimer Disease with Multi-domain Contrastive Learning
Duong, Michael Tran
Das, Sandhitsu R.
Khandelwal, Pulkit
Lyu, Xueying
Xie, Long
Yushkevich, Paul A.
Wolk, David A.
Nasrallah, Ilya M.
MACHINE LEARNING IN CLINICAL NEUROIMAGING, MLCN 2023, 2023, 14312 : 3 - 13
[30] InstaFormer++: Multi-Domain Instance-Aware Image-to-Image Translation with Transformer
Soohyun Kim
Jongbeom Baek
Jihye Park
Eunjae Ha
Homin Jung
Taeyoung Lee
Seungryong Kim
International Journal of Computer Vision, 2024, 132 : 1167 - 1186

← 1 2 3 4 5 →