Multilevel contrast strategy for unpaired image-to-image translation

Times cited: 1

Authors:

Han, Minggui [1]

Shao, Mingwen [1]

Meng, Lingzhuang [1]

Liu, Yuexian [1]

Qiao, Yuanjian [1]

Affiliations:

[1] China Univ Petr, Qingdao Inst Software, Coll Comp Sci & Technol, Qingdao, Peoples R China

Source:

JOURNAL OF ELECTRONIC IMAGING | 2023 / Vol. 32 / Issue 06

Funding:

National Natural Science Foundation of China;

Keywords:

image-to-image translation; contrastive learning; multilevel contrast strategy; GENERATIVE ADVERSARIAL NETWORK; REPRESENTATION;

DOI:

10.1117/1.JEI.32.6.063030

Chinese Library Classification (CLC) number:

TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology];

Discipline classification code:

0808 ; 0809 ;

Abstract:

Contrastive learning for unpaired image-to-image translation utilizes adversarial loss to ensure the realism of generated images in the target domain and incorporates pixel-wise contrastive loss to maximize the correlation between them. However, existing methods only contrast pixel-wise features, ignoring higher-level features, and the pixel-wise contrast is imperfect, which leads to poorer perceptual and visual results. In order to alleviate these problems, we propose an effective multilevel contrast strategy for unpaired image-to-image translation (MLCUT), which contrasts features at three levels to generate more harmonious and realistic images. Specifically, we strengthen the pixel-wise level contrast and introduce the contrasts of plane and voxel-wise levels. On the one hand, MLCUT enhances training effectiveness by picking over hard negative keys for each query at the pixel-wise level. On the other hand, we strengthen the learning preferences of generators on features of objects rather than backgrounds by contrasting the plane-wise discriminative matrices in adversarial loss. Furthermore, by contrasting voxel-wise global semantic vectors, MLCUT effectively improves the realism of generated images and avoids mode collapse. Qualitative and quantitative experiments demonstrate that our method effectively improves performance in perception and vision.
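The abstract does not give the loss formulation, so the following is only a minimal sketch of the general idea behind pixel-wise contrast with hard negative keys. It assumes an InfoNCE-style loss, as commonly used in contrastive image-to-image translation, and assumes that "hard" negatives are the k keys most similar to the query. The function names `cosine` and `info_nce_hard`, the parameter `k`, and the temperature `tau` are hypothetical and are not taken from the paper.

```python
import math


def cosine(u, v):
    """Cosine similarity between two feature vectors given as lists of floats."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def info_nce_hard(query, positive, negatives, k=2, tau=0.07):
    """InfoNCE-style loss for one query, keeping only the k hardest negatives.

    Assumption: a negative is "hard" when its similarity to the query is high,
    so the k most similar negatives are kept and the rest are discarded.
    """
    pos_logit = cosine(query, positive) / tau
    neg_logits = sorted(
        (cosine(query, n) / tau for n in negatives), reverse=True
    )[:k]
    logits = [pos_logit] + neg_logits
    # Numerically stable log-sum-exp over the positive and selected negatives.
    m = max(logits)
    log_sum = m + math.log(sum(math.exp(x - m) for x in logits))
    # Negative log-probability of picking the positive key.
    return log_sum - pos_logit


query = [1.0, 0.0]
positive = [1.0, 0.0]
negatives = [[0.0, 1.0], [-1.0, 0.0], [0.9, 0.1]]

loss_hard = info_nce_hard(query, positive, negatives, k=1)
loss_easy = info_nce_hard(query, positive, [[0.0, 1.0]], k=1)
```

In this toy case the negative `[0.9, 0.1]` is nearly parallel to the query, so selecting it yields a larger loss than the orthogonal negative does, which is why hard negatives provide a stronger training signal under these assumptions.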

Pages: 18
