Spectral normalization and dual contrastive regularization for image-to-image translation

Cited by: 6

Authors:

Zhao, Chen [1]

Cai, Wei-Ling [1]

Yuan, Zheng [1]

Affiliations:

[1] Nanjing Normal Univ, Dept Comp Sci & Technol, Nanjing, Peoples R China

Source:

VISUAL COMPUTER | 2025 / Vol. 41 / Issue 01

Funding:

National Natural Science Foundation of China;

Keywords:

Image-to-image translation; Contrastive learning; Generative adversarial network;

DOI:

10.1007/s00371-024-03314-5

Chinese Library Classification (CLC) number:

TP31 [Computer software];

Discipline classification code:

081202 ; 0835 ;

Abstract:

Existing image-to-image (I2I) translation methods achieve state-of-the-art performance by incorporating patch-wise contrastive learning into generative adversarial networks. However, patch-wise contrastive learning focuses only on local content similarity and neglects the global structure constraint, which degrades the quality of the generated images. In this paper, we propose a new unpaired I2I translation framework based on dual contrastive regularization and spectral normalization, named SN-DCR. To maintain consistency of the global structure and texture, we design a dual contrastive regularization that operates in two different deep feature spaces. To improve the global structure of the generated images, we formulate a semantic contrastive loss that makes their global semantic structure similar to that of real images from the target domain in the semantic feature space. To improve the global texture of the generated images, we use Gram matrices to extract texture style from images and design a corresponding style contrastive loss. Moreover, to enhance the stability of the model, we employ a spectrally normalized convolutional network in the design of our generator. We conduct comprehensive experiments to evaluate the effectiveness of SN-DCR, and the results show that our method achieves state-of-the-art (SOTA) performance on multiple tasks. The code and pretrained models are available at https://github.com/zhihefang/SN-DCR.
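The abstract names two standard building blocks, Gram matrices for texture style and spectral normalization for stability. The sketch below illustrates both in plain Python and is not taken from the authors' code; the linked repository remains the authoritative implementation. The function names (`gram_matrix`, `spectral_norm`, `spectrally_normalize`), the normalization of the Gram matrix by C × N, and the use of power iteration on a 2-D weight are assumptions made for illustration.

```python
import math


def gram_matrix(features):
    """Gram matrix of a feature map given as C channel rows of N values each.

    Dividing by C * N is one common convention (an assumption here).
    """
    c = len(features)
    n = len(features[0])
    return [
        [sum(features[i][k] * features[j][k] for k in range(n)) / (c * n)
         for j in range(c)]
        for i in range(c)
    ]


def spectral_norm(weight, iters=50):
    """Estimate the largest singular value of a 2-D weight by power iteration."""
    rows, cols = len(weight), len(weight[0])
    v = [1.0 / math.sqrt(cols)] * cols  # unit-length starting vector
    sigma = 0.0
    for _ in range(iters):
        # u = W v; its length is the current estimate of the spectral norm
        u = [sum(weight[i][j] * v[j] for j in range(cols)) for i in range(rows)]
        sigma = math.sqrt(sum(x * x for x in u))
        if sigma == 0.0:
            return 0.0
        u = [x / sigma for x in u]
        # v = W^T u, renormalized to unit length
        v = [sum(weight[i][j] * u[i] for i in range(rows)) for j in range(cols)]
        v_norm = math.sqrt(sum(x * x for x in v))
        if v_norm == 0.0:
            return 0.0
        v = [x / v_norm for x in v]
    return sigma


def spectrally_normalize(weight):
    """Rescale a weight so that its largest singular value is about 1."""
    sigma = spectral_norm(weight)
    if sigma == 0.0:
        return [row[:] for row in weight]
    return [[x / sigma for x in row] for row in weight]
```

In a deep learning framework the same idea is typically applied per layer to the reshaped convolution kernel, and Gram matrices are computed on feature maps from a pretrained network; those details are not specified in this record.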


Pages: 129-140

Page count: 12
