Spectral normalization and dual contrastive regularization for image-to-image translation

被引:6
作者
Zhao, Chen [1 ]
Cai, Wei-Ling [1 ]
Yuan, Zheng [1 ]
机构
[1] Nanjing Normal Univ, Dept Comp Sci & Technol, Nanjing, Peoples R China
基金
中国国家自然科学基金;
关键词
Image-to-image translation; Contrastive learning; Generative adversarial network;
D O I
10.1007/s00371-024-03314-5
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Existing image-to-image (I2I) translation methods achieve state-of-the-art performance by incorporating the patch-wise contrastive learning into generative adversarial networks. However, patch-wise contrastive learning only focuses on the local content similarity but neglects the global structure constraint, which affects the quality of the generated images. In this paper, we propose a new unpaired I2I translation framework based on dual contrastive regularization and spectral normalization, namely SN-DCR. To maintain consistency of the global structure and texture, we design the dual contrastive regularization using different deep feature spaces respectively. In order to improve the global structure information of the generated images, we formulate a semantic contrastive loss to make the global semantic structure of the generated images similar to the real images from the target domain in the semantic feature space. We use gram matrices to extract the style of texture from images. Similarly, we design a style contrastive loss to improve the global texture information of the generated images. Moreover, to enhance the stability of the model, we employ the spectral normalized convolutional network in the design of our generator. We conduct comprehensive experiments to evaluate the effectiveness of SN-DCR, and the results prove that our method achieves SOTA in multiple tasks. The code and pretrained models are available at https://github.com/zhihefang/SN-DCR.
引用
收藏
页码:129 / 140
页数:12
相关论文
共 43 条
  • [1] Benaim S, 2017, ADV NEUR IN, V30
  • [2] Caron M., 2020, NEURAL INF PROCESS S
  • [3] Chang Y., 2023, IEEE Trans. Pattern Anal. Mach. Intell.
  • [4] Chen T, 2020, PMLR, V119, P1597
  • [5] Exploring Simple Siamese Representation Learning
    Chen, Xinlei
    He, Kaiming
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 15745 - 15753
  • [6] Choi Y, 2020, PROC CVPR IEEE, P8185, DOI 10.1109/CVPR42600.2020.00821
  • [7] Fu H, 2019, PROC CVPR IEEE, P2422, DOI [10.1109/CVPR.2019.00253, 10.1109/cvpr.2019.00253]
  • [8] Image Style Transfer Using Convolutional Neural Networks
    Gatys, Leon A.
    Ecker, Alexander S.
    Bethge, Matthias
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 2414 - 2423
  • [9] Generative Adversarial Networks
    Goodfellow, Ian
    Pouget-Abadie, Jean
    Mirza, Mehdi
    Xu, Bing
    Warde-Farley, David
    Ozair, Sherjil
    Courville, Aaron
    Bengio, Yoshua
    [J]. COMMUNICATIONS OF THE ACM, 2020, 63 (11) : 139 - 144
  • [10] Multi-feature contrastive learning for unpaired image-to-image translation
    Gou, Yao
    Li, Min
    Song, Yu
    He, Yujie
    Wang, Litao
    [J]. COMPLEX & INTELLIGENT SYSTEMS, 2023, 9 (04) : 4111 - 4122