Asymmetric slack contrastive learning for full use of feature information in image translation

Cited by: 0
Authors
Zhang, Yusen [1]
Li, Min [1]
Gou, Yao [1]
He, Yujie [1]
Affiliations
[1] Xian Res Inst Hitech, Xian 710025, Shaanxi, Peoples R China
Keywords
Image translation; Cross-domain learning; Asymmetric slack contrast; Contrastive learning; Structure consistency;
DOI
10.1016/j.knosys.2024.112136
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
Recently, contrastive learning has been proven to be powerful in cross-domain feature learning and has been widely used in image translation tasks. However, these methods often overlook the differences between positive and negative samples regarding model optimization ability and treat them equally. This weakens the feature representation ability of the generative models. In this paper, we propose a novel image translation model based on asymmetric slack contrastive learning. We design a new asymmetric contrastive loss by introducing a slack adjustment factor. Theoretical analysis shows that it can adaptively optimize and adjust according to different positive and negative samples and significantly improve optimization efficiency. In addition, to better preserve local structural relationships during image translation, we construct a regional differential structural consistency correction block using differential vectors. Comparative experiments were conducted against seven existing methods on five datasets. The results indicate that our method can maintain structural consistency between cross-domain images at a deeper level. Furthermore, it is more effective in establishing real image-domain mapping relations, resulting in higher-quality generated images.
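For context, the symmetric baseline that the abstract argues against can be sketched as the standard InfoNCE contrastive loss, in which the positive and every negative pass through the same temperature-scaled softmax. This is only an illustration of that baseline, not the paper's asymmetric slack loss, whose formula and slack adjustment factor are not given in this record. The function name `info_nce`, the parameter `tau`, and its default value are assumptions made for the example.

```python
import math


def info_nce(query, positive, negatives, tau=0.07):
    """Standard (symmetric) InfoNCE loss for a single query feature.

    query, positive: feature vectors given as lists of floats.
    negatives: list of feature vectors.
    tau: temperature; 0.07 is an assumed, commonly used default.
    """
    def normalize(v):
        norm = math.sqrt(sum(x * x for x in v))
        return [x / norm for x in v]

    def dot(a, b):
        return sum(x * y for x, y in zip(a, b))

    q = normalize(query)
    # The positive and all negatives are scaled by the same temperature,
    # i.e. they are treated equally by the loss.
    logits = [dot(q, normalize(positive)) / tau]
    logits += [dot(q, normalize(n)) / tau for n in negatives]

    # Log-sum-exp with max subtraction for numerical stability.
    m = max(logits)
    log_denominator = m + math.log(sum(math.exp(l - m) for l in logits))

    # Cross-entropy with the positive sample at index 0.
    return log_denominator - logits[0]
```

The loss is close to zero when the query matches its positive and is dissimilar to the negatives, and it grows as a negative becomes more similar to the query than the positive is.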
Pages: 14
Related papers
47 in total
  • [1] HyperStyle: StyleGAN Inversion with HyperNetworks for Real Image Editing
    Alaluf, Yuval
    Tov, Omer
    Mokady, Ron
    Gal, Rinon
    Bermano, Amit
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 18490 - 18500
  • [2] DualAST: Dual Style-Learning Networks for Artistic Style Transfer
    Chen, Haibo
    Zhao, Lei
    Wang, Zhizhong
    Zhang, Huiming
    Zuo, Zhiwen
    Li, Ailin
    Xing, Wei
    Lu, Dongming
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 872 - 881
  • [3] Chen JY, 2021, Arxiv, DOI arXiv:2107.01152
  • [4] Chen T, 2020, PR MACH LEARN RES, V119
  • [5] Choi Y, 2020, PROC CVPR IEEE, P8185, DOI 10.1109/CVPR42600.2020.00821
  • [6] The Cityscapes Dataset for Semantic Urban Scene Understanding
    Cordts, Marius
    Omran, Mohamed
    Ramos, Sebastian
    Rehfeld, Timo
    Enzweiler, Markus
    Benenson, Rodrigo
    Franke, Uwe
    Roth, Stefan
    Schiele, Bernt
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 3213 - 3223
  • [7] Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
  • [8] Multi-feature contrastive learning for unpaired image-to-image translation
    Gou, Yao
    Li, Min
    Song, Yu
    He, Yujie
    Wang, Litao
    [J]. COMPLEX & INTELLIGENT SYSTEMS, 2023, 9 (04) : 4111 - 4122
  • [9] Gutmann Michael, 2010, P 13 INT C ART INT S, P297
  • [10] Dual Contrastive Learning for Unsupervised Image-to-Image Translation
    Han, Junlin
    Shoeiby, Mehrdad
    Petersson, Lars
    Armin, Mohammad Ali
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 746 - 755