Asymmetric slack contrastive learning for full use of feature information in image translation

被引：0

作者：

Zhang, Yusen ^{[1
]}

Li, Min ^{[1
]}

Gou, Yao ^{[1
]}

He, Yujie ^{[1
]}

机构：

[1] Xian Res Inst Hitech, Xian 710025, Shaanxi, Peoples R China

来源：

KNOWLEDGE-BASED SYSTEMS | 2024年 / 299卷

关键词：

Image translation; Cross-domain learning; Asymmetric slack contrast; Contrastive learning; Structure consistency;

D O I：

10.1016/j.knosys.2024.112136

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Recently, contrastive learning has been proven to be powerful in cross -domain feature learning and has been widely used in image translation tasks. However, these methods often overlook the differences between positive and negative samples regarding model optimization ability and treat them equally. This weakens the feature representation ability of the generative models. In this paper, we propose a novel image translation model based on asymmetric slack contrastive learning. We design a new contrastive loss asymmetrically by introducing a slack adjustment factor. Theoretical analysis shows that it can adaptively optimize and adjust according to different positive and negative samples and significantly improve optimization efficiency. In addition, to better preserve local structural relationships during image translation, we constructed a regional differential structural consistency correction block using differential vectors. Comparative experiments were conducted using seven existing methods on five datasets. The results indicate that our method can maintain structural consistency between cross -domain images at a deeper level. Furthermore, it is more effective in establishing real image -domain mapping relations, resulting in higher -quality images being generated.

引用

页数：14

共 47 条

[1] HyperStyle: Sty1eGAN Inversion with HyperNetworks for Real Image Editing
Alaluf, Yuval
Tov, Omer
Mokady, Ron
Gal, Rinon
Bermano, Amit
[J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 18490 - 18500
[2] DualAST: Dual Style-Learning Networks for Artistic Style Transfer
Chen, Haibo
Zhao, Lei
Wang, Zhizhong
Zhang, Huiming
Zuo, Zhiwen
Li, Ailin
Xing, Wei
Lu, Dongming
[J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 872 - 881
[3] Chen JY, 2021, Arxiv, DOI arXiv:2107.01152
[4] Chen T, 2020, PR MACH LEARN RES, V119
[5] Choi Y, 2020, PROC CVPR IEEE, P8185, DOI 10.1109/CVPR42600.2020.00821
[6] The Cityscapes Dataset for Semantic Urban Scene Understanding
Cordts, Marius
Omran, Mohamed
Ramos, Sebastian
Rehfeld, Timo
Enzweiler, Markus
Benenson, Rodrigo
Franke, Uwe
Roth, Stefan
Schiele, Bernt
[J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 3213 - 3223
[7] Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[8] Multi-feature contrastive learning for unpaired image-to-image translation
Gou, Yao
Li, Min
Song, Yu
He, Yujie
Wang, Litao
[J]. COMPLEX & INTELLIGENT SYSTEMS, 2023, 9 (04) : 4111 - 4122
[9] Gutmann Michael, 2010, P 13 INT C ART INT S, P297
[10] Dual Contrastive Learning for Unsupervised Image-to-Image Translation
Han, Junlin
Shoeiby, Mehrdad
Petersson, Lars
Armin, Mohammad Ali
[J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 746 - 755

← 1 2 3 4 5 →