Multilevel contrast strategy for unpaired image-to-image translation

被引:1
作者
Han, Minggui [1 ]
Shao, Mingwen [1 ]
Meng, Lingzhuang [1 ]
Liu, Yuexian [1 ]
Qiao, Yuanjian [1 ]
机构
[1] China Univ Petr, Qingdao Inst Software, Coll Comp Sci & Technol, Qingdao, Peoples R China
基金
中国国家自然科学基金;
关键词
image-to-image translation; contrastive learning; multilevel contrast strategy; GENERATIVE ADVERSARIAL NETWORK; REPRESENTATION;
D O I
10.1117/1.JEI.32.6.063030
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Contrastive learning for unpaired image-to-image translation utilizes adversarial loss to ensure the realism of generated images in the target domain and incorporates pixel-wise contrastive loss to maximize the correlation between them. However, existing methods only contrast pixel-wise features, ignoring higher-level features, and the pixel-wise contrast is imperfect, which leads to poorer perceptual and visual results. In order to alleviate these problems, we propose an effective multilevel contrast strategy for unpaired image-to-image translation (MLCUT), which contrasts features at three levels to generate more harmonious and realistic images. Specifically, we strengthen the pixel-wise level contrast and introduce the contrasts of plane and voxel-wise levels. On the one hand, MLCUT enhances training effectiveness by picking over hard negative keys for each query at the pixel-wise level. On the other hand, we strengthen the learning preferences of generators on features of objects rather than backgrounds by contrasting the plane-wise discriminative matrices in adversarial loss. Furthermore, by contrasting voxel-wise global semantic vectors, MLCUT effectively improves the realism of generated images and avoids mode collapse. Qualitative and quantitative experiments demonstrate that our method effectively improves performance in perception and vision.
引用
收藏
页数:18
相关论文
共 50 条
  • [31] Image-to-Image Translation: Methods and Applications
    Pang, Yingxue
    Lin, Jianxin
    Qin, Tao
    Chen, Zhibo
    IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 3859 - 3881
  • [32] Multimodal Unsupervised Image-to-Image Translation
    Huang, Xun
    Liu, Ming-Yu
    Belongie, Serge
    Kautz, Jan
    COMPUTER VISION - ECCV 2018, PT III, 2018, 11207 : 179 - 196
  • [33] Hierarchical image-to-image translation with nested distributions modeling
    Qiao, Shishi
    Wang, Ruiping
    Shan, Shiguang
    Chen, Xilin
    PATTERN RECOGNITION, 2024, 146
  • [34] A novel framework for image-to-image translation and image compression
    Yang, Fei
    Wang, Yaxing
    Herranz, Luis
    Cheng, Yongmei
    Mozerov, Mikhail G.
    NEUROCOMPUTING, 2022, 508 : 58 - 70
  • [35] Guided Image Weathering using Image-to-Image Translation
    Chen, Yu
    Shen, I-Chao
    Chen, Bing-Yu
    PROCEEDINGS OF SIGGRAPH ASIA 2021 TECHNICAL COMMUNICATIONS, 2021,
  • [36] Enhancing Image-to-Image Translation with Contrast Loss Constrained Generators and Selective Neighborhood Sampling
    Li, Meifang
    Wang, Xiaoru
    Bian, Dexin
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT XI, 2024, 14435 : 415 - 426
  • [37] Towards semantically continuous unpaired image-to-image translation via margin adaptive contrastive learning and wavelet transform
    Zhang, Heng
    Yang, Yi-Jun
    Zeng, Wei
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 252
  • [38] Spatial-Intensity Transforms for Medical Image-to-Image Translation
    Wang, Clinton J.
    Rost, Natalia S.
    Golland, Polina
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2023, 42 (11) : 3362 - 3373
  • [39] Unsupervised Exemplar-Domain Aware Image-to-Image Translation
    Fu, Yuanbin
    Ma, Jiayi
    Guo, Xiaojie
    ENTROPY, 2021, 23 (05)
  • [40] Image-to-Image Translation Based Face De-Occlusion
    Maharjan, Rahul S.
    Din, Nizam Ud
    Yi, Juneho
    TWELFTH INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING (ICDIP 2020), 2020, 11519