Enhanced Unpaired Image-to-Image Translation via Transformation in Saliency Domain

被引:0
|
作者
Shibasaki, Kei [1 ]
Ikehara, Masaaki [1 ]
机构
[1] Keio Univ, Fac Sci & Technol, Dept Elect & Informat Engn, Yokohama, Kanagawa 2238522, Japan
来源
IEEE ACCESS | 2023年 / 11卷
关键词
Computer vision; Generative adversarial networks; Saliency detection; deep learning; unpaired image to image translation; generative adversarial networks; saliency domain;
D O I
10.1109/ACCESS.2023.3338629
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Unpaired image to image translation is the task of converting images in unpaired datasets. The primary goal of the task is to translate a source image into the image aligned with the target domain while keeping the fundamental content. Existing researches have introduced effective techniques to translate images with unpaired datasets, focusing on preserving the fundamental content. However, these techniques have limitations in dealing with significant shape changes and preserving backgrounds that should not be transformed. The proposed method attempts to address these problems by utilizing the saliency domain for translation and simultaneously learning the translation in the saliency domain as well as in the image domain. The saliency domain represents the shape and position of the main object. The explicit learning of transformations within the saliency domain improves network's ability to transform shapes while maintaining the background. Experimental results show that the proposed method successfully addresses the problems of unpaired image to image translation and achieves competitive metrics with existing methods.
引用
收藏
页码:137495 / 137505
页数:11
相关论文
共 50 条
  • [21] VAE-CoGAN: Unpaired image-to-image translation for low-level vision
    Juan Zhang
    Xiaoqi Lang
    Bo Huang
    Xiaoyan Jiang
    Signal, Image and Video Processing, 2023, 17 : 1019 - 1026
  • [22] CoPrGAN: Image-to-Image Translation via Content Preservation
    Yu, Xiaoming
    Zhou, Gan
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT III, 2022, 13531 : 37 - 49
  • [23] Unsupervised image-to-image translation using intra-domain reconstruction loss
    Fan, Yuan
    Shao, Mingwen
    Zuo, Wangmeng
    Li, Qingyun
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2020, 11 (09) : 2077 - 2088
  • [24] Unsupervised image-to-image translation using intra-domain reconstruction loss
    Yuan Fan
    Mingwen Shao
    Wangmeng Zuo
    Qingyun Li
    International Journal of Machine Learning and Cybernetics, 2020, 11 : 2077 - 2088
  • [25] DMDIT: Diverse multi-domain image-to-image translation
    Shao, Mingwen
    Zhang, Youcai
    Liu, Huan
    Wang, Chao
    Li, Le
    Shao, Xun
    KNOWLEDGE-BASED SYSTEMS, 2021, 229
  • [26] Semantically Consistent Image-to-Image Translation for Unsupervised Domain Adaptation
    Brehm, Stephan
    Scherer, Sebastian
    Lienhart, Rainer
    ICAART: PROCEEDINGS OF THE 14TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE - VOL 2, 2022, : 131 - 141
  • [27] SEMANTIC-AWARE UNPAIRED IMAGE-TO-IMAGE TRANSLATION FOR URBAN SCENE IMAGES
    Li, Zongyao
    Togo, Ren
    Ogawa, Takahiro
    Haseyama, Miki
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 2150 - 2154
  • [28] Hypercomplex Image-to-Image Translation
    Grassucci, Eleonora
    Sigillo, Luigi
    Uncini, Aurelio
    Comminiello, Danilo
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [29] Unsupervised natural scene image-to-image translation via independence-enhanced latent space
    Zhigang Yang
    Lin Hou
    Ye Bai
    Siyuan Cang
    Yuanlan Yang
    Tao Chen
    Signal, Image and Video Processing, 2025, 19 (8)
  • [30] Co-DGAN: cooperating discriminator generative adversarial networks for unpaired image-to-image translation
    Liu, Huajun
    Wang, Ziyan
    Sui, Haigang
    Zhu, Qing
    Liu, Shubo
    Chen, Lei
    Lei, Dian
    Li, Chao
    SOFT COMPUTING, 2021, 25 (06) : 4653 - 4662