Image-to-Image Translation With Disentangled Latent Vectors for Face Editing

被引:15
|
作者
Dalva, Yusuf [1 ]
Pehlivan, Hamza [2 ]
Hatipoglu, Oyku Irmak [3 ]
Moran, Cansu [4 ]
Dundar, Aysegul [5 ]
机构
[1] Virginia Tech, Dept Comp Sci, Blacksburg, VA 24061 USA
[2] Max Planck Inst, D-80539 Munich, Germany
[3] Swiss Fed Inst Technol, CH-8092 Zurich, Switzerland
[4] Tech Univ Munich, D-80333 Munich, Germany
[5] Bilkent Univ, Dept Comp Sci, TR-06800 Bilkent, Turkiye
关键词
Image translation; generative adversarial net works; latent space manipulation; face attribute editing;
D O I
10.1109/TPAMI.2023.3308102
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose an image-to-image translation framework for facial attribute editing with disentangled interpretable latent directions. Facial attribute editing task faces the challenges of targeted attribute editing with controllable strength and disentanglement in the representations of attributes to preserve the other attributes during edits. For this goal, inspired by the latent space factorization works of fixed pretrained GANs, we design the attribute editing by latent space factorization, and for each attribute, we learn a linear direction that is orthogonal to the others. We train these directions with orthogonality constraints and disentanglement losses. To project images to semantically organized latent spaces, we set an encoder-decoder architecture with attention-based skip connections. We extensively compare with previous image translation algorithms and editing with pretrained GAN works. Our extensive experiments show that our method significantly improves over the state-of-the-arts.
引用
收藏
页码:14777 / 14788
页数:12
相关论文
共 50 条
  • [21] Robotic Instrument Segmentation With Image-to-Image Translation
    Colleoni, Emanuele
    Stoyanov, Danail
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (02) : 935 - 942
  • [22] Semantically Consistent Image-to-Image Translation for Unsupervised Domain Adaptation
    Brehm, Stephan
    Scherer, Sebastian
    Lienhart, Rainer
    ICAART: PROCEEDINGS OF THE 14TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE - VOL 2, 2022, : 131 - 141
  • [23] Asymmetric GAN for Unpaired Image-to-Image Translation
    Li, Yu
    Tang, Sheng
    Zhang, Rui
    Zhang, Yongdong
    Li, Jintao
    Yan, Shuicheng
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (12) : 5881 - 5896
  • [24] Unpaired image-to-image translation of structural damage
    Varghese, Subin
    Hoskere, Vedhus
    ADVANCED ENGINEERING INFORMATICS, 2023, 56
  • [25] Image-to-image translation for wavefront and PSF estimation
    Smith, Jeffrey
    Cranney, Jesse
    Gretton, Charles
    Gratadour, Damien
    ADAPTIVE OPTICS SYSTEMS VIII, 2022, 12185
  • [26] CycleSAR: SAR Image Despeckling as Unpaired Image-to-Image Translation
    Lattari, Francesco
    Santomarco, Vincenzo
    Santambrogio, Riccardo
    Rucci, Alessio
    Matteucci, Matteo
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [27] OmniStyleGAN for Style-Guided Image-to-Image Translation
    Zhao, Qianyi
    Wang, Mengyin
    Zhang, Qing
    Wang, Fasheng
    Sun, Fuming
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT XI, 2025, 15041 : 351 - 365
  • [28] Unpaired Image-to-Image Translation with Diffusion Adversarial Network
    Tu, Hangyao
    Wang, Zheng
    Zhao, Yanwei
    MATHEMATICS, 2024, 12 (20)
  • [29] GAIT: GRADIENT ADJUSTED UNSUPERVISED IMAGE-TO-IMAGE TRANSLATION
    Akkaya, Ibrahim Batuhan
    Halici, Ugur
    2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 1591 - 1595
  • [30] A Diffusion Model Translator for Efficient Image-to-Image Translation
    Xia, Mengfei
    Zhou, Yu
    Yi, Ran
    Liu, Yong-Jin
    Wang, Wenping
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (12) : 10272 - 10283