Image-to-Image Translation With Disentangled Latent Vectors for Face Editing

被引:15
|
作者
Dalva, Yusuf [1 ]
Pehlivan, Hamza [2 ]
Hatipoglu, Oyku Irmak [3 ]
Moran, Cansu [4 ]
Dundar, Aysegul [5 ]
机构
[1] Virginia Tech, Dept Comp Sci, Blacksburg, VA 24061 USA
[2] Max Planck Inst, D-80539 Munich, Germany
[3] Swiss Fed Inst Technol, CH-8092 Zurich, Switzerland
[4] Tech Univ Munich, D-80333 Munich, Germany
[5] Bilkent Univ, Dept Comp Sci, TR-06800 Bilkent, Turkiye
关键词
Image translation; generative adversarial net works; latent space manipulation; face attribute editing;
D O I
10.1109/TPAMI.2023.3308102
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose an image-to-image translation framework for facial attribute editing with disentangled interpretable latent directions. Facial attribute editing task faces the challenges of targeted attribute editing with controllable strength and disentanglement in the representations of attributes to preserve the other attributes during edits. For this goal, inspired by the latent space factorization works of fixed pretrained GANs, we design the attribute editing by latent space factorization, and for each attribute, we learn a linear direction that is orthogonal to the others. We train these directions with orthogonality constraints and disentanglement losses. To project images to semantically organized latent spaces, we set an encoder-decoder architecture with attention-based skip connections. We extensively compare with previous image translation algorithms and editing with pretrained GAN works. Our extensive experiments show that our method significantly improves over the state-of-the-arts.
引用
收藏
页码:14777 / 14788
页数:12
相关论文
共 50 条
  • [1] VecGAN: Image-to-Image Translation with Interpretable Latent Directions
    Dalva, Yusuf
    Altindis, Said Fahri
    Dundar, Aysegul
    COMPUTER VISION - ECCV 2022, PT XVI, 2022, 13676 : 153 - 169
  • [2] Style-Guided and Disentangled Representation for Robust Image-to-Image Translation
    Choi, Jaewoong
    Kim, Daeha
    Song, Byung Cheol
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 463 - 471
  • [3] DRIT plus plus : Diverse Image-to-Image Translation via Disentangled Representations
    Lee, Hsin-Ying
    Tseng, Hung-Yu
    Mao, Qi
    Huang, Jia-Bin
    Lu, Yu-Ding
    Singh, Maneesh
    Yang, Ming-Hsuan
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2020, 128 (10-11) : 2402 - 2417
  • [4] Unpaired Image-to-Image Translation via Latent Energy Transport
    Zhao, Yang
    Chen, Changyou
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 16413 - 16422
  • [5] Continuous and Diverse Image-to-Image Translation via Signed Attribute Vectors
    Mao, Qi
    Tseng, Hung-Yu
    Lee, Hsin-Ying
    Huang, Jia-Bin
    Ma, Siwei
    Yang, Ming-Hsuan
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2022, 130 (02) : 517 - 549
  • [6] Continuous and Diverse Image-to-Image Translation via Signed Attribute Vectors
    Qi Mao
    Hung-Yu Tseng
    Hsin-Ying Lee
    Jia-Bin Huang
    Siwei Ma
    Ming-Hsuan Yang
    International Journal of Computer Vision, 2022, 130 : 517 - 549
  • [7] Hypercomplex Image-to-Image Translation
    Grassucci, Eleonora
    Sigillo, Luigi
    Uncini, Aurelio
    Comminiello, Danilo
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [8] Generative image completion with image-to-image translation
    Xu, Shuzhen
    Zhu, Qing
    Wang, Jin
    NEURAL COMPUTING & APPLICATIONS, 2020, 32 (11) : 7333 - 7345
  • [9] Generative image completion with image-to-image translation
    Shuzhen Xu
    Qing Zhu
    Jin Wang
    Neural Computing and Applications, 2020, 32 : 7333 - 7345
  • [10] One-way multimodal image-to-image translation for heterogeneous face recognition
    Ji, Shulin
    Zhai, Xingang
    Liu, Jie
    JOURNAL OF ELECTRONIC IMAGING, 2023, 32 (03)