Image-to-Image Translation With Disentangled Latent Vectors for Face Editing

被引:15
|
作者
Dalva, Yusuf [1 ]
Pehlivan, Hamza [2 ]
Hatipoglu, Oyku Irmak [3 ]
Moran, Cansu [4 ]
Dundar, Aysegul [5 ]
机构
[1] Virginia Tech, Dept Comp Sci, Blacksburg, VA 24061 USA
[2] Max Planck Inst, D-80539 Munich, Germany
[3] Swiss Fed Inst Technol, CH-8092 Zurich, Switzerland
[4] Tech Univ Munich, D-80333 Munich, Germany
[5] Bilkent Univ, Dept Comp Sci, TR-06800 Bilkent, Turkiye
关键词
Image translation; generative adversarial net works; latent space manipulation; face attribute editing;
D O I
10.1109/TPAMI.2023.3308102
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose an image-to-image translation framework for facial attribute editing with disentangled interpretable latent directions. Facial attribute editing task faces the challenges of targeted attribute editing with controllable strength and disentanglement in the representations of attributes to preserve the other attributes during edits. For this goal, inspired by the latent space factorization works of fixed pretrained GANs, we design the attribute editing by latent space factorization, and for each attribute, we learn a linear direction that is orthogonal to the others. We train these directions with orthogonality constraints and disentanglement losses. To project images to semantically organized latent spaces, we set an encoder-decoder architecture with attention-based skip connections. We extensively compare with previous image translation algorithms and editing with pretrained GAN works. Our extensive experiments show that our method significantly improves over the state-of-the-arts.
引用
收藏
页码:14777 / 14788
页数:12
相关论文
共 50 条
  • [41] Literature Review of Generative models for Image-to-Image translation problems
    Kamil, Anwar
    Shaikh, Talal
    PROCEEDINGS OF 2019 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND KNOWLEDGE ECONOMY (ICCIKE' 2019), 2019, : 341 - 346
  • [42] UNPAIRED IMAGE-TO-IMAGE TRANSLATION FROM SHARED DEEP SPACE
    Wu, Xuehui
    Shao, Jie
    Gao, Lianli
    Shen, Heng Tao
    2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 2127 - 2131
  • [43] Image-to-image translation for wavefront and point spread function estimation
    Smith, Jeffrey
    Cranney, Jesse
    Gretton, Charles
    Gratadour, Damien
    JOURNAL OF ASTRONOMICAL TELESCOPES INSTRUMENTS AND SYSTEMS, 2023, 9 (01) : 19001
  • [44] Guided Image-to-Image Translation by Discriminator-Generator Communication
    Cao, Yuanjiang
    Yao, Lina
    Pan, Le
    Sheng, Quan Z.
    Chang, Xiaojun
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 1528 - 1538
  • [45] SUNIT: multimodal unsupervised image-to-image translation with shared encoder
    Lin, Liyuan
    Ji, Shulin
    Zhou, Yuan
    Zhang, Shun
    JOURNAL OF ELECTRONIC IMAGING, 2022, 31 (01)
  • [46] GEN: Generative Equivariant Networks for Diverse Image-to-Image Translation
    Shamsolmoali, Pourya
    Zareapoor, Masoumeh
    Das, Swagatam
    Garcia, Salvador
    Granger, Eric
    Yang, Jie
    IEEE TRANSACTIONS ON CYBERNETICS, 2023, 53 (02) : 874 - 886
  • [47] InvolutionGAN: lightweight GAN with involution for unsupervised image-to-image translation
    Deng, Haipeng
    Wu, Qiuxia
    Huang, Han
    Yang, Xiaowei
    Wang, Zhiyong
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (22) : 16593 - 16605
  • [48] Deep Generative Adversarial Networks for Image-to-Image Translation: A Review
    Alotaibi, Aziz
    SYMMETRY-BASEL, 2020, 12 (10): : 1 - 26
  • [49] InvolutionGAN: lightweight GAN with involution for unsupervised image-to-image translation
    Haipeng Deng
    Qiuxia Wu
    Han Huang
    Xiaowei Yang
    Zhiyong Wang
    Neural Computing and Applications, 2023, 35 : 16593 - 16605
  • [50] Underwater dam crack image generation based on unsupervised image-to-image translation
    Huang, Ben
    Kang, Fei
    Li, Xinyu
    Zhu, Sisi
    AUTOMATION IN CONSTRUCTION, 2024, 163