Image-to-Image Translation With Disentangled Latent Vectors for Face Editing

Cited by: 15

Authors:

Dalva, Yusuf [1]

Pehlivan, Hamza [2]

Hatipoglu, Oyku Irmak [3]

Moran, Cansu [4]

Dundar, Aysegul [5]

Affiliations:

[1] Virginia Tech, Dept Comp Sci, Blacksburg, VA 24061 USA

[2] Max Planck Inst, D-80539 Munich, Germany

[3] Swiss Fed Inst Technol, CH-8092 Zurich, Switzerland

[4] Tech Univ Munich, D-80333 Munich, Germany

[5] Bilkent Univ, Dept Comp Sci, TR-06800 Bilkent, Turkiye

Source:

IEEE Transactions on Pattern Analysis and Machine Intelligence | 2023 / Vol. 45 / No. 12

Keywords:

Image translation; generative adversarial networks; latent space manipulation; face attribute editing

DOI:

10.1109/TPAMI.2023.3308102

Chinese Library Classification number:

TP18 [Artificial intelligence theory]

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

We propose an image-to-image translation framework for facial attribute editing with disentangled, interpretable latent directions. Facial attribute editing faces two challenges: editing a targeted attribute with controllable strength, and disentangling the attribute representations so that the other attributes are preserved during edits. To this end, inspired by work on latent space factorization of fixed pretrained GANs, we design attribute editing as latent space factorization and, for each attribute, learn a linear direction that is orthogonal to the others. We train these directions with orthogonality constraints and disentanglement losses. To project images into semantically organized latent spaces, we use an encoder-decoder architecture with attention-based skip connections. We compare extensively with previous image translation algorithms and with editing methods based on pretrained GANs. Our experiments show that our method significantly improves over the state of the art.
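As a rough illustration of the idea in the abstract (one linear direction per attribute, kept orthogonal to the others, with a scalar controlling edit strength), the following is a minimal sketch and not the authors' implementation. The function names, the three-dimensional toy latent code, and the squared-dot-product penalty are all assumptions made for this example; the record does not give the actual losses or architecture.

```python
import math

def dot(u, v):
    """Inner product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))

def normalize(v):
    """Scale a vector to unit length."""
    n = math.sqrt(dot(v, v))
    return [a / n for a in v]

def orthogonality_penalty(directions):
    """Hypothetical penalty: sum of squared pairwise dot products
    between distinct unit directions (zero when all are orthogonal)."""
    units = [normalize(d) for d in directions]
    total = 0.0
    for i in range(len(units)):
        for j in range(i + 1, len(units)):
            total += dot(units[i], units[j]) ** 2
    return total

def edit_latent(z, direction, strength):
    """Shift latent code z along one attribute direction by a scalar strength."""
    d = normalize(direction)
    return [a + strength * b for a, b in zip(z, d)]

def attribute_score(z, direction):
    """Projection of z onto an attribute direction."""
    return dot(z, normalize(direction))

# Toy example: two assumed attribute directions in a 3-D latent space.
smile = [1.0, 0.0, 0.0]
hair = [0.0, 1.0, 0.0]
z = [0.5, -0.2, 0.3]

z_edited = edit_latent(z, smile, strength=2.0)
penalty = orthogonality_penalty([smile, hair])
```

Because the two toy directions are orthogonal, moving along `smile` changes the projection onto `smile` by exactly the chosen strength and leaves the projection onto `hair` untouched, which is the preservation property the abstract attributes to disentangled directions.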

Citation

Pages: 14777-14788

Page count: 12
