Image-to-Image Translation With Disentangled Latent Vectors for Face Editing

被引：15

作者：

Dalva, Yusuf ^{[1
]}

Pehlivan, Hamza ^{[2
]}

Hatipoglu, Oyku Irmak ^{[3
]}

Moran, Cansu ^{[4
]}

Dundar, Aysegul ^{[5
]}

机构：

[1] Virginia Tech, Dept Comp Sci, Blacksburg, VA 24061 USA

[2] Max Planck Inst, D-80539 Munich, Germany

[3] Swiss Fed Inst Technol, CH-8092 Zurich, Switzerland

[4] Tech Univ Munich, D-80333 Munich, Germany

[5] Bilkent Univ, Dept Comp Sci, TR-06800 Bilkent, Turkiye

来源：

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE | 2023年 / 45卷 / 12期

关键词：

Image translation; generative adversarial net works; latent space manipulation; face attribute editing;

D O I：

10.1109/TPAMI.2023.3308102

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We propose an image-to-image translation framework for facial attribute editing with disentangled interpretable latent directions. Facial attribute editing task faces the challenges of targeted attribute editing with controllable strength and disentanglement in the representations of attributes to preserve the other attributes during edits. For this goal, inspired by the latent space factorization works of fixed pretrained GANs, we design the attribute editing by latent space factorization, and for each attribute, we learn a linear direction that is orthogonal to the others. We train these directions with orthogonality constraints and disentanglement losses. To project images to semantically organized latent spaces, we set an encoder-decoder architecture with attention-based skip connections. We extensively compare with previous image translation algorithms and editing with pretrained GAN works. Our extensive experiments show that our method significantly improves over the state-of-the-arts.

引用

页码：14777 / 14788

页数：12

共 50 条

[41] Literature Review of Generative models for Image-to-Image translation problems
Kamil, Anwar
Shaikh, Talal
PROCEEDINGS OF 2019 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND KNOWLEDGE ECONOMY (ICCIKE' 2019), 2019, : 341 - 346
[42] UNPAIRED IMAGE-TO-IMAGE TRANSLATION FROM SHARED DEEP SPACE
Wu, Xuehui
Shao, Jie
Gao, Lianli
Shen, Heng Tao
2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 2127 - 2131
[43] Image-to-image translation for wavefront and point spread function estimation
Smith, Jeffrey
Cranney, Jesse
Gretton, Charles
Gratadour, Damien
JOURNAL OF ASTRONOMICAL TELESCOPES INSTRUMENTS AND SYSTEMS, 2023, 9 (01) : 19001
[44] Guided Image-to-Image Translation by Discriminator-Generator Communication
Cao, Yuanjiang
Yao, Lina
Pan, Le
Sheng, Quan Z.
Chang, Xiaojun
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 1528 - 1538
[45] SUNIT: multimodal unsupervised image-to-image translation with shared encoder
Lin, Liyuan
Ji, Shulin
Zhou, Yuan
Zhang, Shun
JOURNAL OF ELECTRONIC IMAGING, 2022, 31 (01)
[46] GEN: Generative Equivariant Networks for Diverse Image-to-Image Translation
Shamsolmoali, Pourya
Zareapoor, Masoumeh
Das, Swagatam
Garcia, Salvador
Granger, Eric
Yang, Jie
IEEE TRANSACTIONS ON CYBERNETICS, 2023, 53 (02) : 874 - 886
[47] InvolutionGAN: lightweight GAN with involution for unsupervised image-to-image translation
Deng, Haipeng
Wu, Qiuxia
Huang, Han
Yang, Xiaowei
Wang, Zhiyong
NEURAL COMPUTING & APPLICATIONS, 2023, 35 (22) : 16593 - 16605
[48] Deep Generative Adversarial Networks for Image-to-Image Translation: A Review
Alotaibi, Aziz
SYMMETRY-BASEL, 2020, 12 (10): : 1 - 26
[49] InvolutionGAN: lightweight GAN with involution for unsupervised image-to-image translation
Haipeng Deng
Qiuxia Wu
Han Huang
Xiaowei Yang
Zhiyong Wang
Neural Computing and Applications, 2023, 35 : 16593 - 16605
[50] Underwater dam crack image generation based on unsupervised image-to-image translation
Huang, Ben
Kang, Fei
Li, Xinyu
Zhu, Sisi
AUTOMATION IN CONSTRUCTION, 2024, 163

← 1 2 3 4 5 →