Image-to-Image Translation With Disentangled Latent Vectors for Face Editing

被引：15

作者：

Dalva, Yusuf ^{[1
]}

Pehlivan, Hamza ^{[2
]}

Hatipoglu, Oyku Irmak ^{[3
]}

Moran, Cansu ^{[4
]}

Dundar, Aysegul ^{[5
]}

机构：

[1] Virginia Tech, Dept Comp Sci, Blacksburg, VA 24061 USA

[2] Max Planck Inst, D-80539 Munich, Germany

[3] Swiss Fed Inst Technol, CH-8092 Zurich, Switzerland

[4] Tech Univ Munich, D-80333 Munich, Germany

[5] Bilkent Univ, Dept Comp Sci, TR-06800 Bilkent, Turkiye

来源：

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE | 2023年 / 45卷 / 12期

关键词：

Image translation; generative adversarial net works; latent space manipulation; face attribute editing;

D O I：

10.1109/TPAMI.2023.3308102

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We propose an image-to-image translation framework for facial attribute editing with disentangled interpretable latent directions. Facial attribute editing task faces the challenges of targeted attribute editing with controllable strength and disentanglement in the representations of attributes to preserve the other attributes during edits. For this goal, inspired by the latent space factorization works of fixed pretrained GANs, we design the attribute editing by latent space factorization, and for each attribute, we learn a linear direction that is orthogonal to the others. We train these directions with orthogonality constraints and disentanglement losses. To project images to semantically organized latent spaces, we set an encoder-decoder architecture with attention-based skip connections. We extensively compare with previous image translation algorithms and editing with pretrained GAN works. Our extensive experiments show that our method significantly improves over the state-of-the-arts.

引用

页码：14777 / 14788

页数：12

共 50 条

[21] Robotic Instrument Segmentation With Image-to-Image Translation
Colleoni, Emanuele
Stoyanov, Danail
IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (02) : 935 - 942
[22] Semantically Consistent Image-to-Image Translation for Unsupervised Domain Adaptation
Brehm, Stephan
Scherer, Sebastian
Lienhart, Rainer
ICAART: PROCEEDINGS OF THE 14TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE - VOL 2, 2022, : 131 - 141
[23] Asymmetric GAN for Unpaired Image-to-Image Translation
Li, Yu
Tang, Sheng
Zhang, Rui
Zhang, Yongdong
Li, Jintao
Yan, Shuicheng
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (12) : 5881 - 5896
[24] Unpaired image-to-image translation of structural damage
Varghese, Subin
Hoskere, Vedhus
ADVANCED ENGINEERING INFORMATICS, 2023, 56
[25] Image-to-image translation for wavefront and PSF estimation
Smith, Jeffrey
Cranney, Jesse
Gretton, Charles
Gratadour, Damien
ADAPTIVE OPTICS SYSTEMS VIII, 2022, 12185
[26] CycleSAR: SAR Image Despeckling as Unpaired Image-to-Image Translation
Lattari, Francesco
Santomarco, Vincenzo
Santambrogio, Riccardo
Rucci, Alessio
Matteucci, Matteo
2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
[27] OmniStyleGAN for Style-Guided Image-to-Image Translation
Zhao, Qianyi
Wang, Mengyin
Zhang, Qing
Wang, Fasheng
Sun, Fuming
PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT XI, 2025, 15041 : 351 - 365
[28] Unpaired Image-to-Image Translation with Diffusion Adversarial Network
Tu, Hangyao
Wang, Zheng
Zhao, Yanwei
MATHEMATICS, 2024, 12 (20)
[29] GAIT: GRADIENT ADJUSTED UNSUPERVISED IMAGE-TO-IMAGE TRANSLATION
Akkaya, Ibrahim Batuhan
Halici, Ugur
2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 1591 - 1595
[30] A Diffusion Model Translator for Efficient Image-to-Image Translation
Xia, Mengfei
Zhou, Yu
Yi, Ran
Liu, Yong-Jin
Wang, Wenping
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (12) : 10272 - 10283

← 1 2 3 4 5 →