HRInversion: High-Resolution GAN Inversion for Cross-Domain Image Synthesis

被引：9

作者：

Zhou, Peng ^{[1
]}

Xie, Lingxi ^{[2
]}

Ni, Bingbing ^{[1
]}

Liu, Lin ^{[3
]}

Tian, Qi ^{[2
]}

机构：

[1] Shanghai Jiao Tong Univ, Dept Elect Engn, Shanghai 200240, Peoples R China

[2] Huawei Cloud BU, Guangdong518129, Shenzhen, Peoples R China

[3] Univ Sci & Technol China, Dept Elect Engn & Informat Sci, Hefei 230052, Anhui, Peoples R China

来源：

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY | 2023年 / 33卷 / 05期

基金：

美国国家科学基金会;

关键词：

Image reconstruction; Image resolution; Generative adversarial networks; Task analysis; Semantics; Generators; Image synthesis; GAN inversion; perceptual loss; image synthesis;

D O I：

10.1109/TCSVT.2022.3222456

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

We investigate GAN inversion problems of using pre-trained GANs to reconstruct real images. Recent methods for such problems typically employ a VGG perceptual loss to measure the difference between images. While the perceptual loss has achieved remarkable success in various computer vision tasks, it may cause unpleasant artifacts and is sensitive to changes in input scale. This paper delivers an important message that algorithm details are crucial for achieving satisfying performance. In particular, we propose two important but undervalued design principles: (i) not down-sampling the input of the perceptual loss to avoid high-frequency artifacts; and (ii) calculating the perceptual loss using convolutional features which are robust to scale. Integrating these designs derives the proposed framework, HRInversion, that achieves superior performance in reconstructing image details. We validate the effectiveness of HRInversion on a cross-domain image synthesis task and propose a post-processing approach named local style optimization (LSO) to synthesize clean and controllable stylized images. For the evaluation of the cross-domain images, we introduce a metric named ID retrieval which captures the similarity of face identities of stylized images to content images. We also test HRInversion on non-square images. Equipped with implicit neural representation, HRInversion applies to ultra-high resolution images with more than 10 million pixels. Furthermore, we show applications of style transfer and 3D-aware GAN inversion, paving the way for extending the application range of HRInversion.

引用

页码：2147 / 2161

页数：15

共 50 条

[21] SDIT: Scalable and Diverse Cross-domain Image Translation
Wang, Yaxing
Gonzalez-Garcia, Abel
van de Weijer, Joost
Herranz, Luis
PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 1267 - 1276
[22] Discriminative Style Learning for Cross-Domain Image Captioning
Yuan, Jin
Zhu, Shuai
Huang, Shuyin
Zhang, Hanwang
Xiao, Yaoqiang
Li, Zhiyong
Wang, Meng
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 1723 - 1736
[23] Active Discriminative Cross-Domain Alignment for Low-Resolution Face Recognition
Zheng, Dongdong
Zhang, Kaibing
Lu, Jian
Jing, Junfeng
Xiong, Zenggang
IEEE ACCESS, 2020, 8 : 97503 - 97515
[24] Latent space manipulation for high-resolution medical image synthesis via the StyleGAN
Fetty, Lukas
Bylund, Mikael
Kuess, Peter
Heilemann, Gerd
Nyholm, Tufve
Georg, Dietmar
Lofstedt, Tommy
ZEITSCHRIFT FUR MEDIZINISCHE PHYSIK, 2020, 30 (04): : 305 - 314
[25] ISF-GAN: An Implicit Style Function for High-Resolution Image-to-Image Translation
Liu, Yahui
Chen, Yajing
Bao, Linchao
Sebe, Nicu
Lepri, Bruno
De Nadai, Marco
IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 3343 - 3353
[26] A Two-Stage GAN for High-Resolution Retinal Image Generation and Segmentation
Andreini, Paolo
Ciano, Giorgio
Bonechi, Simone
Graziani, Caterina
Lachi, Veronica
Mecocci, Alessandro
Sodi, Andrea
Scarselli, Franco
Bianchini, Monica
ELECTRONICS, 2022, 11 (01)
[27] Integrating Cross-Domain Feature Representation and Semantic Guidance for Underwater Image Enhancement
Li, Fei
Zheng, Jiangbin
Wang, Lu
Wang, Shengkang
IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 1511 - 1515
[28] Exploring Cross-Domain Pretrained Model for Hyperspectral Image Classification
Lee, Hyungtae
Eum, Sungmin
Kwon, Heesung
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[29] ProfileSR-GAN: A GAN Based Super-Resolution Method for Generating High-Resolution Load Profiles
Song, Lidong
Li, Yiyan
Lu, Ning
IEEE TRANSACTIONS ON SMART GRID, 2022, 13 (04) : 3278 - 3289
[30] Domain Adversarial Disentanglement Network With Cross-Domain Synthesis for Generalized Face Anti-Spoofing
Yan, Wenjun
Zeng, Ying
Hu, Haifeng
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (10) : 7033 - 7046

← 1 2 3 4 5 →