HRInversion: High-Resolution GAN Inversion for Cross-Domain Image Synthesis

Cited by: 9

Authors:

Zhou, Peng [1]

Xie, Lingxi [2]

Ni, Bingbing [1]

Liu, Lin [3]

Tian, Qi [2]

Affiliations:

[1] Shanghai Jiao Tong Univ, Dept Elect Engn, Shanghai 200240, Peoples R China

[2] Huawei Cloud BU, Guangdong 518129, Shenzhen, Peoples R China

[3] Univ Sci & Technol China, Dept Elect Engn & Informat Sci, Hefei 230052, Anhui, Peoples R China

Source:

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY | 2023, Vol. 33, No. 5

Funding:

U.S. National Science Foundation;

Keywords:

Image reconstruction; Image resolution; Generative adversarial networks; Task analysis; Semantics; Generators; Image synthesis; GAN inversion; perceptual loss; image synthesis;

DOI:

10.1109/TCSVT.2022.3222456

Chinese Library Classification (CLC) codes:

TM [Electrical engineering]; TN [Electronics and communication technology];

Discipline classification codes:

0808 ; 0809 ;

Abstract:

We investigate GAN inversion problems of using pre-trained GANs to reconstruct real images. Recent methods for such problems typically employ a VGG perceptual loss to measure the difference between images. While the perceptual loss has achieved remarkable success in various computer vision tasks, it may cause unpleasant artifacts and is sensitive to changes in input scale. This paper delivers an important message that algorithm details are crucial for achieving satisfying performance. In particular, we propose two important but undervalued design principles: (i) not down-sampling the input of the perceptual loss to avoid high-frequency artifacts; and (ii) calculating the perceptual loss using convolutional features which are robust to scale. Integrating these designs derives the proposed framework, HRInversion, that achieves superior performance in reconstructing image details. We validate the effectiveness of HRInversion on a cross-domain image synthesis task and propose a post-processing approach named local style optimization (LSO) to synthesize clean and controllable stylized images. For the evaluation of the cross-domain images, we introduce a metric named ID retrieval which captures the similarity of face identities of stylized images to content images. We also test HRInversion on non-square images. Equipped with implicit neural representation, HRInversion applies to ultra-high resolution images with more than 10 million pixels. Furthermore, we show applications of style transfer and 3D-aware GAN inversion, paving the way for extending the application range of HRInversion.

Citation:

Pages: 2147-2161

Page count: 15
