HRInversion: High-Resolution GAN Inversion for Cross-Domain Image Synthesis

Cited by: 9
Authors
Zhou, Peng [1]
Xie, Lingxi [2]
Ni, Bingbing [1]
Liu, Lin [3]
Tian, Qi [2]
Affiliations
[1] Shanghai Jiao Tong Univ, Dept Elect Engn, Shanghai 200240, Peoples R China
[2] Huawei Cloud BU, Shenzhen 518129, Guangdong, Peoples R China
[3] Univ Sci & Technol China, Dept Elect Engn & Informat Sci, Hefei 230052, Anhui, Peoples R China
Funding
National Science Foundation (USA);
Keywords
Image reconstruction; Image resolution; Generative adversarial networks; Task analysis; Semantics; Generators; Image synthesis; GAN inversion; perceptual loss; image synthesis;
DOI
10.1109/TCSVT.2022.3222456
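Chinese Library Classification number
TM [Electrical engineering]; TN [Electronic technology, communication technology];
Discipline classification code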
0808 ; 0809 ;
Abstract
We investigate GAN inversion problems of using pre-trained GANs to reconstruct real images. Recent methods for such problems typically employ a VGG perceptual loss to measure the difference between images. While the perceptual loss has achieved remarkable success in various computer vision tasks, it may cause unpleasant artifacts and is sensitive to changes in input scale. This paper delivers an important message that algorithm details are crucial for achieving satisfying performance. In particular, we propose two important but undervalued design principles: (i) not down-sampling the input of the perceptual loss to avoid high-frequency artifacts; and (ii) calculating the perceptual loss using convolutional features which are robust to scale. Integrating these designs derives the proposed framework, HRInversion, that achieves superior performance in reconstructing image details. We validate the effectiveness of HRInversion on a cross-domain image synthesis task and propose a post-processing approach named local style optimization (LSO) to synthesize clean and controllable stylized images. For the evaluation of the cross-domain images, we introduce a metric named ID retrieval which captures the similarity of face identities of stylized images to content images. We also test HRInversion on non-square images. Equipped with implicit neural representation, HRInversion applies to ultra-high resolution images with more than 10 million pixels. Furthermore, we show applications of style transfer and 3D-aware GAN inversion, paving the way for extending the application range of HRInversion.
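Design principle (i) in the abstract, not down-sampling the input of the perceptual loss, can be illustrated with a toy sketch. This is not the authors' implementation: plain mean-squared error stands in for the VGG perceptual loss, 2x2 average pooling stands in for the down-sampling step, and the names `avg_pool_2x2`, `mse`, `target`, and `recon` are hypothetical. It only shows that a high-frequency artifact visible at full resolution can vanish entirely once both images are down-sampled, so a loss computed after down-sampling cannot penalize it.

```python
# Toy illustration only (assumptions): MSE replaces the VGG perceptual loss,
# and 2x2 average pooling replaces the down-sampling applied before the loss.

def avg_pool_2x2(img):
    """Down-sample a 2D list of floats by averaging each 2x2 block."""
    h, w = len(img), len(img[0])
    return [
        [
            (img[y][x] + img[y][x + 1] + img[y + 1][x] + img[y + 1][x + 1]) / 4.0
            for x in range(0, w, 2)
        ]
        for y in range(0, h, 2)
    ]


def mse(a, b):
    """Mean-squared error between two equally sized 2D lists."""
    n = len(a) * len(a[0])
    return sum(
        (pa - pb) ** 2 for ra, rb in zip(a, b) for pa, pb in zip(ra, rb)
    ) / n


size = 8

# Target image: flat gray.
target = [[0.5 for _ in range(size)] for _ in range(size)]

# Reconstruction: the same gray plus a checkerboard artifact, i.e. an error
# at the highest spatial frequency the pixel grid can represent.
recon = [
    [0.5 + (0.25 if (x + y) % 2 == 0 else -0.25) for x in range(size)]
    for y in range(size)
]

# Loss on the full-resolution images sees the artifact.
loss_full = mse(recon, target)

# Loss after down-sampling both images: each 2x2 block averages to 0.5,
# so the artifact is invisible to the loss.
loss_down = mse(avg_pool_2x2(recon), avg_pool_2x2(target))

print("full-resolution loss:", loss_full)
print("down-sampled loss:   ", loss_down)
```

In an optimization-based inversion, a loss that is blind to such patterns gives the optimizer no signal to remove them, which is consistent with the abstract's remark about high-frequency artifacts.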
Pages: 2147-2161
Number of pages: 15
Related papers
50 in total
  • [21] SDIT: Scalable and Diverse Cross-domain Image Translation
    Wang, Yaxing
    Gonzalez-Garcia, Abel
    van de Weijer, Joost
    Herranz, Luis
    PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 1267 - 1276
  • [22] Discriminative Style Learning for Cross-Domain Image Captioning
    Yuan, Jin
    Zhu, Shuai
    Huang, Shuyin
    Zhang, Hanwang
    Xiao, Yaoqiang
    Li, Zhiyong
    Wang, Meng
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 1723 - 1736
  • [23] Active Discriminative Cross-Domain Alignment for Low-Resolution Face Recognition
    Zheng, Dongdong
    Zhang, Kaibing
    Lu, Jian
    Jing, Junfeng
    Xiong, Zenggang
    IEEE ACCESS, 2020, 8 : 97503 - 97515
  • [24] Latent space manipulation for high-resolution medical image synthesis via the StyleGAN
    Fetty, Lukas
    Bylund, Mikael
    Kuess, Peter
    Heilemann, Gerd
    Nyholm, Tufve
    Georg, Dietmar
    Lofstedt, Tommy
    ZEITSCHRIFT FUR MEDIZINISCHE PHYSIK, 2020, 30 (04): : 305 - 314
  • [25] ISF-GAN: An Implicit Style Function for High-Resolution Image-to-Image Translation
    Liu, Yahui
    Chen, Yajing
    Bao, Linchao
    Sebe, Nicu
    Lepri, Bruno
    De Nadai, Marco
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 3343 - 3353
  • [26] A Two-Stage GAN for High-Resolution Retinal Image Generation and Segmentation
    Andreini, Paolo
    Ciano, Giorgio
    Bonechi, Simone
    Graziani, Caterina
    Lachi, Veronica
    Mecocci, Alessandro
    Sodi, Andrea
    Scarselli, Franco
    Bianchini, Monica
    ELECTRONICS, 2022, 11 (01)
  • [27] Integrating Cross-Domain Feature Representation and Semantic Guidance for Underwater Image Enhancement
    Li, Fei
    Zheng, Jiangbin
    Wang, Lu
    Wang, Shengkang
    IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 1511 - 1515
  • [28] Exploring Cross-Domain Pretrained Model for Hyperspectral Image Classification
    Lee, Hyungtae
    Eum, Sungmin
    Kwon, Heesung
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [29] ProfileSR-GAN: A GAN Based Super-Resolution Method for Generating High-Resolution Load Profiles
    Song, Lidong
    Li, Yiyan
    Lu, Ning
    IEEE TRANSACTIONS ON SMART GRID, 2022, 13 (04) : 3278 - 3289
  • [30] Domain Adversarial Disentanglement Network With Cross-Domain Synthesis for Generalized Face Anti-Spoofing
    Yan, Wenjun
    Zeng, Ying
    Hu, Haifeng
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (10) : 7033 - 7046