HRInversion: High-Resolution GAN Inversion for Cross-Domain Image Synthesis

Cited by: 9
Authors
Zhou, Peng [1]
Xie, Lingxi [2]
Ni, Bingbing [1]
Liu, Lin [3]
Tian, Qi [2]
Affiliations
[1] Shanghai Jiao Tong Univ, Dept Elect Engn, Shanghai 200240, Peoples R China
[2] Huawei Cloud BU, Shenzhen 518129, Guangdong, Peoples R China
[3] Univ Sci & Technol China, Dept Elect Engn & Informat Sci, Hefei 230052, Anhui, Peoples R China
Funding
US National Science Foundation;
Keywords
Image reconstruction; Image resolution; Generative adversarial networks; Task analysis; Semantics; Generators; Image synthesis; GAN inversion; perceptual loss; image synthesis;
DOI
10.1109/TCSVT.2022.3222456
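Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronic technology, communication technology];
Discipline classification code
0808; 0809;
Abstract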
We investigate GAN inversion problems of using pre-trained GANs to reconstruct real images. Recent methods for such problems typically employ a VGG perceptual loss to measure the difference between images. While the perceptual loss has achieved remarkable success in various computer vision tasks, it may cause unpleasant artifacts and is sensitive to changes in input scale. This paper shows that algorithmic details are crucial for achieving satisfactory performance. In particular, we propose two important but undervalued design principles: (i) not down-sampling the input of the perceptual loss, to avoid high-frequency artifacts; and (ii) calculating the perceptual loss using convolutional features that are robust to scale. Integrating these designs yields the proposed framework, HRInversion, which achieves superior performance in reconstructing image details. We validate the effectiveness of HRInversion on a cross-domain image synthesis task and propose a post-processing approach named local style optimization (LSO) to synthesize clean and controllable stylized images. For the evaluation of the cross-domain images, we introduce a metric named ID retrieval, which captures the similarity of face identities of stylized images to content images. We also test HRInversion on non-square images. Equipped with implicit neural representation, HRInversion applies to ultra-high-resolution images with more than 10 million pixels. Furthermore, we show applications of style transfer and 3D-aware GAN inversion, paving the way for extending the application range of HRInversion.
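Design principle (i) in the abstract can be illustrated with a toy sketch. The example below is not the paper's implementation: it swaps the VGG perceptual loss for a plain pixel-wise mean squared error and uses simple average pooling as the down-sampling step, and the names `avg_pool` and `mse` are hypothetical helpers introduced only for illustration. Under those assumptions, it shows one plausible reason why down-sampling the input can hide high-frequency errors from a loss: a pixel-level checkerboard added to a flat image is clearly visible at full resolution but averages out almost entirely after 2x down-sampling.

```python
import numpy as np


def avg_pool(img, factor):
    """Down-sample a 2-D image by averaging non-overlapping factor x factor blocks."""
    h, w = img.shape
    blocks = img.reshape(h // factor, factor, w // factor, factor)
    return blocks.mean(axis=(1, 3))


def mse(a, b):
    """Pixel-wise mean squared error (a stand-in for a perceptual loss)."""
    return float(np.mean((a - b) ** 2))


# A flat gray "target" image and a "reconstruction" that differs from it
# only by a pixel-level checkerboard, i.e. a purely high-frequency artifact.
size = 8
target = np.full((size, size), 0.5)
rows, cols = np.indices((size, size))
checkerboard = np.where((rows + cols) % 2 == 0, 0.1, -0.1)
recon = target + checkerboard

# Error measured on the full-resolution images sees the artifact.
full_res_error = mse(target, recon)

# Error measured after 2x down-sampling does not: each 2x2 block contains
# two +0.1 and two -0.1 offsets, which cancel in the average.
downsampled_error = mse(avg_pool(target, 2), avg_pool(recon, 2))

print("full-resolution error:", full_res_error)
print("down-sampled error:   ", downsampled_error)
```

A loss that only ever sees the down-sampled pair has essentially no signal to penalize the checkerboard, which is consistent with the abstract's recommendation to feed the perceptual loss the input at its original resolution.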
Pages: 2147-2161
Page count: 15