Pyramid-VAE-GAN: Transferring hierarchical latent variables for image inpainting

Cited by: 4
Authors
Tian, Huiyuan [1]
Zhang, Li [1, 2]
Li, Shijian [1]
Yao, Min [1]
Pan, Gang [1]
Affiliations
[1] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou 310027, Peoples R China
[2] Zhejiang Univ, Adv Technol Res Inst, Hangzhou 310027, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
image inpainting; variational autoencoder (VAE); latent variable transfer (LTN); pyramid structure; generative model
DOI
10.1007/s41095-022-0331-3
Chinese Library Classification (CLC) number
TP31 [Computer software]
Discipline classification code
081202; 0835
Abstract
Significant progress has been made in image inpainting methods in recent years. However, existing methods cannot produce inpainting results that have reasonable structures, rich detail, and sharpness at the same time. In this paper, we propose the Pyramid-VAE-GAN network for image inpainting to address this limitation. Our network is built on a variational autoencoder (VAE) backbone that encodes high-level latent variables to represent complicated high-dimensional prior distributions of images. The prior assists in reconstructing reasonable structures when inpainting. We also adopt a pyramid structure in our model to maintain rich detail in low-level latent variables. To avoid the usual incompatibility of requiring both reasonable structures and rich detail, we propose a novel cross-layer latent variable transfer module. This transfers information about long-range structures contained in high-level latent variables to low-level latent variables representing more detailed information. We further use adversarial training to select the most reasonable results and to improve the sharpness of the images. Extensive experimental results on multiple datasets demonstrate the superiority of our method. Our code is available at https://github.com/thy960112/Pyramid-VAE-GAN.
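The two ideas named in the abstract, sampling latent variables at a coarse level and passing that information to a finer level, can be illustrated with a small NumPy sketch. This is not the authors' implementation (that lives in the repository linked above). The function names, the array shapes, and the mask-based fill rule in transfer_latent are all assumptions chosen for illustration; only the reparameterization step and the KL term are standard VAE components.

```python
import numpy as np


def reparameterize(mu, log_var, rng):
    """Standard VAE sampling: z = mu + sigma * eps, with eps ~ N(0, I)."""
    eps = rng.standard_normal(mu.shape)
    return mu + np.exp(0.5 * log_var) * eps


def kl_to_standard_normal(mu, log_var):
    """KL(N(mu, sigma^2) || N(0, I)), summed over the last (channel) axis."""
    return 0.5 * np.sum(np.exp(log_var) + mu ** 2 - 1.0 - log_var, axis=-1)


def upsample_nearest(z, factor):
    """Nearest-neighbour upsampling of an (H, W, C) latent map."""
    return z.repeat(factor, axis=0).repeat(factor, axis=1)


def transfer_latent(z_high, z_low, mask_low):
    """Hypothetical stand-in for a cross-layer transfer step.

    Inside the hole (mask_low == 1) the low-level latent map is filled with
    the upsampled high-level latent; outside the hole z_low is kept.
    Assumes z_high and z_low have the same number of channels and that the
    spatial size of z_low is an integer multiple of that of z_high.
    """
    factor = z_low.shape[0] // z_high.shape[0]
    z_up = upsample_nearest(z_high, factor)
    return mask_low * z_up + (1.0 - mask_low) * z_low


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    mu = np.zeros((4, 4, 8))
    log_var = np.zeros((4, 4, 8))
    z_high = reparameterize(mu, log_var, rng)   # coarse 4x4 latent map
    z_low = np.ones((8, 8, 8))                  # finer 8x8 latent map
    mask = np.zeros((8, 8, 1))
    mask[2:6, 2:6] = 1.0                        # square hole to inpaint
    z_filled = transfer_latent(z_high, z_low, mask)
    print(z_filled.shape)
```

In this toy version the coarse latent simply overwrites the masked region of the finer one; a learned module would instead combine the two levels with trainable layers.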
Pages: 827-841
Page count: 15
Related papers
2 in total
  • [1] Pyramid-VAE-GAN: Transferring hierarchical latent variables for image inpainting
    Tian, Huiyuan
    Zhang, Li
    Li, Shijian
    Yao, Min
    Pan, Gang
    Computational Visual Media, 2023, 9: 827-841
  • [2] Blind Image Inpainting Using Pyramid GAN on Thyroid Ultrasound Images
    Li, Xuewei
    Shen, Hongqian
    Yu, Mei
    Wei, Xi
    Han, Jiang
    Zhu, Jialin
    Gao, Jie
    Liu, Zhiqiang
    Zhang, Yulin
    Yu, Ruiguo
    2019 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), 2019: 678-683