Wavelet Domain Generative Adversarial Network for Multi-scale Face Hallucination

Cited by: 58
Authors
Huang, Huaibo [1, 2, 3, 4]
He, Ran [1, 2, 3, 4]
Sun, Zhenan [1, 2, 3, 4]
Tan, Tieniu [1, 2, 3, 4]
Affiliations
[1] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing, Peoples R China
[2] CASIA, Ctr Res Intelligent Percept & Comp, Beijing, Peoples R China
[3] CASIA, Natl Lab Pattern Recognit, Beijing, Peoples R China
[4] Chinese Acad Sci, Ctr Excellence Brain Sci & Intelligence Technol, Beijing, Peoples R China
Funding
National Natural Science Foundation of China; Beijing Natural Science Foundation;
Keywords
Face hallucination; Super-resolution; Wavelet transform; Generative adversarial network; Face recognition; SUPERRESOLUTION; IMAGE;
DOI
10.1007/s11263-019-01154-8
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Most modern face hallucination methods rely on convolutional neural networks (CNNs) to infer high-resolution (HR) face images. However, when dealing with very low-resolution (LR) images, these CNN-based methods tend to produce over-smoothed outputs. To address this challenge, this paper proposes a wavelet-domain generative adversarial method that can ultra-resolve a very low-resolution face image (such as 16x16 or even 8x8) to larger versions at multiple upscaling factors (2x to 16x) in a unified framework. Unlike most existing studies, which hallucinate faces in the image pixel domain, our method first learns to predict the wavelet information of HR face images from the corresponding LR inputs before image-level super-resolution. To capture both the global topology and the local texture details of human faces, a flexible and extensible generative adversarial network is designed with three types of losses: (1) a wavelet reconstruction loss, which pushes the predicted wavelets closer to the ground truth; (2) a wavelet adversarial loss, which encourages realistic wavelets; and (3) an identity preserving loss, which helps recover identity information. Extensive experiments demonstrate that the presented approach not only achieves better results, both quantitatively and qualitatively, than state-of-the-art face hallucination methods, but also significantly improves identification accuracy for low-resolution face images captured in the wild.
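To make the idea of predicting wavelet information concrete, the following is a minimal sketch of a one-level 2D Haar decomposition, its inverse, and a mean-squared-error loss over the sub-bands. The abstract does not specify the wavelet family, the number of decomposition levels, or the exact loss form, so the Haar filter, the single level, the MSE, and all function names here are assumptions made for illustration only; the adversarial and identity preserving losses require trained networks and are omitted.

```python
import numpy as np


def haar_dwt2(img):
    """One-level 2D Haar transform of an even-sized grayscale image.

    Returns four half-resolution sub-bands: one approximation band and
    three detail bands (illustrative naming, not taken from the paper).
    """
    a = img[0::2, 0::2]  # top-left pixel of each 2x2 block
    b = img[0::2, 1::2]  # top-right
    c = img[1::2, 0::2]  # bottom-left
    d = img[1::2, 1::2]  # bottom-right
    approx = (a + b + c + d) / 2.0
    detail_1 = (a - b + c - d) / 2.0  # difference between columns
    detail_2 = (a + b - c - d) / 2.0  # difference between rows
    detail_3 = (a - b - c + d) / 2.0  # diagonal difference
    return approx, detail_1, detail_2, detail_3


def haar_idwt2(approx, detail_1, detail_2, detail_3):
    """Inverse of haar_dwt2: rebuild the full-resolution image."""
    h, w = approx.shape
    out = np.empty((2 * h, 2 * w), dtype=np.float64)
    out[0::2, 0::2] = (approx + detail_1 + detail_2 + detail_3) / 2.0
    out[0::2, 1::2] = (approx - detail_1 + detail_2 - detail_3) / 2.0
    out[1::2, 0::2] = (approx + detail_1 - detail_2 - detail_3) / 2.0
    out[1::2, 1::2] = (approx - detail_1 - detail_2 + detail_3) / 2.0
    return out


def wavelet_reconstruction_loss(pred_bands, true_bands):
    """Assumed form: mean squared error averaged over all sub-bands."""
    errors = [np.mean((p - t) ** 2) for p, t in zip(pred_bands, true_bands)]
    return float(np.mean(errors))


# Usage: decompose a random stand-in for a 16x16 HR face image.
rng = np.random.default_rng(0)
hr_image = rng.random((16, 16))
true_bands = haar_dwt2(hr_image)
# A network would predict these bands from the LR input; here small
# noise is added to the true bands as a placeholder for a prediction.
pred_bands = tuple(band + 0.01 * rng.standard_normal(band.shape)
                   for band in true_bands)
loss = wavelet_reconstruction_loss(pred_bands, true_bands)
restored = haar_idwt2(*pred_bands)
```

In this sketch the detail bands carry the edge and texture information that pixel-domain losses tend to smooth away, which is the motivation the abstract gives for working in the wavelet domain; the inverse transform then maps the predicted bands back to an image.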
Pages: 763 - 784
Number of pages: 22
Related papers
50 in total
  • [31] Multi-scale self-attention generative adversarial network for pathology image restoration
    Liang, Meiyan
    Zhang, Qiannan
    Wang, Guogang
    Xu, Na
    Wang, Lin
    Liu, Haishun
    Zhang, Cunlin
    VISUAL COMPUTER, 2023, 39 (09) : 4305 - 4321
  • [32] Multi-scale Generative Adversarial Network for Person Re-identification under Occlusion
    Yang W.-X.
    Yan Y.
    Chen S.
    Zhang X.-K.
    Wang H.-Z.
    Ruan Jian Xue Bao/Journal of Software, 2020, 31 (07) : 1943 - 1958
  • [33] Generative Adversarial Network Based on Multi-scale Dense Feature Fusion for Image Dehazing
    Lian J.
    Chen S.
    Ding K.
    Li L.-H.
    Dongbei Daxue Xuebao/Journal of Northeastern University, 2022, 43 (11) : 1591 - 1598
  • [34] Multi-scale self-attention generative adversarial network for pathology image restoration
    Meiyan Liang
    Qiannan Zhang
    Guogang Wang
    Na Xu
    Lin Wang
    Haishun Liu
    Cunlin Zhang
    The Visual Computer, 2023, 39 : 4305 - 4321
  • [35] Face hallucination using PCA in wavelet domain
    Abdu, Rahiman, V
    Jiji, C., V
    VISAPP 2008: PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON COMPUTER VISION THEORY AND APPLICATIONS, VOL 1, 2008, : 180 - 187
  • [36] Recurrent Generative Adversarial Network for Face Completion
    Wang, Qiang
    Fan, Huijie
    Sun, Gan
    Ren, Weihong
    Tang, Yandong
    IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 : 429 - 442
  • [37] End-to-end latent fingerprint enhancement using multi-scale Generative Adversarial Network
    Pramukha, R. N.
    Akhila, P.
    Koolagudi, Shashidhar G.
    PATTERN RECOGNITION LETTERS, 2024, 184 : 169 - 175
  • [38] Multi-scale RGB and NIR image Cross-fusion based on Generative Adversarial Network
    Xiang, Sen
    Hu, Zishan
    Deng, Huiping
    Wu, Jin
    2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2023, : 4172 - 4177
  • [39] An Unsupervised Multi-scale Generative Adversarial Network for Remote Sensing Image Pan-Sharpening
    Wang, Yajie
    Xie, Yanyan
    Wu, Yanyan
    Liang, Kai
    Qiao, Jilin
    MULTIMEDIA MODELING, MMM 2022, PT II, 2022, 13142 : 356 - 368
  • [40] SIGAN: A Multi-Scale Generative Adversarial Network for Underwater Sonar Image Super-Resolution
    Peng, Chengyang
    Jin, Shaohua
    Bian, Gang
    Cui, Yang
    JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2024, 12 (07)