Spatial-Frequency Mutual Learning for Face Super-Resolution

Cited by: 65
Authors
Wang, Chenyang [1]
Jiang, Junjun [1]
Zhong, Zhiwei [1]
Liu, Xianming [1]
Affiliations
[1] Harbin Institute of Technology, School of Computer Science and Technology, Harbin, People's Republic of China
Source
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2023
Funding
National Natural Science Foundation of China;
Keywords
DOI
10.1109/CVPR52729.2023.02141
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Face super-resolution (FSR) aims to reconstruct high-resolution (HR) face images from low-resolution (LR) ones. With the advent of deep learning, FSR has achieved significant breakthroughs. However, existing FSR methods either have a fixed receptive field or fail to maintain facial structure, which limits their performance. To circumvent this problem, we introduce the Fourier transform, which can capture global facial structure information and achieve an image-size receptive field. Relying on the Fourier transform, we devise a spatial-frequency mutual network (SFMNet) for FSR, which is, as far as we know, the first FSR method to explore the correlations between the spatial and frequency domains. Specifically, SFMNet is a two-branch network with a spatial branch and a frequency branch. Benefiting from the properties of the Fourier transform, the frequency branch achieves an image-size receptive field and captures global dependencies, while the spatial branch extracts local dependencies. Because these dependencies are complementary and both benefit FSR, we further develop a frequency-spatial interaction block (FSIB) that mutually fuses the complementary spatial and frequency information to enhance the capability of the model. Quantitative and qualitative experimental results show that the proposed method outperforms state-of-the-art FSR methods in recovering face images. The implementation and model will be released at https://github.com/wcy-cs/SFMNet.
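The receptive-field claim in the abstract can be illustrated with a small NumPy sketch. This is not the SFMNet implementation: the function names `spatial_branch` and `frequency_branch`, the 16x16 input, and the random weights are all assumptions made for illustration, and the interaction block (FSIB) is not modelled. The sketch feeds a single-pixel impulse through a 3x3 convolution and through a pointwise reweighting of the 2-D Fourier spectrum, then counts how many output pixels respond.

```python
import numpy as np


def spatial_branch(x, kernel):
    """Hypothetical local branch: one 3x3 cross-correlation with zero padding."""
    h, w = x.shape
    padded = np.pad(x, 1)
    out = np.zeros_like(x)
    for dy in range(3):
        for dx in range(3):
            out += kernel[dy, dx] * padded[dy:dy + h, dx:dx + w]
    return out


def frequency_branch(x, weight):
    """Hypothetical global branch: pointwise reweighting of the 2-D spectrum."""
    spectrum = np.fft.rfft2(x)
    return np.fft.irfft2(spectrum * weight, s=x.shape)


rng = np.random.default_rng(0)
size = 16  # assumed toy resolution

# A single bright pixel in the middle of an otherwise empty image.
impulse = np.zeros((size, size))
impulse[size // 2, size // 2] = 1.0

# Random stand-ins for learned parameters (assumptions, not trained values).
kernel = rng.standard_normal((3, 3))
weight = rng.uniform(0.5, 1.5, size=(size, size // 2 + 1))

spatial_out = spatial_branch(impulse, kernel)
freq_out = frequency_branch(impulse, weight)

# Count the output pixels influenced by the single input pixel.
spatial_count = int(np.sum(np.abs(spatial_out) > 1e-12))
freq_count = int(np.sum(np.abs(freq_out) > 1e-12))

print("pixels reached by the 3x3 convolution:", spatial_count)
print("pixels reached by the spectral reweighting:", freq_count)
```

The convolution can reach at most the 3x3 neighbourhood of the impulse, whereas each Fourier coefficient depends on every input pixel, so one pointwise operation in the frequency domain spreads the impulse across the whole image. That is the sense in which a frequency branch has an image-size receptive field.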
Pages: 22356-22366
Number of pages: 11