Learning Face Image Super-Resolution Through Facial Semantic Attribute Transformation and Self-Attentive Structure Enhancement

被引:27
作者
Li, Mengyan [1 ]
Zhang, Zhaoyu [1 ]
Yu, Jun [1 ]
Chen, Chang Wen [2 ]
机构
[1] Univ Sci & Technol China, Dept Automat, Hefei 230027, Peoples R China
[2] SUNY Buffalo, Dept Comp Sci & Engn, Buffalo, NY 14260 USA
基金
中国国家自然科学基金;
关键词
Face; Semantics; Feature extraction; Task analysis; Facial features; Face super-resolution; face hallucination; facial attribute transformation; facial structure enhancement; GENERATIVE ADVERSARIAL NETWORKS; HALLUCINATION; REGRESSION; RETRIEVAL;
D O I
10.1109/TMM.2020.2984092
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Face super-resolution is a domain-specific super-resolution (SR) problem of generating high-resolution (HR) face images from low-resolution (LR) inputs. Even though existing face SR methods have achieved great performance on the global region evaluation, most of them cannot restore local attributes and structure reasonably, especially to ultra-resolve tiny LR face images (16 x 16 pixels) to its larger version (8 x upscaling factor). In this paper, we propose an open source face SR framework based on facial semantic attribute transformation and self-attentive structure enhancement. Specifically, the proposed framework introduces face semantic information (i.e., face attributes) and face structure information (i.e., face boundaries) in a successive two-stage fashion. In the first stage, an Attribute Transformation Network (AT-Net) is established. It upsamples LR face images to HR feature maps and then combines facial attributes with these features to generate the intermediate HR results with rational attributes. In the second stage, a Structure Enhancement Network (SE-Net) is built. It simultaneously extracts face features and estimates facial boundary heatmaps from the inputs, and then fuses them to output the final HR face images. Extensive experiments demonstrate that our method achieves superior super-resolved results and outperforms the state-of-the-art methods.
引用
收藏
页码:468 / 483
页数:16
相关论文
共 65 条
[1]  
[Anonymous], 2017, CVPR
[2]  
Arjovsky M., 2017, arXiv preprint arXiv:1701.07875
[3]  
Baker S., 2000, Proceedings Fourth IEEE International Conference on Automatic Face and Gesture Recognition (Cat. No. PR00580), P83, DOI 10.1109/AFGR.2000.840616
[4]   Super-FAN: Integrated facial landmark localization and super-resolution of real-world low resolution faces in arbitrary poses with GANs [J].
Bulat, Adrian ;
Tzimiropoulos, Georgios .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :109-117
[5]   How far are we from solving the 2D & 3D Face Alignment problem? (and a dataset of 230,000 3D facial landmarks) [J].
Bulat, Adrian ;
Tzimiropoulos, Georgios .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :1021-1030
[6]   Face Recognition and Retrieval Using Cross-Age Reference Coding With Cross-Age Celebrity Dataset [J].
Chen, Bor-Chun ;
Chen, Chu-Song ;
Hsu, Winston H. .
IEEE TRANSACTIONS ON MULTIMEDIA, 2015, 17 (06) :804-815
[7]   FSRNet: End-to-End Learning Face Super-Resolution with Facial Priors [J].
Chen, Yu ;
Tai, Ying ;
Liu, Xiaoming ;
Shen, Chunhua ;
Yang, Jian .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :2492-2501
[8]   StarGAN: Unified Generative Adversarial Networks for Multi-Domain Image-to-Image Translation [J].
Choi, Yunjey ;
Choi, Minje ;
Kim, Munyoung ;
Ha, Jung-Woo ;
Kim, Sunghun ;
Choo, Jaegul .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :8789-8797
[9]   Generative Adversarial Networks An overview [J].
Creswell, Antonia ;
White, Tom ;
Dumoulin, Vincent ;
Arulkumaran, Kai ;
Sengupta, Biswa ;
Bharath, Anil A. .
IEEE SIGNAL PROCESSING MAGAZINE, 2018, 35 (01) :53-65
[10]   Trunk-Branch Ensemble Convolutional Neural Networks for Video-Based Face Recognition [J].
Ding, Changxing ;
Tao, Dacheng .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :1002-1014