A Composite Network Model for Face Super-Resolution with Multi-Order Head Attention Facial Priors

被引:7
作者
Wei, Feng [1 ,2 ]
Wang, Song [3 ]
Yang, Jucheng [2 ]
Sun, Xiao [1 ,2 ]
Wang, Yuan [2 ]
Chen, Yarui [2 ]
机构
[1] Tianjin Univ Sci & Technol, Coll Mech Engn, Tianjin 300457, Peoples R China
[2] Tianjin Univ Sci & Technol, Coll Artificial Intelligence, Tianjin 300457, Peoples R China
[3] La Trobe Univ, Dept Engn, Bundoora, Vic 3086, Australia
基金
中国国家自然科学基金;
关键词
Face super -resolution; FSR; Multi -order head attention; Facial components; Prior information; Transformer;
D O I
10.1016/j.patcog.2023.109503
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Face super-resolution (FSR) aims to reconstruct high-resolution face images from low-resolution (LR) ones. Despite the progress made by deep convolutional neural networks (DCNNs) on FSR, convolutions struggle to relate spatially distant concepts and what is more, all image pixels and prior information (e.g., landmarks and facial component heatmaps) are treated equally regardless of importance, causing inaccuracy and decreasing the quality of face image recovery. To address these issues, in this paper we propose a composite network model for FSR with multi-order head attention facial priors. The proposed model contains a face hallucination transformer (FHT)-based network and a multi-order head attention (MOHA)based DCNN. The FHT-based network can capture long-range dependencies and gradually increase resolution to achieve efficient and effective inference, while the MOHA-based DCNN exploits detailed and two-dimensional information of LR face images. Moreover, the novel generic submodule of the MOHAbased DCNN, namely Multi-Order Head Attention Network, can accurately model the relationship of facial components between spatial and channel dimensions. The proposed composite network model seamlessly integrates the advantages of DCNNs and transformers to super-resolve LR face images. When compared with state-of-the-art FSR methods on public benchmark datasets, the proposed model shows competitive recovery performance.
引用
收藏
页数:10
相关论文
共 33 条
[1]   Self-restrained triplet loss for accurate masked face recognition [J].
Boutros, Fadi ;
Damer, Naser ;
Kirchbuchner, Florian ;
Kuijper, Arjan .
PATTERN RECOGNITION, 2022, 124
[2]   Learning Spatial Attention for Face Super-Resolution [J].
Chen, Chaofeng ;
Gong, Dihong ;
Wang, Hao ;
Li, Zhifeng ;
Wong, Kwan-Yee K. .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 :1219-1231
[3]   FSRNet: End-to-End Learning Face Super-Resolution with Facial Priors [J].
Chen, Yu ;
Tai, Ying ;
Liu, Xiaoming ;
Shen, Chunhua ;
Yang, Jian .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :2492-2501
[4]  
Chudasama Vishal, 2021, IEEE Transactions on Biometrics, Behavior, and Identity Science, V3, P166, DOI 10.1109/TBIOM.2021.3059196
[5]   Second-order Attention Network for Single Image Super-Resolution [J].
Dai, Tao ;
Cai, Jianrui ;
Zhang, Yongbing ;
Xia, Shu-Tao ;
Zhang, Lei .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :11057-11066
[6]  
Dosovitskiy A., 2021, INT C LEARNING REPRE
[7]   Face Hallucination Using Cascaded Super-Resolution and Identity Priors [J].
Grm, Klemen ;
Scheirer, Walter J. ;
Struc, Vitomir .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 (01) :2150-2165
[8]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[9]   Super-resolution of human face image using canonical correlation analysis [J].
Huang, Hua ;
He, Huiting ;
Fan, Xin ;
Zhang, Junping .
PATTERN RECOGNITION, 2010, 43 (07) :2532-2543
[10]   Deep Learning-based Face Super-resolution: A Survey [J].
Jiang, Junjun ;
Wang, Chenyang ;
Liu, Xianming ;
Ma, Jiayi .
ACM COMPUTING SURVEYS, 2023, 55 (01)