A joint learning method with consistency-aware for low-resolution facial expression recognition

被引:4
作者
Xie, Yuanlun [1 ]
Tian, Wenhong [1 ]
Song, Liang [2 ]
Xue, Ruini [3 ]
Zha, Zhiyuan [4 ]
Wen, Bihan [4 ]
机构
[1] Univ Elect Sci & Technol China, Sch Informat & Software Engn, Chengdu 610054, Sichuan Provinc, Peoples R China
[2] Tsinghua Univ, Sichuan Energy Internet Res Inst, Zone A,Tianfu New Econ Ind Pk, Chengdu 610213, Sichuan Provinc, Peoples R China
[3] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, Chengdu, Peoples R China
[4] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore 639798, Singapore
关键词
Facial expression recognition; Image super-resolution; Deep learning; High-level vision task;
D O I
10.1016/j.eswa.2023.123022
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Existing facial expression recognition (FER) methods are mainly devoted to learning discriminative features from high-resolution images. However, when applied to low-resolution images, their performance drops rapidly. This paper proposes a unified learning framework (namely SR-FER) by cascading the image super-resolution (SR) task and FER task to alleviate the low-resolution challenge. It effectively feeds back expression-related information from the FER network to the SR network, and returns the quality-enhanced expression images via a SR network. Specifically, a multi-stage attention-aware consistency loss module is introduced to help the SR network achieve discriminative feature restoration guided by attention information. Furthermore, a prediction consistency loss module is also developed to encourage the SR network to restore discriminative features by reducing the difference in prediction information between the restored and original normal-resolution images. Therefore, more accurate results are obtained by performing FER on the restored images. We conduct extensive experiments to demonstrate that the proposed low-resolution FER solution can help SR methods restore features favorable for FER while maintaining acceptable FER performance in various resolution degradation scenarios. The proposed method effectively improves the FER challenge under resolution degradation conditions, which is of good reference value for real-world applications.
引用
收藏
页数:15
相关论文
共 49 条
[1]  
[Anonymous], 2011, Handbook of face recognition
[2]   Training Deep Networks for Facial Expression Recognition with Crowd-Sourced Label Distribution [J].
Barsoum, Emad ;
Zhang, Cha ;
Ferrer, Cristian Canton ;
Zhang, Zhengyou .
ICMI'16: PROCEEDINGS OF THE 18TH ACM INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, 2016, :279-283
[3]  
Cheng BW, 2017, INT CONF AFFECT, P65
[4]   Accelerating the Super-Resolution Convolutional Neural Network [J].
Dong, Chao ;
Loy, Chen Change ;
Tang, Xiaoou .
COMPUTER VISION - ECCV 2016, PT II, 2016, 9906 :391-407
[5]   Learning a Deep Convolutional Network for Image Super-Resolution [J].
Dong, Chao ;
Loy, Chen Change ;
He, Kaiming ;
Tang, Xiaoou .
COMPUTER VISION - ECCV 2014, PT IV, 2014, 8692 :184-199
[6]   Facial Expression Recognition in the Wild via Deep Attentive Center Loss [J].
Farzaneh, Amir Hossein ;
Qi, Xiaojun .
2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WACV 2021, 2021, :2401-2410
[7]   Generative Adversarial Networks [J].
Goodfellow, Ian ;
Pouget-Abadie, Jean ;
Mirza, Mehdi ;
Xu, Bing ;
Warde-Farley, David ;
Ozair, Sherjil ;
Courville, Aaron ;
Bengio, Yoshua .
COMMUNICATIONS OF THE ACM, 2020, 63 (11) :139-144
[8]   Visual Attention Consistency under Image Transforms for Multi-Label Image Classification [J].
Guo, Hao ;
Zheng, Kang ;
Fan, Xiaochuan ;
Yu, Hongkai ;
Wang, Song .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :729-739
[9]   Task-Driven Super Resolution: Object Detection in Low-Resolution Images [J].
Haris, Muhammad ;
Shakhnarovich, Greg ;
Ukita, Norimichi .
NEURAL INFORMATION PROCESSING, ICONIP 2021, PT V, 2021, 1516 :387-395
[10]   Facial Expression Recognition Using Enhanced Deep 3D Convolutional Neural Networks [J].
Hasani, Behzad ;
Mahoor, Mohammad H. .
2017 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2017, :2278-2288