Orthogonal channel attention-based multi-task learning for multi-view facial expression recognition

被引:23
作者
Chen, Jingying [1 ,2 ]
Yang, Lei [1 ]
Tan, Lei [3 ]
Xu, Ruyi [2 ]
机构
[1] Cent China Normal Univ, Natl Engn Res Ctr E Learning, Wuhan 430079, Peoples R China
[2] Cent China Normal Univ, Natl Engn Lab Educ Big Data, Wuhan 430079, Peoples R China
[3] Xiamen Univ, Sch Informat, Dept Artificial Intelligence, Xiamen 361005, Peoples R China
关键词
Multi-view facial expression recognition; Orthogonal channel attention; Multi-task learning; Siamese convolutional neural network; Separated channel attention module;
D O I
10.1016/j.patcog.2022.108753
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-view facial expression recognition (FER) is a challenging computer vision task due to the large intra-class difference caused by viewpoint variations. This paper presents a novel orthogonal channel attention-based multi-task learning (OCA-MTL) approach for FER. The proposed OCA-MTL approach adopts a Siamese convolutional neural network (CNN) to force the multi-view expression recognition model to learn the same features as the frontal expression recognition model. To further enhance the recognition accuracy of non-frontal expression, the multi-view expression model adopts a multi-task learning framework that regards head pose estimation (HPE) as an auxiliary task. A separated channel attention (SCA) module is embedded in the multi-task learning framework to generate individual attention for FER and HPE. Furthermore, orthogonal channel attention loss is presented to force the model to employ different feature channels to represent the facial expression and head pose, thereby decoupling them. The proposed approach is performed on two public facial expression datasets to evaluate its effectiveness and achieves an average recognition accuracy rate of 88.41 % under 13 viewpoints on Multi-PIE and 89.04% under 5 viewpoints on KDEF, outperforming state-of-the-art methods.(c) 2022 Elsevier Ltd. All rights reserved.
引用
收藏
页数:13
相关论文
共 40 条
  • [1] Baddar WJ, 2019, AAAI CONF ARTIF INTE, P3215
  • [2] Learning Features Robust to Image Variations with Siamese Networks for Facial Expression Recognition
    Baddar, Wissam J.
    Kim, Dae Hoe
    Ro, Yong Man
    [J]. MULTIMEDIA MODELING (MMM 2017), PT I, 2017, 10132 : 189 - 200
  • [3] Residual multi-task learning for facial landmark localization and expression recognition
    Chen, Boyu
    Guan, Wenlong
    Li, Peixia
    Ikeda, Naoki
    Hirasawa, Kosuke
    Lu, Huchuan
    [J]. PATTERN RECOGNITION, 2021, 115
  • [4] Ekman R, 1997, WHAT FACE REVEALS BA
  • [5] Multi-View Facial Expression Recognition based on Multitask Learning and Generative Adversarial Network
    Fan, JiaJun
    Wang, ShiPu
    Yang, Po
    Yang, Yun
    [J]. 2020 IEEE 18TH INTERNATIONAL CONFERENCE ON INDUSTRIAL INFORMATICS (INDIN), VOL 1, 2020, : 573 - 578
  • [6] Fischer M., 2012, 2012 IEEE Fifth International Conference On Biometrics: Theory, Applications And Systems (BTAS 2012), P331, DOI 10.1109/BTAS.2012.6374597
  • [7] Multiple Attention Network for Facial Expression Recognition
    Gan, Yanling
    Chen, Jingying
    Yang, Zongkai
    Xu, Luhui
    [J]. IEEE ACCESS, 2020, 8 : 7383 - 7393
  • [8] The Stability Criterion Model and Stability Analysis of Waxy Crude Oil Pipeline Transportation System Based on Excess Entropy Production
    Gan Yifan
    Cheng Qinglin
    Sun Wei
    Gao Wei
    Liu Xiaoyan
    Liu Yang
    [J]. JOURNAL OF THERMAL SCIENCE, 2018, 27 (06) : 541 - 554
  • [9] Guney F., 2013, IEEE Conference on Automatic Face and Gesture Recogition, P1, DOI 10.1109/FG.2013.6553814
  • [10] Sex differences in scanning faces: Does attention to the eyes explain female superiority in facial expression recognition?
    Hall, Jessica K.
    Hutton, Sam B.
    Morgan, Michael J.
    [J]. COGNITION & EMOTION, 2010, 24 (04) : 629 - 637