Facial Expression Recognition Based on Vision Transformer with Hybrid Local Attention

Cited by: 1
Authors
Tian, Yuan [1]
Zhu, Jingxuan [1]
Yao, Huang [1]
Chen, Di [1]
Affiliations
[1] Cent China Normal Univ, Fac Artificial Intelligence Educ, Wuhan 430079, Peoples R China
Source
APPLIED SCIENCES-BASEL | 2024 / Volume 14 / Issue 15
Keywords
facial expression recognition; attention; vision transformer
DOI
10.3390/app14156471
Chinese Library Classification number
O6 [Chemistry];
Discipline classification code
0703;
Abstract
Facial expression recognition has broad application prospects in many settings. Because facial expressions are complex and variable, it remains a challenging research topic. This paper proposes a Vision Transformer expression recognition method based on hybrid local attention (HLA-ViT). The network adopts a dual-stream structure: one stream extracts hybrid local features and the other extracts global contextual features. Together, the two streams constitute a global-local fusion attention. A hybrid local attention module is proposed to enhance the network's robustness to face occlusion and head pose variations. A convolutional neural network is combined with the hybrid local attention module to obtain feature maps with prominent local information. The ViT then captures robust features from the global perspective of the visual sequence context. Finally, a decision-level fusion mechanism fuses the expression features with the prominent local information, adding complementary information that improves the network's recognition performance and its robustness to interference factors such as occlusion and head pose changes in natural scenes. Extensive experiments show that the HLA-ViT network achieves excellent performance, with 90.45% on RAF-DB, 90.13% on FERPlus, and 65.07% on AffectNet.
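The abstract mentions decision-level fusion of a local stream and a global stream but does not give the fusion rule. As a rough illustration only, the sketch below assumes one common form of decision-level fusion: each stream produces class logits, the logits are turned into probabilities, and the two probability vectors are averaged with a weight. The function names, the `local_weight` parameter, and the example logits are hypothetical and are not taken from the paper.

```python
import math


def softmax(logits):
    """Convert a list of logits to probabilities (numerically stable)."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def fuse_decisions(local_logits, global_logits, local_weight=0.5):
    """Weighted average of the two streams' class probabilities.

    Assumption: a simple convex combination stands in for the paper's
    fusion mechanism, which the abstract does not specify.
    """
    if len(local_logits) != len(global_logits):
        raise ValueError("both streams must score the same set of classes")
    if not 0.0 <= local_weight <= 1.0:
        raise ValueError("local_weight must lie in [0, 1]")
    p_local = softmax(local_logits)
    p_global = softmax(global_logits)
    return [
        local_weight * pl + (1.0 - local_weight) * pg
        for pl, pg in zip(p_local, p_global)
    ]


def predict(fused_probs):
    """Return the index of the most probable class."""
    return max(range(len(fused_probs)), key=fused_probs.__getitem__)


# Hypothetical logits for three expression classes from each stream.
local_stream = [2.0, 0.5, 0.1]
global_stream = [0.2, 1.5, 0.3]
fused = fuse_decisions(local_stream, global_stream, local_weight=0.5)
label = predict(fused)
```

Averaging probabilities rather than raw logits keeps the fused output a valid distribution regardless of how differently the two streams scale their scores, which is one reason this form is often used when the streams are trained with separate heads.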
Pages: 15