TFE: A Transformer Architecture for Occlusion Aware Facial Expression Recognition

被引:9
|
作者
Gao, Jixun [1 ]
Zhao, Yuanyuan [2 ]
机构
[1] Henan Univ Engn, Dept Comp Sci, Zhengzhou, Peoples R China
[2] Zhengzhou Univ Technol, Dept Comp Sci, Zhengzhou, Peoples R China
来源
FRONTIERS IN NEUROROBOTICS | 2021年 / 15卷
关键词
affective computing; facial expression recognition; occlusion; transformer; deep learning;
D O I
10.3389/fnbot.2021.763100
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Facial expression recognition (FER) in uncontrolled environment is challenging due to various un-constrained conditions. Although existing deep learning-based FER approaches have been quite promising in recognizing frontal faces, they still struggle to accurately identify the facial expressions on the faces that are partly occluded in unconstrained scenarios. To mitigate this issue, we propose a transformer-based FER method (TFE) that is capable of adaptatively focusing on the most important and unoccluded facial regions. TFE is based on the multi-head self-attention mechanism that can flexibly attend to a sequence of image patches to encode the critical cues for FER. Compared with traditional transformer, the novelty of TFE is two-fold: (i) To effectively select the discriminative facial regions, we integrate all the attention weights in various transformer layers into an attention map to guide the network to perceive the important facial regions. (ii) Given an input occluded facial image, we use a decoder to reconstruct the corresponding non-occluded face. Thus, TFE is capable of inferring the occluded regions to better recognize the facial expressions. We evaluate the proposed TFE on the two prevalent in-the-wild facial expression datasets (AffectNet and RAF-DB) and the their modifications with artificial occlusions. Experimental results show that TFE improves the recognition accuracy on both the non-occluded faces and occluded faces. Compared with other state-of-the-art FE methods, TFE obtains consistent improvements. Visualization results show TFE is capable of automatically focusing on the discriminative and non-occluded facial regions for robust FER.
引用
收藏
页数:10
相关论文
共 50 条
  • [31] Context Transformer and Adaptive Method with Visual Transformer for Robust Facial Expression Recognition
    Xiong, Lingxin
    Zhang, Jicun
    Zheng, Xiaojia
    Wang, Yuxin
    APPLIED SCIENCES-BASEL, 2024, 14 (04):
  • [32] Hybrid Attention-Aware Learning Network for Facial Expression Recognition in the Wild
    Gong, Weijun
    La, Zhiyao
    Qian, Yurong
    Zhou, Weihang
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2024, 49 (09) : 12203 - 12217
  • [33] Multi-Relations Aware Network for In-the-Wild Facial Expression Recognition
    Chen, Dongliang
    Wen, Guihua
    Li, Huihui
    Chen, Rui
    Li, Cheng
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (08) : 3848 - 3859
  • [34] OAFormer: Occlusion Aware Transformer for Camouflaged Object Detection
    Yang, Xin
    Zhu, Hengliang
    Mao, Guojun
    Xing, Shuli
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 1421 - 1426
  • [35] Random Gabor based templates for facial expression recognition in images with facial occlusion
    Zhang, Ligang
    Tjondronegoro, Dian
    Chandran, Vinod
    NEUROCOMPUTING, 2014, 145 : 451 - 464
  • [36] Pose-Aware Facial Expression Recognition Assisted by Expression Descriptions
    Wang, Shangfei
    Wu, Yi
    Chang, Yanan
    Li, Guoming
    Mao, Meng
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2024, 15 (01) : 241 - 253
  • [37] VaBTFER: An Effective Variant Binary Transformer for Facial Expression Recognition
    Shen, Lei
    Jin, Xing
    SENSORS, 2024, 24 (01)
  • [38] Research on Facial Expression Recognition Algorithm Based on Lightweight Transformer
    Jiang, Bin
    Li, Nanxing
    Cui, Xiaomei
    Liu, Weihua
    Yu, Zeqi
    Xie, Yongheng
    INFORMATION, 2024, 15 (06)
  • [39] Swin-FER: Swin Transformer for Facial Expression Recognition
    Bie, Mei
    Xu, Huan
    Gao, Yan
    Song, Kai
    Che, Xiangjiu
    APPLIED SCIENCES-BASEL, 2024, 14 (14):
  • [40] Former-DFER: Dynamic Facial Expression Recognition Transformer
    Zhao, Zengqun
    Liu, Qingshan
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 1553 - 1561