TFE: A Transformer Architecture for Occlusion Aware Facial Expression Recognition

被引:9
|
作者
Gao, Jixun [1 ]
Zhao, Yuanyuan [2 ]
机构
[1] Henan Univ Engn, Dept Comp Sci, Zhengzhou, Peoples R China
[2] Zhengzhou Univ Technol, Dept Comp Sci, Zhengzhou, Peoples R China
来源
FRONTIERS IN NEUROROBOTICS | 2021年 / 15卷
关键词
affective computing; facial expression recognition; occlusion; transformer; deep learning;
D O I
10.3389/fnbot.2021.763100
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Facial expression recognition (FER) in uncontrolled environment is challenging due to various un-constrained conditions. Although existing deep learning-based FER approaches have been quite promising in recognizing frontal faces, they still struggle to accurately identify the facial expressions on the faces that are partly occluded in unconstrained scenarios. To mitigate this issue, we propose a transformer-based FER method (TFE) that is capable of adaptatively focusing on the most important and unoccluded facial regions. TFE is based on the multi-head self-attention mechanism that can flexibly attend to a sequence of image patches to encode the critical cues for FER. Compared with traditional transformer, the novelty of TFE is two-fold: (i) To effectively select the discriminative facial regions, we integrate all the attention weights in various transformer layers into an attention map to guide the network to perceive the important facial regions. (ii) Given an input occluded facial image, we use a decoder to reconstruct the corresponding non-occluded face. Thus, TFE is capable of inferring the occluded regions to better recognize the facial expressions. We evaluate the proposed TFE on the two prevalent in-the-wild facial expression datasets (AffectNet and RAF-DB) and the their modifications with artificial occlusions. Experimental results show that TFE improves the recognition accuracy on both the non-occluded faces and occluded faces. Compared with other state-of-the-art FE methods, TFE obtains consistent improvements. Visualization results show TFE is capable of automatically focusing on the discriminative and non-occluded facial regions for robust FER.
引用
收藏
页数:10
相关论文
共 50 条
  • [21] AVT: AU-ASSISTED VISUAL TRANSFORMER FOR FACIAL EXPRESSION RECOGNITION
    Jin, Rijin
    Zhao, Sirui
    Hao, Zhongkai
    Xu, Yifan
    Xu, Tong
    Chen, Enhong
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 2661 - 2665
  • [22] GAT-Net: A Network using Grid Attention and Transformer for Dynamic Facial Expression Recognition
    Dai, Wnxin
    Dai, Yaping
    Hirota, Kaoru
    Shao, Shuai
    2022 34TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2022, : 5072 - 5077
  • [23] Deep Neural Network Architecture: Application for Facial Expression Recognition
    Garcia, M.
    Ramirez, S.
    IEEE LATIN AMERICA TRANSACTIONS, 2020, 18 (07) : 1311 - 1319
  • [24] Identity-aware convolutional neural networks for facial expression recognition
    Chongsheng Zhang
    Pengyou Wang
    Ke Chen
    Joni-Kristian Kmrinen
    Journal of Systems Engineering and Electronics, 2017, 28 (04) : 784 - 792
  • [25] An analysis of facial expression recognition under partial facial image occlusion
    Kotsia, Irene
    Buciu, Loan
    Pitas, Loannis
    IMAGE AND VISION COMPUTING, 2008, 26 (07) : 1052 - 1067
  • [26] Facial Expression Recognition Based on Squeeze Vision Transformer
    Kim, Sangwon
    Nam, Jaeyeal
    Ko, Byoung Chul
    SENSORS, 2022, 22 (10)
  • [27] Identity-aware convolutional neural networks for facial expression recognition
    Zhang, Chongsheng
    Wang, Pengyou
    Chen, Ke
    Kamarainen, Joni-Kristian
    JOURNAL OF SYSTEMS ENGINEERING AND ELECTRONICS, 2017, 28 (04) : 784 - 792
  • [28] Complexity aware center loss for facial expression recognition
    Li, Huihui
    Yuan, Xu
    Xu, Chunlin
    Zhang, Rui
    Liu, Xiaoyong
    Liu, Lianqi
    VISUAL COMPUTER, 2024, 40 (11) : 8045 - 8054
  • [29] Target-aware transformer tracking with hard occlusion instance generation
    Xiao, Dingkun
    Wei, Zhenzhong
    Zhang, Guangjun
    FRONTIERS IN NEUROROBOTICS, 2024, 17
  • [30] COMPACT SELECTIVE TRANSFORMER BASED ON INFORMATION ENTROPY FOR FACIAL EXPRESSION RECOGNITION IN THE WILD
    Guo, Liyuan
    Jin, Lianghai
    Ma, Guangzhi
    Xu, Xiangyang
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 2345 - 2349