TFE: A Transformer Architecture for Occlusion Aware Facial Expression Recognition

被引：9

作者：

Gao, Jixun ^{[1
]}

Zhao, Yuanyuan ^{[2
]}

机构：

[1] Henan Univ Engn, Dept Comp Sci, Zhengzhou, Peoples R China

[2] Zhengzhou Univ Technol, Dept Comp Sci, Zhengzhou, Peoples R China

来源：

FRONTIERS IN NEUROROBOTICS | 2021年 / 15卷

关键词：

affective computing; facial expression recognition; occlusion; transformer; deep learning;

D O I：

10.3389/fnbot.2021.763100

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Facial expression recognition (FER) in uncontrolled environment is challenging due to various un-constrained conditions. Although existing deep learning-based FER approaches have been quite promising in recognizing frontal faces, they still struggle to accurately identify the facial expressions on the faces that are partly occluded in unconstrained scenarios. To mitigate this issue, we propose a transformer-based FER method (TFE) that is capable of adaptatively focusing on the most important and unoccluded facial regions. TFE is based on the multi-head self-attention mechanism that can flexibly attend to a sequence of image patches to encode the critical cues for FER. Compared with traditional transformer, the novelty of TFE is two-fold: (i) To effectively select the discriminative facial regions, we integrate all the attention weights in various transformer layers into an attention map to guide the network to perceive the important facial regions. (ii) Given an input occluded facial image, we use a decoder to reconstruct the corresponding non-occluded face. Thus, TFE is capable of inferring the occluded regions to better recognize the facial expressions. We evaluate the proposed TFE on the two prevalent in-the-wild facial expression datasets (AffectNet and RAF-DB) and the their modifications with artificial occlusions. Experimental results show that TFE improves the recognition accuracy on both the non-occluded faces and occluded faces. Compared with other state-of-the-art FE methods, TFE obtains consistent improvements. Visualization results show TFE is capable of automatically focusing on the discriminative and non-occluded facial regions for robust FER.

引用

页数：10

共 50 条

[31] Context Transformer and Adaptive Method with Visual Transformer for Robust Facial Expression Recognition
Xiong, Lingxin
Zhang, Jicun
Zheng, Xiaojia
Wang, Yuxin
APPLIED SCIENCES-BASEL, 2024, 14 (04):
[32] Hybrid Attention-Aware Learning Network for Facial Expression Recognition in the Wild
Gong, Weijun
La, Zhiyao
Qian, Yurong
Zhou, Weihang
ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2024, 49 (09) : 12203 - 12217
[33] Multi-Relations Aware Network for In-the-Wild Facial Expression Recognition
Chen, Dongliang
Wen, Guihua
Li, Huihui
Chen, Rui
Li, Cheng
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (08) : 3848 - 3859
[34] OAFormer: Occlusion Aware Transformer for Camouflaged Object Detection
Yang, Xin
Zhu, Hengliang
Mao, Guojun
Xing, Shuli
2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 1421 - 1426
[35] Random Gabor based templates for facial expression recognition in images with facial occlusion
Zhang, Ligang
Tjondronegoro, Dian
Chandran, Vinod
NEUROCOMPUTING, 2014, 145 : 451 - 464
[36] Pose-Aware Facial Expression Recognition Assisted by Expression Descriptions
Wang, Shangfei
Wu, Yi
Chang, Yanan
Li, Guoming
Mao, Meng
IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2024, 15 (01) : 241 - 253
[37] VaBTFER: An Effective Variant Binary Transformer for Facial Expression Recognition
Shen, Lei
Jin, Xing
SENSORS, 2024, 24 (01)
[38] Research on Facial Expression Recognition Algorithm Based on Lightweight Transformer
Jiang, Bin
Li, Nanxing
Cui, Xiaomei
Liu, Weihua
Yu, Zeqi
Xie, Yongheng
INFORMATION, 2024, 15 (06)
[39] Swin-FER: Swin Transformer for Facial Expression Recognition
Bie, Mei
Xu, Huan
Gao, Yan
Song, Kai
Che, Xiangjiu
APPLIED SCIENCES-BASEL, 2024, 14 (14):
[40] Former-DFER: Dynamic Facial Expression Recognition Transformer
Zhao, Zengqun
Liu, Qingshan
PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 1553 - 1561

← 1 2 3 4 5 →