Seeing Through the Mask: Recognition of Genuine Emotion Through Masked Facial Expression

被引:4
作者
Zhou, Ju [1 ]
Liu, Xinyu [1 ]
Wang, Hanpu [1 ]
Zhang, Zheyuan [1 ]
Chen, Tong [1 ,2 ]
Fu, Xiaolan [2 ,3 ]
Liu, Guangyuan [1 ]
机构
[1] Southwest Univ, Coll Elect & Informat Engn, Chongqing 400715, Peoples R China
[2] Chinese Acad Sci, Inst Psychol, State Key Lab Brain & Cognit Sci, Beijing 100101, Peoples R China
[3] Univ Chinese Acad Sci, Dept Psychol, Beijing 100049, Peoples R China
关键词
Emotion recognition; Videos; Face recognition; Task analysis; Feature extraction; Transformers; Convolutional neural networks; Intensity modulation; Vision sensors; Decoupled convolution; dynamic action unit intensity features (DAIFs); emotion recognition; hidden emotion; masked facial expression (MFE); vision Transformer (ViT); DATABASE; MODEL;
D O I
10.1109/TCSS.2024.3404611
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The purpose of facial expression recognition is to recognize the corresponding emotions. However, people tend to hide their emotions by displaying facial expressions that differ from those evoked by emotions. These inconsistent facial expressions are referred to as masked facial expressions (MFEs). The automatic recognition of hidden emotions within an MFE using image data is challenging. In this study, we find distinctive movement patterns in the facial action units (AUs) of MFE sequences through a detailed analysis. Considering our findings, we propose handcrafted features called dynamic AU intensity features (DAIFs) to represent AU movement. Furthermore, we develop a decoupled AU transformer (DAUT) model for recognition, where the decoupled convolution operators ensure that the temporal information in the DAIF is not damaged. To further improve the recognition performance, we design self-supervised clip prediction for pretraining of DAUT. Experimental results demonstrate that our proposed method performs exceptionally well across all tasks in the MFE dataset, particularly improving accuracy by nearly double on the most challenging 36-class task. This suggests that leveraging temporal information from facial AU movements is a reliable and effective technique for recognizing MFEs.
引用
收藏
页码:7159 / 7172
页数:14
相关论文
共 45 条
[21]   CAS(ME)3: A Third Generation Facial Spontaneous Micro-Expression Database With Depth Information and High Ecological Validity [J].
Li, Jingting ;
Dong, Zizhao ;
Lu, Shaoyuan ;
Wang, Su-Jing ;
Yan, Wen-Jing ;
Ma, Yinhuan ;
Liu, Ye ;
Huang, Changbing ;
Fu, Xiaolan .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (03) :2782-2800
[22]   Less is more: Micro-expression recognition from video using apex frame [J].
Liong, Sze-Teng ;
See, John ;
Wong, KokSheik ;
Phan, Raphael C. -W. .
SIGNAL PROCESSING-IMAGE COMMUNICATION, 2018, 62 :82-92
[23]   Decoupled Networks [J].
Liu, Weiyang ;
Liu, Zhen ;
Yu, Zhiding ;
Dai, Bo ;
Lin, Rongmei ;
Wang, Yisen ;
Rehg, James M. ;
Song, Le .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :2771-2779
[24]   Swin Transformer: Hierarchical Vision Transformer using Shifted Windows [J].
Liu, Ze ;
Lin, Yutong ;
Cao, Yue ;
Hu, Han ;
Wei, Yixuan ;
Zhang, Zheng ;
Lin, Stephen ;
Guo, Baining .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :9992-10002
[25]  
Martinez A, 2012, J MACH LEARN RES, V13, P1589
[26]   MFED: A Database for Masked Facial Expression [J].
Mo, Fan ;
Zhang, Zhihao ;
Chen, Tong ;
Zhao, Ke ;
Fu, Xiaolan .
IEEE ACCESS, 2021, 9 (09) :96279-96287
[27]   Transfer Model Collaborating Metric Learning and Dictionary Learning for Cross-Domain Facial Expression Recognition [J].
Ni, Tongguang ;
Zhang, Cong ;
Gu, Xiaoqing .
IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2021, 8 (05) :1213-1222
[28]   Multiresolution gray-scale and rotation invariant texture classification with local binary patterns [J].
Ojala, T ;
Pietikäinen, M ;
Mäenpää, T .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2002, 24 (07) :971-987
[29]  
Peng Wenshu., 2019, IEEE INT CONF AUTOMA, P1, DOI [DOI 10.1109/fg.2019.8756541, 10.1109/FG.2019.8756541, DOI 10.1109/FG.2019.8756541]
[30]   Reading between the lies: Identifying concealed and falsified emotions in universal facial expressions [J].
Porter, Stephen ;
ten Brinke, Leanne .
PSYCHOLOGICAL SCIENCE, 2008, 19 (05) :508-514