Pose-Aware Facial Expression Recognition Assisted by Expression Descriptions

被引:1
作者
Wang, Shangfei [1 ]
Wu, Yi [1 ]
Chang, Yanan [1 ]
Li, Guoming [2 ]
Mao, Meng [2 ]
机构
[1] Univ Sci & Technol China, Sch Comp Sci & Technol, Key Lab Comp & Commun Software Anhui Provice, Hefei 230027, Anhui, Peoples R China
[2] China Merchants Bank, AI Lab, Shenzhen 518000, Guangdong, Peoples R China
关键词
Pose-aware; expression descriptions; facial expression recognition; cross-modality attention; JOINT POSE; MULTIVIEW;
D O I
10.1109/TAFFC.2023.3267774
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Although expression descriptions provide additional information about facial behaviors despite of different poses, and pose features are beneficial to adapt to pose variety, neither has been fully leveraged in facial expression recognition. This paper proposes a pose-aware text-assisted facial expression recognition method using cross-modality attention. Specifically, the method contains three components. The pose feature extractor extracts pose-related features from facial images, and then cooperates with a fully-connected layer for pose classification. When poses can be clearly discriminated and classified, features obtained from the extractor can represent the corresponding poses. To eliminate bias due to appearance and illumination, cluster centers are taken as the final pose features. The text feature extractor obtains embeddings from expression descriptions. These descriptions are first passed through Intra-Exp attention to obtain preliminary embeddings. To leverage the correlations among expressions, all expression embeddings are then concatenated and passed through Inter-Exp attention. The cross-modality module attempts to learn attention maps that distinguish the importance of facial regions by using prior knowledge about poses and expression descriptions. The image features weighted by the attention maps are utilized to recognize pose and expression jointly. Experiments on three benchmark datasets demonstrate the superiority of the proposed method.
引用
收藏
页码:241 / 253
页数:13
相关论文
共 31 条
[11]   Emotion-Preserving Representation Learning via Generative Adversarial Network for Multi-view Facial Expression Recognition [J].
Lai, Ying-Hsiu ;
Lai, Shang-Hong .
PROCEEDINGS 2018 13TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE & GESTURE RECOGNITION (FG 2018), 2018, :263-270
[12]   Attention mechanism-based CNN for facial expression recognition [J].
Li, Jing ;
Jin, Kan ;
Zhou, Dalin ;
Kubota, Naoyuki ;
Ju, Zhaojie .
NEUROCOMPUTING, 2020, 411 :340-350
[13]   Deep Facial Expression Recognition: A Survey [J].
Li, Shan ;
Deng, Weihong .
IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2022, 13 (03) :1195-1215
[14]   Reliable Crowdsourcing and Deep Locality-Preserving Learning for Expression Recognition in the Wild [J].
Li, Shan ;
Deng, Weihong ;
Du, JunPing .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :2584-2593
[15]   Occlusion Aware Facial Expression Recognition Using CNN With Attention Mechanism [J].
Li, Yong ;
Zeng, Jiabei ;
Shan, Shiguang ;
Chen, Xilin .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (05) :2439-2450
[16]   Dynamic multi-channel metric network for joint pose-aware and identity-invariant facial expression recognition [J].
Liu, Yuanyuan ;
Dai, Wei ;
Fang, Fang ;
Chen, Yongquan ;
Huang, Rui ;
Wang, Run ;
Wan, Bo .
INFORMATION SCIENCES, 2021, 578 (195-213) :195-213
[17]   FERAtt: Facial Expression Recognition with Attention Net [J].
Marrero Fernandez, Pedro D. ;
Guerrero Pena, Fidel A. ;
Ren, Tsang Ing ;
Cunha, Alexandre .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, :837-846
[18]   Deep Disturbance-Disentangled Learning for Facial Expression Recognition [J].
Ruan, Delian ;
Yan, Yan ;
Chen, Si ;
Xue, Jing-Hao ;
Wang, Hanzi .
MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, :2833-2841
[19]   Recognizing action units for facial expression analysis [J].
Tian, YI ;
Kanade, T ;
Cohn, JF .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2001, 23 (02) :97-115
[20]  
Vaswani A, 2017, ADV NEUR IN, V30