Orthogonal channel attention-based multi-task learning for multi-view facial expression recognition

Cited by: 24

Authors:

Chen, Jingying [1,2]

Yang, Lei [1]

Tan, Lei [3]

Xu, Ruyi [2]

Affiliations:

[1] Cent China Normal Univ, Natl Engn Res Ctr E Learning, Wuhan 430079, Peoples R China

[2] Cent China Normal Univ, Natl Engn Lab Educ Big Data, Wuhan 430079, Peoples R China

[3] Xiamen Univ, Sch Informat, Dept Artificial Intelligence, Xiamen 361005, Peoples R China

Source:

PATTERN RECOGNITION | 2022 / Vol. 129

Keywords:

Multi-view facial expression recognition; Orthogonal channel attention; Multi-task learning; Siamese convolutional neural network; Separated channel attention module;

DOI:

10.1016/j.patcog.2022.108753

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Multi-view facial expression recognition (FER) is a challenging computer vision task due to the large intra-class differences caused by viewpoint variations. This paper presents a novel orthogonal channel attention-based multi-task learning (OCA-MTL) approach for FER. The proposed OCA-MTL approach adopts a Siamese convolutional neural network (CNN) to force the multi-view expression recognition model to learn the same features as the frontal expression recognition model. To further enhance the recognition accuracy for non-frontal expressions, the multi-view expression model adopts a multi-task learning framework that treats head pose estimation (HPE) as an auxiliary task. A separated channel attention (SCA) module is embedded in the multi-task learning framework to generate individual attention for FER and HPE. Furthermore, an orthogonal channel attention loss is presented to force the model to employ different feature channels to represent the facial expression and head pose, thereby decoupling them. The proposed approach is evaluated on two public facial expression datasets and achieves an average recognition accuracy of 88.41% under 13 viewpoints on Multi-PIE and 89.04% under 5 viewpoints on KDEF, outperforming state-of-the-art methods. (c) 2022 Elsevier Ltd. All rights reserved.
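
The abstract does not give the form of the orthogonal channel attention loss. As a rough illustration only, the sketch below shows one way a penalty could push two per-task channel attention vectors toward using different channels: it computes their cosine similarity, which is 0 when the two tasks attend to disjoint channels. The function names (`channel_attention`, `orthogonal_attention_penalty`), the sigmoid gating, and the cosine form are all assumptions made for this example, not the paper's formulation.

```python
import math


def channel_attention(scores):
    """Map raw per-channel scores to attention weights in (0, 1).

    Assumption: sigmoid gating, chosen here only for illustration.
    """
    return [1.0 / (1.0 + math.exp(-s)) for s in scores]


def orthogonal_attention_penalty(att_fer, att_hpe, eps=1e-12):
    """Cosine similarity between two non-negative attention vectors.

    Hypothetical stand-in for an orthogonality loss: the value is 0 when
    the expression (FER) and head pose (HPE) branches weight disjoint
    channels, and close to 1 when their weights are proportional.
    """
    if len(att_fer) != len(att_hpe):
        raise ValueError("attention vectors must have the same length")
    dot = sum(a * b for a, b in zip(att_fer, att_hpe))
    norm_fer = math.sqrt(sum(a * a for a in att_fer))
    norm_hpe = math.sqrt(sum(b * b for b in att_hpe))
    return dot / (norm_fer * norm_hpe + eps)


# Disjoint channel usage gives no penalty; identical usage gives the maximum.
disjoint = orthogonal_attention_penalty([1.0, 0.0, 1.0, 0.0], [0.0, 1.0, 0.0, 1.0])
shared = orthogonal_attention_penalty([1.0, 1.0], [1.0, 1.0])
```

In a training setup of this kind, such a penalty would typically be added to the FER and HPE task losses with a weighting factor; the weighting used by the authors is not stated in this record.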

Pages: 13
