LEARNING DIVERSIFIED FEATURE REPRESENTATIONS FOR FACIAL EXPRESSION RECOGNITION IN THE WILD

被引：0

作者：

Heidari, Negar ^{[1
]}

Iosifidis, Alexandros ^{[1
]}

机构：

[1] Aarhus Univ, Dept Elect & Comp Engn, Aarhus, Denmark

来源：

2024 IEEE 34TH INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING, MLSP 2024 | 2024年

关键词：

ensemble learning; facial expression recognition; attention mechanism; deep learning; feature diversity;

D O I：

10.1109/MLSP58920.2024.10734790

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Diversity of the features extracted by deep neural networks is important for enhancing the model generalization ability in different learning tasks. Facial expression recognition in the wild has attracted interest recently due to the challenges existing for extracting discriminative features from occluded images in real-world scenarios. In this paper, we propose a mechanism to diversify the features extracted by CNN layers of facial expression recognition models for enhancing the model capacity in learning discriminative features. To evaluate the effectiveness of the proposed approach, we incorporate this mechanism in two state-of-the-art models to (i) diversify local/global features in an attention-based model and (ii) diversify features extracted by different learners in an ensemble-based model. Experimental results on three well-known facial expression recognition in-the-wild datasets, AffectNet, FER+ and RAF-DB, show the effectiveness of our method, achieving state-of-the-art performance of 89.99% on RAF-DB, 89.34% on FER+ and the competitive accuracy of 60.02% on AffectNet.

引用

页数：6

共 29 条

[1] Emotion Recognition in Speech using Cross-Modal Transfer in the Wild [J].

Albanie, Samuel ;

Nagrani, Arsha ;

Vedaldi, Andrea ;

Zisserman, Andrew .

PROCEEDINGS OF THE 2018 ACM MULTIMEDIA CONFERENCE (MM'18), 2018, :292-301

[2] Training Deep Networks for Facial Expression Recognition with Crowd-Sourced Label Distribution [J].

Barsoum, Emad ;

Zhang, Cha ;

Ferrer, Cristian Canton ;

Zhang, Zhengyou .

ICMI'16: PROCEEDINGS OF THE 18TH ACM INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, 2016, :279-283

[3] VGGFace2: A dataset for recognising faces across pose and age [J].

Cao, Qiong ;

Shen, Li ;

Xie, Weidi ;

Parkhi, Omkar M. ;

Zisserman, Andrew .

PROCEEDINGS 2018 13TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE & GESTURE RECOGNITION (FG 2018), 2018, :67-74

[4] Label Distribution Learning on Auxiliary Label Space Graphs for Facial Expression Recognition [J].

Chen, Shikai ;

Wang, Jianfeng ;

Chen, Yuedong ;

Shi, Zhongchao ;

Geng, Xin ;

Rui, Yong .

2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :13981-13990

[5]

Goodfellow Ian J., 2013, Neural Information Processing. 20th International Conference, ICONIP 2013. Proceedings: LNCS 8228, P117, DOI 10.1007/978-3-642-42051-1_16

[6] MS-Celeb-1M: A Dataset and Benchmark for Large-Scale Face Recognition [J].

Guo, Yandong ;

Zhang, Lei ;

Hu, Yuxiao ;

He, Xiaodong ;

Gao, Jianfeng .

COMPUTER VISION - ECCV 2016, PT III, 2016, 9907 :87-102

[7] Deep Residual Learning for Image Recognition [J].

He, Kaiming ;

Zhang, Xiangyu ;

Ren, Shaoqing ;

Sun, Jian .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778

[8] Progressive Spatio-Temporal Bilinear Network with Monte Carlo Dropout for Landmark-based Facial Expression Recognition with Uncertainty Estimation [J].

Heidari, Negar ;

Iosifidis, Alexandros .

IEEE MMSP 2021: 2021 IEEE 23RD INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2021,

[9]

Hewitt C, 2018, Arxiv, DOI arXiv:1807.08775

[10] Facial Expression Recognition with Inconsistently Annotated Datasets [J].

Zeng, Jiabei ;

Shan, Shiguang ;

Chen, Xilin .

COMPUTER VISION - ECCV 2018, PT XIII, 2018, 11217 :227-243

← 1 2 3 →