Learning Deep Global Multi-Scale and Local Attention Features for Facial Expression Recognition in the Wild

被引：146

作者：

Zhao, Zengqun ^{[1
]}

Liu, Qingshan ^{[1
]}

Wang, Shanmin ^{[2
,3
]}

机构：

[1] Nanjing Univ Informat Sci & Technol, Engn Res Ctr Digital Forens, Minist Educ, Nanjing 210044, Peoples R China

[2] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing 210016, Peoples R China

[3] Minist Educ, Engn Res Ctr Digital Forens, Nanjing 210044, Peoples R China

来源：

IEEE TRANSACTIONS ON IMAGE PROCESSING | 2021年 / 30卷

基金：

中国国家自然科学基金;

关键词：

Feature extraction; Face recognition; Image recognition; Faces; Convolution; Image reconstruction; Geometry; Facial expression recognition; deep convolutional neural networks; multi-scale; local attention; INFORMATION; PATCHES; JOINT; POSE;

D O I：

10.1109/TIP.2021.3093397

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Facial expression recognition (FER) in the wild received broad concerns in which occlusion and pose variation are two key issues. This paper proposed a global multi-scale and local attention network (MA-Net) for FER in the wild. Specifically, the proposed network consists of three main components: a feature pre-extractor, a multi-scale module, and a local attention module. The feature pre-extractor is utilized to pre-extract middle-level features, the multi-scale module to fuse features with different receptive fields, which reduces the susceptibility of deeper convolution towards occlusion and variant pose, while the local attention module can guide the network to focus on local salient features, which releases the interference of occlusion and non-frontal pose problems on FER in the wild. Extensive experiments demonstrate that the proposed MA-Net achieves the state-of-the-art results on several in-the-wild FER benchmarks: CAER-S, AffectNet-7, AffectNet-8, RAFDB, and SFEW with accuracies of 88.42%, 64.53%, 60.29%, 88.40%, and 59.40% respectively. The codes and training logs are publicly available at https://github.com/zengqunzhao/MA-Net.

引用

页码：6544 / 6556

页数：13

共 50 条

[31] Learning Expression Features via Deep Residual Attention Networks for Facial Expression Recognition From Video Sequences
Zhao, Xiaoming
Chen, Gang
Chuang, Yuelong
Tao, Xin
Zhang, Shiqing
IETE TECHNICAL REVIEW, 2021, 38 (06) : 602 - 610
[32] Attention-Based Deep Neural Network Combined Local and Global Features for Indoor Scene Recognition
Chen, Luefeng
Duan, Wenhao
Li, Jiazhuo
Wu, Min
Pedrycz, Witold
Hirota, Kaoru
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2024, 20 (11) : 12684 - 12693
[33] Feature fusion of multi-granularity and multi-scale for facial expression recognition
Xia, Haiying
Lu, Lidan
Song, Shuxiang
VISUAL COMPUTER, 2024, 40 (03): : 2035 - 2047
[34] Feature fusion of multi-granularity and multi-scale for facial expression recognition
Haiying Xia
Lidan Lu
Shuxiang Song
The Visual Computer, 2024, 40 : 2035 - 2047
[35] Learning Multi-Scale Knowledge-Guided Features for Text-Guided Face Recognition
Hasan, Md Mahedi
Sami, Shoaib Meraj
Nasrabadi, Nasser M.
Dawson, Jeremy
IEEE TRANSACTIONS ON BIOMETRICS, BEHAVIOR, AND IDENTITY SCIENCE, 2025, 7 (02): : 195 - 209
[36] Facial Expression Recognition by Multi-Scale CNN with Regularized Center Loss
Li, Zhenghao
Wu, Song
Xiao, Guoqiang
2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 3384 - 3389
[37] LQGDNet: A Local Quaternion and Global Deep Network for Facial Depression Recognition
Shang, Yuanyuan
Pan, Yuchen
Jiang, Xiao
Shao, Zhuhong
Guo, Guodong
Liu, Tie
Ding, Hui
IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2023, 14 (03) : 2557 - 2563
[38] Joint spatial and scale attention network for multi-view facial expression recognition
Liu, Yuanyuan
Peng, Jiyao
Dai, Wei
Zeng, Jiabei
Shan, Shiguang
PATTERN RECOGNITION, 2023, 139
[39] Facial Expression Recognition under Partial Occlusion Based on Fusion of Global and Local Features
Wang, Xiaohua
Xia, Chen
Hu, Min
Ren, Fuji
NINTH INTERNATIONAL CONFERENCE ON GRAPHIC AND IMAGE PROCESSING (ICGIP 2017), 2018, 10615
[40] Discriminative attention-augmented feature learning for facial expression recognition in the wild
Zhou, Linyi
Fan, Xijian
Tjahjadi, Tardi
Das Choudhury, Sruti
NEURAL COMPUTING & APPLICATIONS, 2022, 34 (02): : 925 - 936

← 1 2 3 4 5 →