Learning Deep Global Multi-Scale and Local Attention Features for Facial Expression Recognition in the Wild

被引:146
|
作者
Zhao, Zengqun [1 ]
Liu, Qingshan [1 ]
Wang, Shanmin [2 ,3 ]
机构
[1] Nanjing Univ Informat Sci & Technol, Engn Res Ctr Digital Forens, Minist Educ, Nanjing 210044, Peoples R China
[2] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing 210016, Peoples R China
[3] Minist Educ, Engn Res Ctr Digital Forens, Nanjing 210044, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature extraction; Face recognition; Image recognition; Faces; Convolution; Image reconstruction; Geometry; Facial expression recognition; deep convolutional neural networks; multi-scale; local attention; INFORMATION; PATCHES; JOINT; POSE;
D O I
10.1109/TIP.2021.3093397
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Facial expression recognition (FER) in the wild received broad concerns in which occlusion and pose variation are two key issues. This paper proposed a global multi-scale and local attention network (MA-Net) for FER in the wild. Specifically, the proposed network consists of three main components: a feature pre-extractor, a multi-scale module, and a local attention module. The feature pre-extractor is utilized to pre-extract middle-level features, the multi-scale module to fuse features with different receptive fields, which reduces the susceptibility of deeper convolution towards occlusion and variant pose, while the local attention module can guide the network to focus on local salient features, which releases the interference of occlusion and non-frontal pose problems on FER in the wild. Extensive experiments demonstrate that the proposed MA-Net achieves the state-of-the-art results on several in-the-wild FER benchmarks: CAER-S, AffectNet-7, AffectNet-8, RAFDB, and SFEW with accuracies of 88.42%, 64.53%, 60.29%, 88.40%, and 59.40% respectively. The codes and training logs are publicly available at https://github.com/zengqunzhao/MA-Net.
引用
收藏
页码:6544 / 6556
页数:13
相关论文
共 50 条
  • [31] Learning Expression Features via Deep Residual Attention Networks for Facial Expression Recognition From Video Sequences
    Zhao, Xiaoming
    Chen, Gang
    Chuang, Yuelong
    Tao, Xin
    Zhang, Shiqing
    IETE TECHNICAL REVIEW, 2021, 38 (06) : 602 - 610
  • [32] Attention-Based Deep Neural Network Combined Local and Global Features for Indoor Scene Recognition
    Chen, Luefeng
    Duan, Wenhao
    Li, Jiazhuo
    Wu, Min
    Pedrycz, Witold
    Hirota, Kaoru
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2024, 20 (11) : 12684 - 12693
  • [33] Feature fusion of multi-granularity and multi-scale for facial expression recognition
    Xia, Haiying
    Lu, Lidan
    Song, Shuxiang
    VISUAL COMPUTER, 2024, 40 (03): : 2035 - 2047
  • [34] Feature fusion of multi-granularity and multi-scale for facial expression recognition
    Haiying Xia
    Lidan Lu
    Shuxiang Song
    The Visual Computer, 2024, 40 : 2035 - 2047
  • [35] Learning Multi-Scale Knowledge-Guided Features for Text-Guided Face Recognition
    Hasan, Md Mahedi
    Sami, Shoaib Meraj
    Nasrabadi, Nasser M.
    Dawson, Jeremy
    IEEE TRANSACTIONS ON BIOMETRICS, BEHAVIOR, AND IDENTITY SCIENCE, 2025, 7 (02): : 195 - 209
  • [36] Facial Expression Recognition by Multi-Scale CNN with Regularized Center Loss
    Li, Zhenghao
    Wu, Song
    Xiao, Guoqiang
    2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 3384 - 3389
  • [37] LQGDNet: A Local Quaternion and Global Deep Network for Facial Depression Recognition
    Shang, Yuanyuan
    Pan, Yuchen
    Jiang, Xiao
    Shao, Zhuhong
    Guo, Guodong
    Liu, Tie
    Ding, Hui
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2023, 14 (03) : 2557 - 2563
  • [38] Joint spatial and scale attention network for multi-view facial expression recognition
    Liu, Yuanyuan
    Peng, Jiyao
    Dai, Wei
    Zeng, Jiabei
    Shan, Shiguang
    PATTERN RECOGNITION, 2023, 139
  • [39] Facial Expression Recognition under Partial Occlusion Based on Fusion of Global and Local Features
    Wang, Xiaohua
    Xia, Chen
    Hu, Min
    Ren, Fuji
    NINTH INTERNATIONAL CONFERENCE ON GRAPHIC AND IMAGE PROCESSING (ICGIP 2017), 2018, 10615
  • [40] Discriminative attention-augmented feature learning for facial expression recognition in the wild
    Zhou, Linyi
    Fan, Xijian
    Tjahjadi, Tardi
    Das Choudhury, Sruti
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (02): : 925 - 936