Learning Deep Global Multi-Scale and Local Attention Features for Facial Expression Recognition in the Wild

Cited by: 146
Authors
Zhao, Zengqun [1]
Liu, Qingshan [1]
Wang, Shanmin [2, 3]
Affiliations
[1] Nanjing Univ Informat Sci & Technol, Engn Res Ctr Digital Forens, Minist Educ, Nanjing 210044, Peoples R China
[2] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing 210016, Peoples R China
[3] Minist Educ, Engn Res Ctr Digital Forens, Nanjing 210044, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Feature extraction; Face recognition; Image recognition; Faces; Convolution; Image reconstruction; Geometry; Facial expression recognition; deep convolutional neural networks; multi-scale; local attention; INFORMATION; PATCHES; JOINT; POSE;
DOI
10.1109/TIP.2021.3093397
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Facial expression recognition (FER) in the wild has received broad attention, with occlusion and pose variation being two key issues. This paper proposes a global multi-scale and local attention network (MA-Net) for FER in the wild. Specifically, the proposed network consists of three main components: a feature pre-extractor, a multi-scale module, and a local attention module. The feature pre-extractor pre-extracts middle-level features; the multi-scale module fuses features with different receptive fields, which reduces the susceptibility of deeper convolutions to occlusion and pose variation; and the local attention module guides the network to focus on local salient features, which mitigates the interference of occlusion and non-frontal poses with FER in the wild. Extensive experiments demonstrate that the proposed MA-Net achieves state-of-the-art results on several in-the-wild FER benchmarks: CAER-S, AffectNet-7, AffectNet-8, RAF-DB, and SFEW, with accuracies of 88.42%, 64.53%, 60.29%, 88.40%, and 59.40%, respectively. The code and training logs are publicly available at https://github.com/zengqunzhao/MA-Net.
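The two ideas named in the abstract, fusing features with different receptive fields and weighting local regions by saliency, can be illustrated with a small NumPy sketch. This is not the MA-Net implementation from the linked repository: fixed box filters stand in for learned convolutions, a sigmoid over pooled region descriptors stands in for the paper's attention module, and every function name, the kernel sizes, and the 2 x 2 region grid are assumptions made for illustration only.

```python
import numpy as np

def box_filter(x, k):
    """Mean filter with a k x k window (k odd) over a (C, H, W) array, edge-padded."""
    pad = k // 2
    xp = np.pad(x, ((0, 0), (pad, pad), (pad, pad)), mode="edge")
    windows = np.lib.stride_tricks.sliding_window_view(xp, (k, k), axis=(1, 2))
    return windows.mean(axis=(-2, -1))  # back to (C, H, W)

def multi_scale_fuse(x, kernel_sizes=(1, 3, 5)):
    """Concatenate responses with different receptive fields along the channel axis.
    Assumption: box filters replace the learned convolutions of a real network."""
    return np.concatenate([box_filter(x, k) for k in kernel_sizes], axis=0)

def split_into_regions(x, grid=2):
    """Split a (C, H, W) feature map into grid x grid non-overlapping local regions."""
    _, h, w = x.shape
    rh, rw = h // grid, w // grid
    return [x[:, i * rh:(i + 1) * rh, j * rw:(j + 1) * rw]
            for i in range(grid) for j in range(grid)]

def local_attention(regions, w_att):
    """Score each region in (0, 1) and return the score-weighted mean descriptor.
    Assumption: w_att is a hypothetical (C,) weight vector, random here, learned in practice."""
    descriptors = np.stack([r.mean(axis=(1, 2)) for r in regions])  # (R, C)
    scores = 1.0 / (1.0 + np.exp(-(descriptors @ w_att)))          # (R,)
    fused = (scores[:, None] * descriptors).sum(axis=0) / scores.sum()
    return scores, fused

rng = np.random.default_rng(0)
features = rng.normal(size=(4, 8, 8))  # stand-in for pre-extracted mid-level features
fused = multi_scale_fuse(features)
scores, descriptor = local_attention(split_into_regions(features, grid=2),
                                     rng.normal(size=4))
print(fused.shape, scores.shape, descriptor.shape)  # (12, 8, 8) (4,) (4,)
```

In this sketch a region with a low score contributes little to the final descriptor, which is the intuition behind using local attention to reduce the influence of occluded face regions.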
Pages: 6544 - 6556
Number of pages: 13
Related papers
50 in total
  • [41] Hybrid Attention-Aware Learning Network for Facial Expression Recognition in the Wild
    Gong, Weijun
    La, Zhiyao
    Qian, Yurong
    Zhou, Weihang
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2024, 49 (09) : 12203 - 12217
  • [42] Discriminative attention-augmented feature learning for facial expression recognition in the wild
    Linyi Zhou
    Xijian Fan
    Tardi Tjahjadi
    Sruti Das Choudhury
    Neural Computing and Applications, 2022, 34 : 925 - 936
  • [43] Facial Expression Recognition via Deep Learning
    Zhao, Xiaoming
    Shi, Xugan
    Zhang, Shiqing
    IETE TECHNICAL REVIEW, 2015, 32 (05) : 347 - 355
  • [44] Action Recognition in Radio Signals Based on Multi-Scale Deep Features
    Hao, Xiaojun
    Xu, Guangying
    Ma, Hongbin
    Yang, Shuyuan
    TENTH INTERNATIONAL CONFERENCE ON GRAPHICS AND IMAGE PROCESSING (ICGIP 2018), 2019, 11069
  • [45] Facial Expression Recognition by Regional Attention and Multi-task Learning
    Cui, Longlei
    Tian, Ying
    ENGINEERING LETTERS, 2021, 29 (03) : 919 - 925
  • [46] Facial Expression Recognition Based on Multiscale Features and Attention Mechanism
    Yao, Lisha
    AUTOMATIC CONTROL AND COMPUTER SCIENCES, 2024, 58 (04) : 429 - 440
  • [47] MANet: Multi-Scale Attention Network for Correspondence Learning
    Chen, Yukai
    Zheng, Linxin
    Liu, Xin
    Xiao, Guobao
    IEEE SIGNAL PROCESSING LETTERS, 2021, 28 : 1978 - 1982
  • [48] Multi-Scale Deep Representation Aggregation for Vein Recognition
    Pan, Zaiyu
    Wang, Jun
    Wang, Guoqing
    Zhu, Jihong
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2021, 16 : 1 - 15
  • [49] Dynamic facial expression recognition of sprinters based on multi-scale detail enhancement
    Cao, Xiang
    Li, Pengfei
    INTERNATIONAL JOURNAL OF BIOMETRICS, 2022, 14 (3-4) : 336 - 351
  • [50] Deep Attention and Multi-Scale Networks for Accurate Remote Sensing Image Segmentation
    Qi, Xingqun
    Li, Kaiqi
    Liu, Pengkun
    Zhou, Xiaoguang
    Sun, Muyi
    IEEE ACCESS, 2020, 8 (08) : 146627 - 146639