Learning Deep Global Multi-Scale and Local Attention Features for Facial Expression Recognition in the Wild

被引:146
|
作者
Zhao, Zengqun [1 ]
Liu, Qingshan [1 ]
Wang, Shanmin [2 ,3 ]
机构
[1] Nanjing Univ Informat Sci & Technol, Engn Res Ctr Digital Forens, Minist Educ, Nanjing 210044, Peoples R China
[2] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing 210016, Peoples R China
[3] Minist Educ, Engn Res Ctr Digital Forens, Nanjing 210044, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature extraction; Face recognition; Image recognition; Faces; Convolution; Image reconstruction; Geometry; Facial expression recognition; deep convolutional neural networks; multi-scale; local attention; INFORMATION; PATCHES; JOINT; POSE;
D O I
10.1109/TIP.2021.3093397
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Facial expression recognition (FER) in the wild received broad concerns in which occlusion and pose variation are two key issues. This paper proposed a global multi-scale and local attention network (MA-Net) for FER in the wild. Specifically, the proposed network consists of three main components: a feature pre-extractor, a multi-scale module, and a local attention module. The feature pre-extractor is utilized to pre-extract middle-level features, the multi-scale module to fuse features with different receptive fields, which reduces the susceptibility of deeper convolution towards occlusion and variant pose, while the local attention module can guide the network to focus on local salient features, which releases the interference of occlusion and non-frontal pose problems on FER in the wild. Extensive experiments demonstrate that the proposed MA-Net achieves the state-of-the-art results on several in-the-wild FER benchmarks: CAER-S, AffectNet-7, AffectNet-8, RAFDB, and SFEW with accuracies of 88.42%, 64.53%, 60.29%, 88.40%, and 59.40% respectively. The codes and training logs are publicly available at https://github.com/zengqunzhao/MA-Net.
引用
收藏
页码:6544 / 6556
页数:13
相关论文
共 50 条
  • [21] Unsupervised deep homography with multi-scale global attention
    Hu, Wei
    He, Chu
    Lin, Mingyuan
    Zhou, Haoyu
    IET IMAGE PROCESSING, 2023, 17 (10) : 2937 - 2948
  • [22] Facial Expression Recognition Based on Multi-scale CNNs
    Zhou, Shuai
    Liang, Yanyan
    Wan, Jun
    Li, Stan Z.
    BIOMETRIC RECOGNITION, 2016, 9967 : 503 - 510
  • [23] A Student Facial Expression Recognition Model Based on Multi-Scale and Deep Fine-Grained Feature Attention Enhancement
    Shou, Zhaoyu
    Huang, Yi
    Li, Dongxu
    Feng, Cheng
    Zhang, Huibing
    Lin, Yuming
    Wu, Guangxiang
    SENSORS, 2024, 24 (20)
  • [24] FACIAL EXPRESSION RECOGNITION IN THE WILD USING RICH DEEP FEATURES
    Karali, Abubakrelsedik
    Bassiouny, Ahmad
    El-Saban, Motaz
    2015 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2015, : 3442 - 3446
  • [25] A multi-scale sentiment recognition network based on deep learning
    Zhang, Ning
    Zhang, Xiufeng
    2023 3RD ASIA-PACIFIC CONFERENCE ON COMMUNICATIONS TECHNOLOGY AND COMPUTER SCIENCE, ACCTCS, 2023, : 526 - 530
  • [26] Multi-View Gait Recognition With Joint Local Multi-Scale and Global Contextual Spatio-Temporal Features
    Zhai, Wenzhe
    Li, Haomiao
    Zheng, Chaoqun
    Xing, Xianglei
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2025, 35 (02) : 1123 - 1135
  • [27] Facial Expression Recognition from Global and a Combination of Local Features
    Praseeda, Lekshmi V.
    Sasikumar, M.
    IETE TECHNICAL REVIEW, 2009, 26 (01) : 41 - 46
  • [28] Reliable facial expression recognition for multi-scale images using weber local binary image based cosine transform features
    Sajid Ali Khan
    Ayyaz Hussain
    Muhammad Usman
    Multimedia Tools and Applications, 2018, 77 : 1133 - 1165
  • [29] Facial Expression Recognition Based on Multi-scale Feature Fusion Convolutional Neural Network and Attention Mechanism
    Wu, Yana
    Jia, Kebin
    Sun, Zhonghua
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2021, PT II, 2021, 13020 : 324 - 335
  • [30] Learning Affective Video Features for Facial Expression Recognition via Hybrid Deep Learning
    Zhang, Shiqing
    Pan, Xianzhang
    Cui, Yueli
    Zhao, Xiaoming
    Liu, Limei
    IEEE ACCESS, 2019, 7 : 32297 - 32304