Facial Expression Recognition-You Only Look Once-Neighborhood Coordinate Attention Mamba: Facial Expression Detection and Classification Based on Neighbor and Coordinates Attention Mechanism

被引:0
|
作者
Peng, Cheng [1 ]
Sun, Mingqi [2 ]
Zou, Kun [1 ]
Zhang, Bowen [3 ]
Dai, Genan [3 ]
Tsoi, Ah Chung [4 ]
机构
[1] Univ Elect Sci & Technol China, Zhongshan Inst, Sch Comp, Zhongshan 528402, Peoples R China
[2] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, Chengdu 611731, Peoples R China
[3] Shenzhen Technol Univ, Coll Big Data & Internet, Shenzhen 518118, Peoples R China
[4] Univ Wollongong, Sch Comp & Informat Technol, Wollongong, NSW 2522, Australia
关键词
facial expression recognition; visual state space model; attention; object detection;
D O I
10.3390/s24216912
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
In studying the joint object detection and classification problem for facial expression recognition (FER) deploying the YOLOX framework, we introduce a novel feature extractor, called neighborhood coordinate attention Mamba (NCAMamba) to substitute for the original feature extractor in the Feature Pyramid Network (FPN). NCAMamba combines the background information reduction capabilities of Mamba, the local neighborhood relationship understanding of neighborhood attention, and the directional relationship understanding of coordinate attention. The resulting FER-YOLO-NCAMamba model, when applied to two unaligned FER benchmark datasets, RAF-DB and SFEW, obtains significantly improved mean average precision (mAP) scores when compared with those obtained by other state-of-the-art methods. Moreover, in ablation studies, it is found that the NCA module is relatively more important than the Visual State Space (VSS), a version of using Mamba for image processing, and in visualization studies using the grad-CAM method, it reveals that regions around the nose tip are critical to recognizing the expression; if it is too large, it may lead to erroneous prediction, while a small focused region would lead to correct recognition; this may explain why FER of unaligned faces is such a challenging problem.
引用
收藏
页数:20
相关论文
共 47 条
  • [11] Facial Expression Recognition Based on Spatial-Temporal Fusion with Attention Mechanism
    Zhang, Lifeng
    Zheng, Xiangwei
    Chen, Xuanchi
    Ren, Xiuxiu
    Ji, Cun
    NEURAL PROCESSING LETTERS, 2023, 55 (05) : 6109 - 6124
  • [12] Facial Expression Recognition Based on Spatial-Temporal Fusion with Attention Mechanism
    Lifeng Zhang
    Xiangwei Zheng
    Xuanchi Chen
    Xiuxiu Ren
    Cun Ji
    Neural Processing Letters, 2023, 55 : 6109 - 6124
  • [13] Facial Expression Recognition Based on Vision Transformer with Hybrid Local Attention
    Tian, Yuan
    Zhu, Jingxuan
    Yao, Huang
    Chen, Di
    APPLIED SCIENCES-BASEL, 2024, 14 (15):
  • [14] Facial Expression Recognition Based on Region Enhanced Attention Network
    Gongguan C.
    Fan Z.
    Hua W.
    Hui F.
    Caiming Z.
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2024, 36 (01): : 152 - 160
  • [15] A Framework for Facial Expression Recognition Combining Contextual Information and Attention Mechanism
    Chen, Jianzeng
    Chen, Ningning
    JOURNAL OF INFORMATION PROCESSING SYSTEMS, 2024, 20 (04): : 535 - 549
  • [16] Occlusion Aware Facial Expression Recognition Using CNN With Attention Mechanism
    Li, Yong
    Zeng, Jiabei
    Shan, Shiguang
    Chen, Xilin
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (05) : 2439 - 2450
  • [17] Facial Expression Recognition Methods in the Wild Based on Fusion Feature of Attention Mechanism and LBP
    Liao, Jun
    Lin, Yuanchang
    Ma, Tengyun
    He, Songxiying
    Liu, Xiaofang
    He, Guotian
    SENSORS, 2023, 23 (09)
  • [18] Facial Expression Recognition Based on Feature Representation Learning and Clustering-Based Attention Mechanism
    Jin, Lianghai
    Guo, Liyuan
    IEEE TRANSACTIONS ON BIOMETRICS, BEHAVIOR, AND IDENTITY SCIENCE, 2025, 7 (02): : 182 - 194
  • [19] An Improved Facial Expression Recognition using CNN-BiLSTM with Attention Mechanism
    Jayaraman, Samanthisvaran
    Mahendran, Anand
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (05) : 1307 - 1315
  • [20] Facial Expression Recognition Using Enhanced Convolution Neural Network with Attention Mechanism
    Prabhu, K.
    SathishKumar, S.
    Sivachitra, M.
    Dineshkumar, S.
    Sathiyabama, P.
    COMPUTER SYSTEMS SCIENCE AND ENGINEERING, 2022, 41 (01): : 415 - 426