Facial Expression Recognition-You Only Look Once-Neighborhood Coordinate Attention Mamba: Facial Expression Detection and Classification Based on Neighbor and Coordinates Attention Mechanism

被引:0
|
作者
Peng, Cheng [1 ]
Sun, Mingqi [2 ]
Zou, Kun [1 ]
Zhang, Bowen [3 ]
Dai, Genan [3 ]
Tsoi, Ah Chung [4 ]
机构
[1] Univ Elect Sci & Technol China, Zhongshan Inst, Sch Comp, Zhongshan 528402, Peoples R China
[2] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, Chengdu 611731, Peoples R China
[3] Shenzhen Technol Univ, Coll Big Data & Internet, Shenzhen 518118, Peoples R China
[4] Univ Wollongong, Sch Comp & Informat Technol, Wollongong, NSW 2522, Australia
关键词
facial expression recognition; visual state space model; attention; object detection;
D O I
10.3390/s24216912
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
In studying the joint object detection and classification problem for facial expression recognition (FER) deploying the YOLOX framework, we introduce a novel feature extractor, called neighborhood coordinate attention Mamba (NCAMamba) to substitute for the original feature extractor in the Feature Pyramid Network (FPN). NCAMamba combines the background information reduction capabilities of Mamba, the local neighborhood relationship understanding of neighborhood attention, and the directional relationship understanding of coordinate attention. The resulting FER-YOLO-NCAMamba model, when applied to two unaligned FER benchmark datasets, RAF-DB and SFEW, obtains significantly improved mean average precision (mAP) scores when compared with those obtained by other state-of-the-art methods. Moreover, in ablation studies, it is found that the NCA module is relatively more important than the Visual State Space (VSS), a version of using Mamba for image processing, and in visualization studies using the grad-CAM method, it reveals that regions around the nose tip are critical to recognizing the expression; if it is too large, it may lead to erroneous prediction, while a small focused region would lead to correct recognition; this may explain why FER of unaligned faces is such a challenging problem.
引用
收藏
页数:20
相关论文
共 47 条
  • [1] Facial Expression Recognition with Attention Mechanism
    Wang, Caixia
    Wang, Zhihui
    Cui, Dong
    2021 14TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI 2021), 2021,
  • [2] An Innovative Neighbor Attention Mechanism Based on Coordinates for the Recognition of Facial Expressions
    Peng, Cheng
    Li, Bohao
    Zou, Kun
    Zhang, Bowen
    Dai, Genan
    Tsoi, Ah Chung
    SENSORS, 2024, 24 (22)
  • [3] Facial expression recognition based on facial part attention mechanism
    Zhong, Qiubo
    Fang, Baofu
    Wei, Shenbin
    Wang, Zaijun
    Zhang, Haoxiang
    JOURNAL OF ELECTRONIC IMAGING, 2021, 30 (03)
  • [4] Facial Expression Recognition Network Based on Attention Mechanism
    Zhang W.
    Li P.
    Tianjin Daxue Xuebao (Ziran Kexue yu Gongcheng Jishu Ban)/Journal of Tianjin University Science and Technology, 2022, 55 (07): : 706 - 713
  • [5] Facial Expression Recognition Based on Multiscale Features and Attention Mechanism
    Yao, Lisha
    AUTOMATIC CONTROL AND COMPUTER SCIENCES, 2024, 58 (04) : 429 - 440
  • [6] Attention mechanism-based CNN for facial expression recognition
    Li, Jing
    Jin, Kan
    Zhou, Dalin
    Kubota, Naoyuki
    Ju, Zhaojie
    NEUROCOMPUTING, 2020, 411 : 340 - 350
  • [7] FACIAL EXPRESSION RECOGNITION ALGORITHM BASED ON MULTI-ATTENTION MECHANISM
    Wu, Huixin
    Huang, Zehuan
    Jiang, Wei
    Zhao, Xin
    INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2023, 19 (04): : 1239 - 1250
  • [8] Facial Expression Recognition Based on Dual Scale Hybrid Attention mechanism
    Peng Yongjia
    Xin, Jin
    2023 5TH INTERNATIONAL CONFERENCE ON CONTROL AND ROBOTICS, ICCR, 2023, : 240 - 244
  • [9] Facial expression recognition based on attention mechanism ResNet lightweight network
    Zhao Xiao
    Yang Chen
    Wang Ruo-nan
    Li Yue-chen
    CHINESE JOURNAL OF LIQUID CRYSTALS AND DISPLAYS, 2023, 38 (11) : 1503 - 1510
  • [10] A visual attention based ROI detection method for facial expression recognition
    Sun, Wenyun
    Zhao, Haitao
    Jin, Zhong
    NEUROCOMPUTING, 2018, 296 : 12 - 22