Facial Expression Recognition-You Only Look Once-Neighborhood Coordinate Attention Mamba: Facial Expression Detection and Classification Based on Neighbor and Coordinates Attention Mechanism

被引：0

作者：

Peng, Cheng ^{[1
]}

Sun, Mingqi ^{[2
]}

Zou, Kun ^{[1
]}

Zhang, Bowen ^{[3
]}

Dai, Genan ^{[3
]}

Tsoi, Ah Chung ^{[4
]}

机构：

[1] Univ Elect Sci & Technol China, Zhongshan Inst, Sch Comp, Zhongshan 528402, Peoples R China

[2] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, Chengdu 611731, Peoples R China

[3] Shenzhen Technol Univ, Coll Big Data & Internet, Shenzhen 518118, Peoples R China

[4] Univ Wollongong, Sch Comp & Informat Technol, Wollongong, NSW 2522, Australia

来源：

SENSORS | 2024年 / 24卷 / 21期

关键词：

facial expression recognition; visual state space model; attention; object detection;

D O I：

10.3390/s24216912

中图分类号：

O65 [分析化学];

学科分类号：

070302 ; 081704 ;

摘要：

In studying the joint object detection and classification problem for facial expression recognition (FER) deploying the YOLOX framework, we introduce a novel feature extractor, called neighborhood coordinate attention Mamba (NCAMamba) to substitute for the original feature extractor in the Feature Pyramid Network (FPN). NCAMamba combines the background information reduction capabilities of Mamba, the local neighborhood relationship understanding of neighborhood attention, and the directional relationship understanding of coordinate attention. The resulting FER-YOLO-NCAMamba model, when applied to two unaligned FER benchmark datasets, RAF-DB and SFEW, obtains significantly improved mean average precision (mAP) scores when compared with those obtained by other state-of-the-art methods. Moreover, in ablation studies, it is found that the NCA module is relatively more important than the Visual State Space (VSS), a version of using Mamba for image processing, and in visualization studies using the grad-CAM method, it reveals that regions around the nose tip are critical to recognizing the expression; if it is too large, it may lead to erroneous prediction, while a small focused region would lead to correct recognition; this may explain why FER of unaligned faces is such a challenging problem.

引用

页数：20

共 50 条

[21] Facial Expression Recognition Based on Spatial-Temporal Fusion with Attention Mechanism
Zhang, Lifeng
Zheng, Xiangwei
Chen, Xuanchi
Ren, Xiuxiu
Ji, Cun
NEURAL PROCESSING LETTERS, 2023, 55 (05) : 6109 - 6124
[22] Multi-Scale Coordinate Attention Pyramid Convolution for Facial Expression Recognition
Ni, Jinyuan
Zhang, Jianxun
Computer Engineering and Applications, 2023, 59 (22) : 242 - 250
[23] Facial Expression Recognition Based on Spatial and Channel Attention Mechanisms
Lisha Yao
Shixiong He
Kang Su
Qingtong Shao
Wireless Personal Communications, 2022, 125 : 1483 - 1500
[24] Facial Expression Recognition Based on Region Enhanced Attention Network
Gongguan C.
Fan Z.
Hua W.
Hui F.
Caiming Z.
Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2024, 36 (01): : 152 - 160
[25] An Attention Model-Based Facial Expression Recognition Algorithm
Chu Jinghui
Tang Wenhao
Zhang Shan
Lu Wei
LASER & OPTOELECTRONICS PROGRESS, 2020, 57 (12)
[26] Facial expression recognition algorithm based on efficient channel attention
Yang, Qing
Wei, Mingjun
Zhu, Rong
Zhou, Bing
JOURNAL OF ELECTRONIC IMAGING, 2022, 31 (05)
[27] Facial Expression Recognition Based on Spatial and Channel Attention Mechanisms
Yao, Lisha
He, Shixiong
Su, Kang
Shao, Qingtong
WIRELESS PERSONAL COMMUNICATIONS, 2022, 125 (02) : 1483 - 1500
[28] Lightweight Facial Expression Recognition Method Based on Convolutional Attention
Yin Pengbo
Pan Weimin
Zhang Haijun
LASER & OPTOELECTRONICS PROGRESS, 2021, 58 (12)
[29] Occlusion Aware Facial Expression Recognition Using CNN With Attention Mechanism
Li, Yong
Zeng, Jiabei
Shan, Shiguang
Chen, Xilin
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (05) : 2439 - 2450
[30] Attention Mechanism and Feature Correction Fusion Model for Facial Expression Recognition
Xu, Qihua
Wang, Changlong
Hou, Yi
PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON INVENTIVE COMPUTATION TECHNOLOGIES (ICICT 2021), 2021, : 786 - 793

← 1 2 3 4 5 →