A Student Facial Expression Recognition Model Based on Multi-Scale and Deep Fine-Grained Feature Attention Enhancement

被引:0
作者
Shou, Zhaoyu [1 ,2 ]
Huang, Yi [1 ]
Li, Dongxu [1 ]
Feng, Cheng [1 ]
Zhang, Huibing [3 ]
Lin, Yuming [3 ]
Wu, Guangxiang [4 ]
机构
[1] Guilin Univ Elect Sci Technol, Sch Informat & Commun, Guilin 541004, Peoples R China
[2] Guilin Univ Elect Technol, Guangxi Wireless Broadband Commun & Signal Proc Ke, Guilin 541004, Peoples R China
[3] Guilin Univ Elect Technol, Sch Comp & Informat Secur, Guilin 541004, Peoples R China
[4] China Elect Technol Grp Corp, 34th Res Inst, Guilin 541004, Peoples R China
基金
中国国家自然科学基金;
关键词
facial expression recognition; smart classroom; multi-scale features; deep fine-grained features; key region-oriented attention mechanism;
D O I
10.3390/s24206748
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
In smart classroom environments, accurately recognizing students' facial expressions is crucial for teachers to efficiently assess students' learning states, timely adjust teaching strategies, and enhance teaching quality and effectiveness. In this paper, we propose a student facial expression recognition model based on multi-scale and deep fine-grained feature attention enhancement (SFER-MDFAE) to address the issues of inaccurate facial feature extraction and poor robustness of facial expression recognition in smart classroom scenarios. Firstly, we construct a novel multi-scale dual-pooling feature aggregation module to capture and fuse facial information at different scales, thereby obtaining a comprehensive representation of key facial features; secondly, we design a key region-oriented attention mechanism to focus more on the nuances of facial expressions, further enhancing the representation of multi-scale deep fine-grained feature; finally, the fusion of multi-scale and deep fine-grained attention-enhanced features is used to obtain richer and more accurate facial key information and realize accurate facial expression recognition. The experimental results demonstrate that the proposed SFER-MDFAE outperforms the existing state-of-the-art methods, achieving an accuracy of 76.18% on FER2013, 92.75% on FERPlus, 92.93% on RAF-DB, 67.86% on AffectNet, and 93.74% on the real smart classroom facial expression dataset (SCFED). These results validate the effectiveness of the proposed method.
引用
收藏
页数:23
相关论文
共 44 条
  • [1] Training Deep Networks for Facial Expression Recognition with Crowd-Sourced Label Distribution
    Barsoum, Emad
    Zhang, Cha
    Ferrer, Cristian Canton
    Zhang, Zhengyou
    [J]. ICMI'16: PROCEEDINGS OF THE 18TH ACM INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, 2016, : 279 - 283
  • [2] Effective affective EEG-based indicators in emotion-evoking VR environments: an evidence from machine learning
    Castiblanco Jimenez, Ivonne Angelica
    Olivetti, Elena Carlotta
    Vezzetti, Enrico
    Moos, Sandro
    Celeghin, Alessia
    Marcolin, Federica
    [J]. Neural Computing and Applications, 2024, 36 (35) : 22245 - 22263
  • [3] ArcFace: Additive Angular Margin Loss for Deep Face Recognition
    Deng, Jiankang
    Guo, Jia
    Xue, Niannan
    Zafeiriou, Stefanos
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 4685 - 4694
  • [4] EmoNeXt: an Adapted ConvNeXt for Facial Emotion Recognition
    El Boudouri, Yassine
    Bohi, Amine
    [J]. 2023 IEEE 25TH INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, MMSP, 2023,
  • [5] Hierarchical scale convolutional neural network for facial expression recognition
    Fan, Xinqi
    Jiang, Mingjie
    Shahid, Ali Raza
    Yan, Hong
    [J]. COGNITIVE NEURODYNAMICS, 2022, 16 (04) : 847 - 858
  • [6] Ad-Corre: Adaptive Correlation-Based Loss for Facial Expression Recognition in the Wild
    Fard, Ali Pourramezan
    Mahoor, Mohammad H.
    [J]. IEEE ACCESS, 2022, 10 : 26756 - 26768
  • [7] Facial Expression Recognition in the Wild via Deep Attentive Center Loss
    Farzaneh, Amir Hossein
    Qi, Xiaojun
    [J]. 2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WACV 2021, 2021, : 2401 - 2410
  • [8] Attentive or Not? Toward a Machine Learning Approach to Assessing Students' Visible Engagement in Classroom Instruction
    Goldberg, Patricia
    Suemer, Oemer
    Stuermer, Kathleen
    Wagner, Wolfgang
    Goellner, Richard
    Gerjets, Peter
    Kasneci, Enkelejda
    Trautwein, Ulrich
    [J]. EDUCATIONAL PSYCHOLOGY REVIEW, 2021, 33 (01) : 27 - 49
  • [9] Goodfellow Ian J., 2013, Neural Information Processing. 20th International Conference, ICONIP 2013. Proceedings: LNCS 8228, P117, DOI 10.1007/978-3-642-42051-1_16
  • [10] MS-Celeb-1M: A Dataset and Benchmark for Large-Scale Face Recognition
    Guo, Yandong
    Zhang, Lei
    Hu, Yuxiao
    He, Xiaodong
    Gao, Jianfeng
    [J]. COMPUTER VISION - ECCV 2016, PT III, 2016, 9907 : 87 - 102