A novel multi-scale facial expression recognition algorithm based on improved Res2Net for classroom scenes

Cited by: 0
Authors
Gu, Meihua [1]
Feng, Jing [1]
Chu, Yalu [1]
Affiliations
[1] Xi'an Polytechnic University, School of Electronics and Information, Xi'an 710048, People's Republic of China
Keywords
Classroom scenes; Expression recognition; Res2Net; Multi-scale; Fine-grained coordinate attention mechanism
DOI
10.1007/s11042-023-16115-0
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Facial expression recognition in classroom scenes can help teachers understand students' learning status and improve teaching effectiveness. To address the low accuracy of expression recognition in classroom scenarios, a novel multi-scale facial expression recognition algorithm based on an improved Res2Net is proposed. First, a bi-directional residual module, BiRes2Net, is proposed to extract multi-scale expression features in both directions at a fine-grained level, and a short directed connection path is introduced to give the network a self-closing capability and avoid extracting redundant expression information. Second, a Fine-Grained Coordinate Attention (FGCA) mechanism is embedded to extract spatial location features and channel features of expressions at a fine-grained level, making full use of prior knowledge of facial expressions. Finally, a multi-class focal loss function is used to alleviate the imbalance of expression data: expression samples of differing recognition difficulty are assigned different weights, so that the network is biased towards extracting features from difficult samples. Experimental results show that the proposed method achieves recognition accuracies of 79.47%, 94.06%, and 96.67% on the RAF-DB, JAFFE, and CK+ datasets, respectively, and up to 72.71% in real classroom scenes, significantly outperforming the compared algorithms.
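The abstract does not give the exact form of the multi-class focal loss, so the following is only a minimal sketch of the widely used formulation, FL = -alpha_t * (1 - p_t)^gamma * log(p_t), where p_t is the softmax probability of the true class. The function names, the default gamma = 2.0, and the optional per-class weights `alpha` are assumptions made for illustration, not values taken from the paper.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of class scores."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def focal_loss(logits, target, gamma=2.0, alpha=None):
    """Multi-class focal loss for one sample (illustrative sketch).

    logits: raw class scores, one per expression class
    target: index of the true class
    gamma:  focusing parameter (assumed default; gamma = 0 gives cross-entropy)
    alpha:  optional per-class weights for class imbalance (assumed)
    """
    p_t = max(softmax(logits)[target], 1e-12)  # clamp to avoid log(0)
    weight = 1.0 if alpha is None else alpha[target]
    # (1 - p_t) ** gamma shrinks the loss of confidently correct samples.
    return -weight * (1.0 - p_t) ** gamma * math.log(p_t)


# A confidently correct sample versus a confidently wrong one.
easy = focal_loss([4.0, 0.0, 0.0], target=0)
hard = focal_loss([0.0, 4.0, 0.0], target=0)
print(f"easy sample loss: {easy:.6f}")
print(f"hard sample loss: {hard:.6f}")
```

The modulating factor (1 - p_t)^gamma is what biases training towards difficult samples: a well-classified sample contributes almost nothing, while a misclassified one keeps nearly its full cross-entropy loss.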
Pages: 16525-16542
Page count: 18