A novel multi-scale facial expression recognition algorithm based on improved Res2Net for classroom scenes

被引：0

作者：

Gu, Meihua ^{[1
]}

Feng, Jing ^{[1
]}

Chu, Yalu ^{[1
]}

机构：

[1] Xian Polytech Univ, Sch Elect Informat, Xian 710048, Peoples R China

来源：

MULTIMEDIA TOOLS AND APPLICATIONS | 2024年 / 83卷 / 06期

关键词：

Classroom scenes; Expression recognition; Res2Net; Multi-scale; Fine-grained coordinate attention mechanism;

D O I：

10.1007/s11042-023-16115-0

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Facial expression recognition under classroom scenes can help the teacher to understand students' classroom learning status and improve teaching effectiveness. Aiming at the problem of low expression recognition accuracy in classroom scenarios, a novel multi-scale facial expression recognition algorithm based on improved Res2Net is proposed. Firstly, a bi-directional residual BiRes2Net module is proposed to achieve bi-directional multi-scale expression feature extraction at the fine-grained level, while a short-directed connection path is introduced to make the network have the self-closing capability and avoid extracting redundant information of expressions; Then the Fine-Grained Coordinate Attention (FGCA) mechanism is embedded to extract expression spatial location features and channel features at a fine-grained level by making full use of the prior knowledge of facial expressions; Finally, a multi-classification Focalloss loss function is used to alleviate the imbalance of expression data, and different weights are assigned to expression samples with different recognition difficulty so that the network is biased towards difficult sample feature extraction. The experimental results show that the recognition accuracy of the proposed method is 79.47%, 94.06%, and 96.67% in RAF-DB, JAFFE, and CK+ datasets respectively, and up to 72.71% in real classroom scenes, which are better than other comparative algorithms significantly.

引用

页码：16525 / 16542

页数：18

共 36 条

[1] A novel Monogenic Directional Pattern (MDP) and pseudo-Voigt kernel for facilitating the identification of facial emotions
Alphonse, A. Sherly
Dharma, Dejey
[J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2017, 49 : 459 - 470
[2] [Anonymous], 2016, MEDITERRANEAN ELECTR, DOI DOI 10.1109/MELCON.2016.7495381
[3] Dimitrios K, 2021, P IEEE C COMPUTER VI, DOI [10.48550/arXiv.2015.03790, DOI 10.48550/ARXIV.2015.03790]
[4] Res2Net: A New Multi-Scale Backbone Architecture
Gao, Shang-Hua
Cheng, Ming-Ming
Zhao, Kai
Zhang, Xin-Yu
Yang, Ming-Hsuan
Torr, Philip
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (02) : 652 - 662
[5] Gao T., 2021, J INTELL SYST, V17, P393, DOI [10.11992/tis.202107028, DOI 10.11992/TIS.202107028]
[6] Students' affective content analysis in smart classroom environment using deep learning techniques
Gupta, Sujit Kumar
Ashwin, T. S.
Guddeti, Ram Mohana Reddy
[J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (18) : 25321 - 25348
[7] Coordinate Attention for Efficient Mobile Network Design
Hou, Qibin
Zhou, Daquan
Feng, Jiashi
[J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 13708 - 13717
[8] Hu J, 2018, PROC CVPR IEEE, P7132, DOI [10.1109/CVPR.2018.00745, 10.1109/TPAMI.2019.2913372]
[9] Cross-domain facial expression recognition via an intra-category common feature and inter-category Distinction feature fusion network
Ji, Yanli
Hu, Yuhan
Yang, Yang
Shen, Fumin
Shen, Heng Tao
[J]. NEUROCOMPUTING, 2019, 333 : 231 - 239
[10] Li D, 2021, RES FACIAL EXPRESSIO, DOI [10.27684/d.cnki.gxndx.2021.003154, DOI 10.27684/D.CNKI.GXNDX.2021.003154]

← 1 2 3 4 →