Student behavior recognition for interaction detection in the classroom environment

Cited by: 19

Authors:

Li, Yating ^{[1,2]}

Qi, Xin ^{[1,2]}

Saudagar, Abdul Khader Jilani ^{[3]}

Badshah, Abdul Malik ^{[4]}

Muhammad, Khan ^{[5,6]}

Liu, Shuai ^{[1,2,6]}

Affiliations:

[1] Hunan Normal Univ, Sch Educ Sci, Changsha 410081, Peoples R China

[2] Hunan Normal Univ, Coll Comp Sci & Engn, Changsha 410081, Peoples R China

[3] Imam Mohammad Ibn Saud Islamic Univ IMSIU, Coll Comp & Informat Sci, Informat Syst Dept, Riyadh 11432, Saudi Arabia

[4] Univ Missouri Kansas City, Dept Comp Sci & Elect Engn, Kansas City, MO USA

[5] Sungkyunkwan Univ, Coll Comp & Informat, Sch Convergence, Dept Appl Artificial Intelligence,Visual Analyt La, Seoul 03063, South Korea

[6] Sungkyunkwan Univ, Coll Comp & Informat, Sch Convergence, Seoul 03063, South Korea

Source:

IMAGE AND VISION COMPUTING | 2023 / Volume 136

Keywords:

Surveillance; Relational reasoning; Human-to-object interaction; Action recognition; Smart classroom; Intelligent education;

DOI:

10.1016/j.imavis.2023.104726

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

With the development of multimedia technologies, surveillance videos and other multimedia data have received widespread attention in several fields. Surveillance videos can monitor students' learning statuses in real time. However, current action recognition methods for teaching have limitations. First, ethical and privacy concerns around AI in education make public datasets on student behavior scarce. Therefore, after summarizing seven typical student behaviors in the classroom, course videos were collected from a smart classroom to generate a dataset of student behavior. Compared with existing student behavior recognition datasets, the proposed dataset is distinguished by cluttered backgrounds, crowded scenes, and occlusions. Second, relational reasoning in existing methods is not well suited to distinguishing between students' body parts and small objects in a cluttered background; the different relational features interact little with one another, so their complementarity goes unused, resulting in poor interaction action recognition. Therefore, an attention-based relational reasoning module is used to strengthen the interactive representation between small objects and human body parts. At the same time, because the relational features are to some degree complementary, this study constructs a relational feature fusion module that models a human-to-human interaction relationship built upon the supporting human's body parts and the surrounding context. Finally, the reconstructed features and human-appearance features are fused to achieve accurate interactive action recognition. An experimental comparison between the proposed model and current mainstream algorithms on the generated student behavior dataset verified that the proposed model achieves state-of-the-art performance in action recognition. © 2023 Elsevier B.V. All rights reserved.
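The general idea of attention-based relational reasoning followed by feature fusion can be illustrated with a minimal sketch. This is not the authors' architecture, which the abstract does not specify: the scaled dot-product attention, the averaging over body parts, the fusion by concatenation, and all names and toy feature values below are assumptions chosen only for illustration.

```python
import math

def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def dot(a, b):
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))

def attend(query, keys):
    """Scaled dot-product attention of one query over a list of key vectors.

    Returns (weights, context), where context is the weighted sum of keys.
    """
    scale = math.sqrt(len(query))
    weights = softmax([dot(query, k) / scale for k in keys])
    context = [sum(w * k[i] for w, k in zip(weights, keys))
               for i in range(len(query))]
    return weights, context

def relational_feature(part_feats, object_feats):
    """Average, over body parts, of the object context each part attends to."""
    contexts = [attend(p, object_feats)[1] for p in part_feats]
    n = len(contexts)
    return [sum(c[i] for c in contexts) / n for i in range(len(contexts[0]))]

def fuse(appearance_feat, relation_feat):
    """Fusion by concatenation (an assumed, deliberately simple choice)."""
    return list(appearance_feat) + list(relation_feat)

# Toy 2-D features, invented for illustration only.
part_feats = [[1.0, 0.0], [0.0, 1.0]]      # hypothetical: hand, head
object_feats = [[4.0, 0.0], [0.0, 4.0]]    # hypothetical: pen, phone
appearance_feat = [0.5, 0.5]               # hypothetical whole-person feature

weights, _ = attend(part_feats[0], object_feats)
relation = relational_feature(part_feats, object_feats)
fused = fuse(appearance_feat, relation)
```

In this toy setup the first body part places more attention weight on the first object, because their feature vectors point in the same direction, and the fused vector simply appends the relational feature to the appearance feature.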


Pages: 9

