Identifying and Monitoring Students' Classroom Learning Behavior Based on Multisource Information

被引：17

作者：

Albert, Chuck Chung Yin ^{[1
]}

Sun, Yuqi ^{[2
]}

Li, Guang ^{[1
]}

Peng, Jun ^{[3
]}

Ran, Feng ^{[4
]}

Wang, Zheng ^{[5
]}

Zhou, Jie ^{[3
]}

机构：

[1] City Univ Macau, Fac Data Sci, Taipa, Macau, Peoples R China

[2] Minist Educ PRC, Macao Polytech Inst, Engn Res Ctr Appl Technol Machine Translat & Artif, Macau, Peoples R China

[3] City Univ Macau, Sch Educ, Taipa, Macao, Peoples R China

[4] Beijing Dongcheng Acad Educ Sci, Beijing, Peoples R China

[5] Shandong Youth Univ Polit Sci, Sch Informat Engn, Jinan, Peoples R China

来源：

MOBILE INFORMATION SYSTEMS | 2022年 / 2022卷

关键词：

D O I：

10.1155/2022/9903342

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Understanding human activity and behavior, particularly real-time understanding in video feeds, is one of the most active areas of research in Computer Vision (CV) and Artificial Intelligence (AI) nowadays. To advance the topic of integrating learning engagement research with university teaching practice, accurate and efficient assessment, and analysis of students' classroom learning behavior engagement is very important. The recently proposed classroom behavior recognition algorithms have some limitations, such as the inability to quickly and accurately identify students' classroom behaviors because they do not consider the motion information of students between consecutive frames. In recent years, action recognition algorithms based on Convolutional Neural Networks (CNN) have improved significantly. To address the limitations of existing algorithms, in this study, a 3D-CNN is selected as a network model for classroom student behavior recognition, which increases information multisourcing and classroom student localization with high accuracy and robustness. For better analysis of human behavior in videos, the 3D convolution extends the 2D convolution to the spatial-temporal domain. In the proposed system, first of all, a real-time picture stream of each student is obtained by combining real-time target detection and tracking. Then, a deep spatiotemporal residual CNN is used to learn the spatiotemporal features of each student's behavior, so, as to achieve real-time recognition of classroom behaviors for multistudent targets in classroom teaching scenarios. To verify the effectiveness of the proposed model, different experiments are conducted using the labeled classroom behavior dataset. The experimental results demonstrate that the proposed model exhibits better performance in classroom behavior recognition. The accurate recognition of classroom behaviors can assist the teachers and students to understand the classroom learning situation and help to promote the development of smart classroom.

引用

页数：8

共 34 条

[1]

[Anonymous], 2014, P ACCV

[2] ABCDM: An Attention-based Bidirectional CNN-RNN Deep Model for sentiment analysis [J].

Basiri, Mohammad Ehsan ;

Nemati, Shahla ;

Abdar, Moloud ;

Cambria, Erik ;

Acharya, U. Rajendra .

FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2021, 115 :279-294

[3] Up Next: Retrieval Methods for Large Scale Related Video Suggestion [J].

Bendersky, Michael ;

Garcia-Pueyo, Lluis ;

Harmsen, Jeremiah ;

Josifovski, Vanja ;

Lepikhin, Dima .

PROCEEDINGS OF THE 20TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING (KDD'14), 2014, :1769-1778

[4] Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields [J].

Cao, Zhe ;

Simon, Tomas ;

Wei, Shih-En ;

Sheikh, Yaser .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :1302-1310

[5] Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset [J].

Carreira, Joao ;

Zisserman, Andrew .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :4724-4733

[6] Fall Detection Based on Key Points of Human-Skeleton Using OpenPose [J].

Chen, Weiming ;

Jiang, Zijie ;

Guo, Hailin ;

Ni, Xiaoyang .

SYMMETRY-BASEL, 2020, 12 (05)

[7]

Dollar P., 2005, Behavior recognition via sparse spatiotemporal feature

[8] Learning Spatiotemporal Features with 3D Convolutional Networks [J].

Du Tran ;

Bourdev, Lubomir ;

Fergus, Rob ;

Torresani, Lorenzo ;

Paluri, Manohar .

2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :4489-4497

[9] Temporal Localization of Actions with Actoms [J].

Gaidon, Adrien ;

Harchaoui, Zaid ;

Schmid, Cordelia .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (11) :2782-2795

[10] Rich feature hierarchies for accurate object detection and semantic segmentation [J].

Girshick, Ross ;

Donahue, Jeff ;

Darrell, Trevor ;

Malik, Jitendra .

2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :580-587

← 1 2 3 4 →