EduNet: A New Video Dataset for Understanding Human Activity in the Classroom Environment

被引:11
作者
Sharma, Vijeta [1 ,2 ]
Gupta, Manjari [2 ]
Kumar, Ajai [1 ]
Mishra, Deepti [3 ]
机构
[1] Ctr Dev Adv Comp C DAC, Pune 411008, Maharashtra, India
[2] Banaras Hindu Univ, DST Ctr Interdisciplinary Math Sci, Inst Sci, Varanasi 221005, Uttar Pradesh, India
[3] NTNU Norwegian Univ Sci & Technol, Dept Comp Sci IDI, N-2815 Gjovik, Norway
关键词
artificial intelligence; classroom activity recognition; classroom monitoring; EduNet dataset; education;
D O I
10.3390/s21175699
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Human action recognition in videos has become a popular research area in artificial intelligence (AI) technology. In the past few years, this research has accelerated in areas such as sports, daily activities, kitchen activities, etc., due to developments in the benchmarks proposed for human action recognition datasets in these areas. However, there is little research in the benchmarking datasets for human activity recognition in educational environments. Therefore, we developed a dataset of teacher and student activities to expand the research in the education domain. This paper proposes a new dataset, called EduNet, for a novel approach towards developing human action recognition datasets in classroom environments. EduNet has 20 action classes, containing around 7851 manually annotated clips extracted from YouTube videos, and recorded in an actual classroom environment. Each action category has a minimum of 200 clips, and the total duration is approximately 12 h. To the best of our knowledge, EduNet is the first dataset specially prepared for classroom monitoring for both teacher and student activities. It is also a challenging dataset of actions as it has many clips (and due to the unconstrained nature of the clips). We compared the performance of the EduNet dataset with benchmark video datasets UCF101 and HMDB51 on a standard I3D-ResNet-50 model, which resulted in 72.3% accuracy. The development of a new benchmark dataset for the education domain will benefit future research concerning classroom monitoring systems. The EduNet dataset is a collection of classroom activities from 1 to 12 standard schools.
引用
收藏
页数:18
相关论文
共 43 条
[1]  
[Anonymous], 2015, P INT C LEARN REPR
[2]   Vision-based human activity recognition: a survey [J].
Beddiar, Djamila Romaissa ;
Nini, Brahim ;
Sabokrou, Mohammad ;
Hadid, Abdenour .
MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (41-42) :30509-30555
[3]  
Heilbron FC, 2015, PROC CVPR IEEE, P961, DOI 10.1109/CVPR.2015.7298698
[4]  
Carreira J., 2018, arXiv
[5]   Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset [J].
Carreira, Joao ;
Zisserman, Andrew .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :4724-4733
[6]  
Carreira Joao, 2019, CoRR
[7]  
Cheng YY, 2020, CHIN CONT DECIS CONF, P128, DOI 10.1109/CCDC49329.2020.9164040
[8]   Histograms of oriented gradients for human detection [J].
Dalal, N ;
Triggs, B .
2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, :886-893
[9]   Scaling Egocentric Vision: The EPIC-KITCHENS Dataset [J].
Damen, Dima ;
Doughty, Hazel ;
Farinella, Giovanni Maria ;
Fidler, Sanja ;
Furnari, Antonino ;
Kazakos, Evangelos ;
Moltisanti, Davide ;
Munro, Jonathan ;
Perrett, Toby ;
Price, Will ;
Wray, Michael .
COMPUTER VISION - ECCV 2018, PT IV, 2018, 11208 :753-771
[10]  
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848