Automatic detection of students' affective states in classroom environment using hybrid convolutional neural networks

被引:75
作者
Ashwin, T. S. [1 ]
Guddeti, Ram Mohana Reddy [1 ]
机构
[1] Natl Inst Technol Karnataka Surathkal, Informat Technol Dept, Dakshina Kannada, Karnataka, India
关键词
Affective computing; Affective states; Convolutional neural network; Classroom environemnt; Facial emotion recognition; Student engagement; FACIAL EXPRESSION RECOGNITION; EMOTIONS; ENGAGEMENT;
D O I
10.1007/s10639-019-10004-6
中图分类号
G40 [教育学];
学科分类号
040101 ; 120403 ;
摘要
Predicting the students' emotional and behavioral engagements using computer vision techniques is a challenging task. Though there are several state-of-the-art techniques for analyzing a student's affective states in an e-learning environment (single person's engagement detection in a single image frame), a very few works are available for analyzing the students' affective states in a classroom environment (multiple people in a single image frame). Hence, in this paper, we propose a novel hybrid convolutional neural network (CNN) architecture for analyzing the students' affective states in a classroom environment. This proposed architecture consists of two models, the first model (CNN-1) is designed to analyze the affective states of a single student in a single image frame and the second model (CNN-2) uses multiple students in a single image frame. Thus, our proposed hybrid architecture predicts the overall affective state of the entire class. The proposed architecture uses the students' facial expressions, hand gestures and body postures for analyzing their affective states. Further, due to unavailability of standard datasets for the students' affective state analysis, we created, annotated and tested on our dataset of over 8000 single face in a single image frame and 12000 multiple faces in a single image frame with three different affective states, namely: engaged, boredom and neutral. The experimental results demonstrate an accuracy of 86% and 70% for posed and spontaneous affective states of classroom data, respectively.
引用
收藏
页码:1387 / 1415
页数:29
相关论文
共 70 条
[1]   Synthesis of Realistic Facial Expressions Using Expression Map [J].
Agarwal, Swapna ;
Mukherjee, Dipti Prasad .
IEEE TRANSACTIONS ON MULTIMEDIA, 2019, 21 (04) :902-914
[2]  
AIQIN Z, 2006, INT C INF AUT 2006 I, P245
[3]  
[Anonymous], 2005, P 17 C COMP LING SPE
[4]  
Arifin S., 2007, Proceedings of the 15th International Conference on Multimedia, P68, DOI DOI 10.1145/1291233.1291251
[5]   An E-learning System With Multifacial Emotion Recognition Using Supervised Machine Learning [J].
Ashwin, T. S. ;
Jose, Jijo ;
Raghu, G. ;
Reddy, G. Ram Mohana .
2015 IEEE SEVENTH INTERNATIONAL CONFERENCE ON TECHNOLOGY FOR EDUCATION (T4E), 2015, :23-26
[6]  
Balaam M, 2010, CHI2010: PROCEEDINGS OF THE 28TH ANNUAL CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, VOLS 1-4, P1623
[7]   Affective Video Content Analysis: A Multidisciplinary Insight [J].
Baveye, Yoann ;
Chamaret, Christel ;
Dellandrea, Emmanuel ;
Chen, Liming .
IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2018, 9 (04) :396-409
[8]   The Affective Experience of Novice Computer Programmers [J].
Bosch N. ;
D’Mello S. .
International Journal of Artificial Intelligence in Education, 2017, 27 (1) :181-206
[9]   Using Video to Automatically Detect Learner Affect in Computer-Enabled Classrooms [J].
Bosch, Nigel ;
D'Mello, Sidney K. ;
Ocumpaugh, Jaclyn ;
Baker, Ryan S. ;
Shute, Valerie .
ACM TRANSACTIONS ON INTERACTIVE INTELLIGENT SYSTEMS, 2016, 6 (02)
[10]  
CHEN J, 2006, 7 INT C COMP AID IND, P1