CorrNet: Fine-Grained Emotion Recognition for Video Watching Using Wearable Physiological Sensors

被引:41
作者
Zhang, Tianyi [1 ,2 ]
El Ali, Abdallah [2 ]
Wang, Chen [3 ,4 ]
Hanjalic, Alan [1 ]
Cesar, Pablo [1 ,2 ]
机构
[1] Delft Univ Technol, Multimedia Comp Grp, NL-2600 AA Delft, Netherlands
[2] Ctr Wiskunde & Informat CWI, NL-1098XG Amsterdam, Netherlands
[3] Xinhuanet, Future Media & Convergence Inst, Beijing 100000, Peoples R China
[4] Xinhua News Agcy, State Key Lab Media Convergence Prod Technol & Sy, Beijing 100000, Peoples R China
关键词
emotion recognition; video; physiological signals; machine learning; SYSTEM; TECHNOLOGY; FRAMEWORK; SIGNALS; CONTEXT; SET;
D O I
10.3390/s21010052
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Recognizing user emotions while they watch short-form videos anytime and anywhere is essential for facilitating video content customization and personalization. However, most works either classify a single emotion per video stimuli, or are restricted to static, desktop environments. To address this, we propose a correlation-based emotion recognition algorithm (CorrNet) to recognize the valence and arousal (V-A) of each instance (fine-grained segment of signals) using only wearable, physiological signals (e.g., electrodermal activity, heart rate). CorrNet takes advantage of features both inside each instance (intra-modality features) and between different instances for the same video stimuli (correlation-based features). We first test our approach on an indoor-desktop affect dataset (CASE), and thereafter on an outdoor-mobile affect dataset (MERCA) which we collected using a smart wristband and wearable eyetracker. Results show that for subject-independent binary classification (high-low), CorrNet yields promising recognition accuracies: 76.37% and 74.03% for V-A on CASE, and 70.29% and 68.15% for V-A on MERCA. Our findings show: (1) instance segment lengths between 1-4 s result in highest recognition accuracies (2) accuracies between laboratory-grade and wearable sensors are comparable, even under low sampling rates (<= 64 Hz) (3) large amounts of neutral V-A labels, an artifact of continuous affect annotation, result in varied recognition performance.
引用
收藏
页码:1 / 25
页数:25
相关论文
共 123 条
[21]   Physiological Signal-Based Method for Measurement of Pain Intensity [J].
Chu, Yaqi ;
Zhao, Xingang ;
Han, Jianda ;
Su, Yang .
FRONTIERS IN NEUROSCIENCE, 2017, 11
[22]   Emotions detection on an ambient intelligent system using wearable devices [J].
Costa, Angelo ;
Rincon, Jaime A. ;
Carrascosa, Carlos ;
Julian, Vicente ;
Novais, Paulo .
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2019, 92 :479-489
[23]   Deep learning for electroencephalogram (EEG) classification tasks: a review [J].
Craik, Alexander ;
He, Yongtian ;
Contreras-Vidal, Jose L. .
JOURNAL OF NEURAL ENGINEERING, 2019, 16 (03)
[24]   Interoception and emotion [J].
Critchley, Hugo D. ;
Garfinkel, Sarah N. .
CURRENT OPINION IN PSYCHOLOGY, 2017, 17 :7-14
[25]  
Daniels R.W., 1974, Approximation Methods for Electronic Filter Design
[26]   Unobtrusive assessment of students’ emotional engagement during lectures using electrodermal activity sensors [J].
Di Lascio, Elena ;
Gashi, Shkurta ;
Santini, Silvia .
Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies, 2018, 2 (03)
[27]  
Ding S.X., 2020, IEEE INTELL TRANSP S, DOI [10.1109/TITS.2020.3029946, DOI 10.1109/TITS.2020.3029946]
[28]   A machine learning model for emotion recognition from physiological signals [J].
Dominguez-Jimenez, J. A. ;
Campo-Landines, K. C. ;
Martinez-Santos, J. C. ;
Delahoz, E. J. ;
Contreras-Ortiz, S. H. .
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2020, 55
[29]   Driver Fatigue Detection using Recurrent Neural Networks [J].
Ed-Doughmi, Younes ;
Idrissi, Najlae .
PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON NETWORKING, INFORMATION SYSTEMS & SECURITY (NISS19), 2019,
[30]   AN ARGUMENT FOR BASIC EMOTIONS [J].
EKMAN, P .
COGNITION & EMOTION, 1992, 6 (3-4) :169-200