Automated accurate emotion recognition system using rhythm-specific deep convolutional neural network technique with multi-channel EEG signals

被引:82
作者
Maheshwari, Daksh [1 ]
Ghosh, S. K. [1 ]
Tripathy, R. K. [1 ]
Sharma, Manish [2 ]
Acharya, U. Rajendra [3 ,4 ,5 ]
机构
[1] BITS Pilani, Dept Elect & Elect Engn, Hyderabad Campus, Hyderabad 500078, India
[2] IITRAM, Dept Elect & Comp Sci Engn, Ahmadabad, Gujarat, India
[3] Ngee Ann Polytech, Dept Elect & Comp Engn, Singapore, Singapore
[4] Asia Univ, Dept Bioinformat & Med Engn, Taichung, Taiwan
[5] Kumamoto Univ, Int Res Org Adv Sci & Technol, Kumamoto, Japan
关键词
Emotion recognition; Multi-channel EEG; Rhythms; Deep CNN; Channel selection; Classification; FEATURE-EXTRACTION; FEATURE-SELECTION; DESYNCHRONIZATION; SYNCHRONIZATION;
D O I
10.1016/j.compbiomed.2021.104428
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Emotion is interpreted as a psycho-physiological process, and it is associated with personality, behavior, motivation, and character of a person. The objective of affective computing is to recognize different types of emotions for human-computer interaction (HCI) applications. The spatiotemporal brain electrical activity is measured using multi-channel electroencephalogram (EEG) signals. Automated emotion recognition using multi-channel EEG signals is an exciting research topic in cognitive neuroscience and affective computing. This paper proposes the rhythm-specific multi-channel convolutional neural network (CNN) based approach for automated emotion recognition using multi-channel EEG signals. The delta (delta), theta (theta), alpha (alpha), beta (beta), and gamma (gamma) rhythms of EEG signal for each channel are evaluated using band-pass filters. The EEG rhythms from the selected channels coupled with deep CNN are used for emotion classification tasks such as low-valence (LV) vs. high valence (HV), low-arousal (LA) vs. high-arousal (HA), and low-dominance (LD) vs. high dominance (HD) respectively. The deep CNN architecture considered in the proposed work has eight convolutions, three average pooling, four batch-normalization, three spatial drop-outs, two drop-outs, one global average pooling and, three dense layers. We have validated our developed model using three publicly available databases: DEAP, DREAMER, and DASPS. The results reveal that the proposed multivariate deep CNN approach coupled with beta-rhythm has obtained the accuracy values of 98.91%, 98.45%, and 98.69% for LV vs. HV, LA vs. HA, and LD vs. HD emotion classification strategies, respectively using DEAP database with 10-fold cross-validation (CV) scheme. Similarly, the accuracy values of 98.56%, 98.82%, and 98.99% are obtained for LV vs. HV, LA vs. HA, and LD vs. HD classification schemes, respectively, using deep CNN and theta-rhythm. The proposed multi-channel rhythm-specific deep CNN classification model has obtained the average accuracy value of 57.14% using alpha-rhythm and trial-specific CV using DASPS database. Moreover, for 8-quadrant based emotion classification strategy, the deep CNN based classifier has obtained an overall accuracy value of 24.37% using gamma-rhythms of multi-channel EEG signals. Our developed deep CNN model can be used for real-time automated emotion recognition applications.
引用
收藏
页数:15
相关论文
共 73 条
[1]   Analysis of evoked EEG synchronization and desynchronization in conditions of emotional activation in humans: Temporal and topographic characteristics [J].
Aftanas L.I. ;
Reva N.V. ;
Varlamov A.A. ;
Pavlov S.V. ;
Makhnev V.P. .
Neuroscience and Behavioral Physiology, 2004, 34 (8) :859-867
[2]   Time-dependent cortical asymmetries induced by emotional arousal: EEG analysis of event-related synchronization and desynchronization in individually defined frequency bands [J].
Aftanas, LI ;
Varlamov, AA ;
Pavlov, SV ;
Makhnev, VP ;
Reva, NV .
INTERNATIONAL JOURNAL OF PSYCHOPHYSIOLOGY, 2002, 44 (01) :67-82
[3]   EEG-Based Emotion Recognition Using Quadratic Time-Frequency Distribution [J].
Alazrai, Rami ;
Homoud, Rasha ;
Alwanni, Hisham ;
Daoud, Mohammad I. .
SENSORS, 2018, 18 (08)
[4]  
Alhagry S, 2017, INT J ADV COMPUT SC, V8, P355, DOI 10.14569/IJACSA.2017.081046
[5]   Fusing highly dimensional energy and connectivity features to identify affective states from EEG signals [J].
Arnau-Gonzalez, Pablo ;
Arevalillo-Herraez, Miguel ;
Ramzan, Naeem .
NEUROCOMPUTING, 2017, 244 :81-89
[6]   Psychological stimulation for anxious states detection based on EEG-related features [J].
Baghdadi, Asma ;
Aribi, Yassine ;
Fourati, Rahma ;
Halouani, Najla ;
Siarry, Patrick ;
Alimi, Adel .
JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2021, 12 (08) :8519-8533
[7]   EEG differences between eyes-closed and eyes-open resting conditions [J].
Barry, Robert J. ;
Clarke, Adam R. ;
Johnstone, Stuart J. ;
Magee, Christopher A. ;
Rushby, Jacqueline A. .
CLINICAL NEUROPHYSIOLOGY, 2007, 118 (12) :2765-2773
[8]   EEG differences in children between eyes-closed and eyes-open resting conditions [J].
Barry, Robert J. ;
Clarke, Adam R. ;
Johnstone, Stuart J. ;
Brown, Christopher R. .
CLINICAL NEUROPHYSIOLOGY, 2009, 120 (10) :1806-1811
[9]   A Novel Multivariate-Multiscale Approach for Computing EEG Spectral and Temporal Complexity for Human Emotion Recognition [J].
Bhattacharyya, Abhijit ;
Tripathy, Rajesh Kumar ;
Garg, Lalit ;
Pachori, Ram Bilas .
IEEE SENSORS JOURNAL, 2021, 21 (03) :3579-3591
[10]  
Bisong E., 2019, Building machine learning and deep learning models on Google cloud platform: a comprehensive guide for beginners, P59, DOI [DOI 10.1007/978-1-4842-4470-8_19, DOI 10.1007/978-1-4842-4470-8_7]