Emotion Recognition from Spatio-Temporal Representation of EEG Signals via 3D-CNN with Ensemble Learning Techniques

Cited by: 21
Authors
Yuvaraj, Rajamanickam [1]
Baranwal, Arapan [2]
Prince, A. Amalin [3]
Murugappan, M. [4, 5, 6]
Mohammed, Javeed Shaikh [7]
Affiliations
[1] Nanyang Technol Univ, Natl Inst Educ, Singapore 637616, Singapore
[2] BITS Pilani, Dept Comp Sci & Informat Syst, Sancoale 403726, Goa, India
[3] BITS Pilani, Dept Elect & Elect Engn, Sancoale 403726, Goa, India
[4] Kuwait Coll Sci & Technol, Dept Elect & Commun Engn, Intelligent Signal Proc ISP Res Lab, Block 4, Doha 13133, Kuwait
[5] Vels Inst Sci Technol & Adv Studies, Fac Engn, Dept Elect & Commun Engn, Chennai 600117, Tamilnadu, India
[6] Univ Malaysia Perlis, Ctr Excellence Unmanned Aerial Syst CoEUAS, Kangar 02600, Perlis, Malaysia
[7] Prince Sattam bin Abdulaziz Univ, Coll Appl Med Sci, Dept Biomed Technol, Al Kharj 11942, Saudi Arabia
Keywords
hybrid models; 3D-CNN; deep neural networks; machine learning classifiers; emotion recognition
DOI
10.3390/brainsci13040685
Chinese Library Classification (CLC) number
Q189 [Neuroscience]
Discipline classification code
071006
Abstract
Emotion recognition is one of the most challenging problems in human-computer interaction (HCI). EEG signals are widely used for recognizing emotions because of their ease of acquisition, mobility, and convenience. Deep neural networks (DNNs) have produced excellent results in emotion recognition studies. Most studies, however, still rely on separate methods to extract handcrafted features, such as the Pearson correlation coefficient (PCC), principal component analysis (PCA), and the Higuchi fractal dimension (HFD), even though DNNs are capable of generating meaningful features themselves. Furthermore, most earlier studies largely ignored the spatial information between channels, focusing mainly on time-domain and frequency-domain representations. This study applies a pre-trained 3D-CNN MobileNet model with transfer learning to the spatio-temporal representation of EEG signals to extract features for emotion recognition. In addition to fully connected layers, hybrid models were explored using other decision layers: multilayer perceptron (MLP), k-nearest neighbor (KNN), extreme learning machine (ELM), XGBoost (XGB), random forest (RF), and support vector machine (SVM). The study also investigates the effect of post-processing (filtering) the output labels. Extensive experiments were conducted on the SJTU Emotion EEG Dataset (SEED, three classes) and SEED-IV (four classes), and the results were comparable to the state of the art. The conventional 3D-CNN with an ELM classifier reached a maximum accuracy of 89.18% on SEED and 81.60% on SEED-IV. Post-filtering improved the classification accuracy of the hybrid 3D-CNN with ELM model to 90.85% on SEED and 83.71% on SEED-IV. Accordingly, spatio-temporal features extracted from the EEG, combined with ensemble classifiers, were found to be the most effective in recognizing emotions compared to state-of-the-art methods.
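The hybrid decision layer and the label post-filtering described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. The abstract does not specify the ELM configuration or the type of label filter, so the hidden-layer size, the tanh activation, the ridge term, and the sliding majority-vote filter below are all assumptions, and the names `ELMClassifier` and `majority_filter` are hypothetical. The input `X` stands in for feature vectors that a 3D-CNN would produce.

```python
import numpy as np


class ELMClassifier:
    """Minimal extreme learning machine (illustrative, not the paper's code).

    A fixed random hidden layer maps features to a nonlinear space; only the
    output weights are fitted, in closed form, by ridge-regularized least squares.
    """

    def __init__(self, n_hidden=64, reg=1e-3, seed=0):
        self.n_hidden = n_hidden  # assumed hidden-layer width
        self.reg = reg            # assumed ridge regularization strength
        self.seed = seed

    def _hidden(self, X):
        # Random projection followed by tanh (activation choice is an assumption).
        return np.tanh(X @ self.W + self.b)

    def fit(self, X, y):
        X, y = np.asarray(X, dtype=float), np.asarray(y)
        rng = np.random.default_rng(self.seed)
        self.classes_ = np.unique(y)
        self.W = rng.normal(size=(X.shape[1], self.n_hidden))
        self.b = rng.normal(size=self.n_hidden)
        H = self._hidden(X)
        # One-hot targets, one column per class.
        T = (y[:, None] == self.classes_[None, :]).astype(float)
        # Solve (H^T H + reg * I) beta = H^T T for the output weights.
        A = H.T @ H + self.reg * np.eye(self.n_hidden)
        self.beta = np.linalg.solve(A, H.T @ T)
        return self

    def predict(self, X):
        scores = self._hidden(np.asarray(X, dtype=float)) @ self.beta
        return self.classes_[np.argmax(scores, axis=1)]


def majority_filter(labels, window=5):
    """Replace each label by the most frequent label in a centered window.

    Assumes consecutive predictions come from consecutive EEG segments of the
    same trial, so isolated label flips are likely errors. Ties resolve to the
    smallest label value.
    """
    labels = np.asarray(labels)
    half = window // 2
    out = labels.copy()
    for i in range(len(labels)):
        segment = labels[max(0, i - half): i + half + 1]
        values, counts = np.unique(segment, return_counts=True)
        out[i] = values[np.argmax(counts)]
    return out
```

In use, one would call `ELMClassifier().fit(train_features, train_labels)`, predict on the test segments in their recorded order, and then pass the predictions through `majority_filter`. Filtering only makes sense when neighboring predictions share a true label, which is why the window should not span trial boundaries.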
Pages: 17