MTLFuseNet: A novel emotion recognition model based on deep latent feature fusion of EEG signals and multi-task learning

被引:27
作者
Li, Rui [1 ]
Ren, Chao [1 ]
Ge, Yiqing [1 ]
Zhao, Qiqi [1 ]
Yang, Yikun [1 ]
Shi, Yuhan [1 ]
Zhang, Xiaowei [1 ]
Hu, Bin [1 ]
机构
[1] Lanzhou Univ, Sch Informat Sci & Engn, Gansu Prov Key Lab Wearable Comp, Lanzhou 730000, Peoples R China
基金
中国国家自然科学基金;
关键词
Emotion recognition; EEG; Feature fusion; Multi -task learning; NEURAL-NETWORK; SYSTEM; LSTM;
D O I
10.1016/j.knosys.2023.110756
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
How to extract discriminative latent feature representations from electroencephalography (EEG) signals and build a generalized model is a topic in EEG-based emotion recognition research. This study proposed a novel emotion recognition model based on deep latent feature fusion of EEG signals and multi-task learning, referred to as MTLFuseNet. MTLFuseNet learned spatio-temporal latent features of EEG in an unsupervised manner by a variational autoencoder (VAE) and learned the spatio-spectral features of EEG in a supervised manner by a graph convolutional network (GCN) and gated recurrent unit (GRU) network. Afterward, the two latent features were fused to form more complementary and discriminative spatio-temporal-spectral fusion features for EEG signal representation. In addition, MTLFuseNet was constructed based on multi-task learning. The focal loss was introduced to solve the problem of unbalanced sample classes in an emotional dataset, and the triplet-center loss was introduced to make the fused latent feature vectors more discriminative. Finally, a subject-independent leave-one-subject-out cross-validation strategy was used to validate extensively on two public datasets, DEAP and DREAMER. On the DEAP dataset, the average accuracies of valence and arousal are 71.33% and 73.28%, respectively. On the DREAMER dataset, the average accuracies of valence and arousal are 80.43% and 83.33%, respectively. The experimental results show that the proposed MTLFuseNet model achieves excellent recognition performance, outperforming the state-of-the-art methods.& COPY; 2023 Elsevier B.V. All rights reserved.
引用
收藏
页数:16
相关论文
共 69 条
[1]   Semi-Supervised EEG Signals Classification System for Epileptic Seizure Detection [J].
Abdelhameed, Ahmed M. ;
Bayoumi, Magdy .
IEEE SIGNAL PROCESSING LETTERS, 2019, 26 (12) :1922-1926
[2]   Emotions Recognition Using EEG Signals: A Survey [J].
Alarcao, Soraia M. ;
Fonseca, Manuel J. .
IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2019, 10 (03) :374-393
[3]  
Alhagry S, 2017, INT J ADV COMPUT SC, V8, P355, DOI 10.14569/IJACSA.2017.081046
[4]  
Anubhav, 2020, 2020 16TH IEEE INTERNATIONAL COLLOQUIUM ON SIGNAL PROCESSING & ITS APPLICATIONS (CSPA 2020), P88, DOI [10.1109/CSPA48992.2020.9068691, 10.1109/cspa48992.2020.9068691]
[5]   MIN2Net: End-to-End Multi-Task Learning for Subject-Independent Motor Imagery EEG Classification [J].
Autthasan, Phairot ;
Chaisaen, Rattanaphon ;
Sudhawiyangkul, Thapanun ;
Rangpong, Phurin ;
Kiatthaveephong, Suktipol ;
Dilokthanakul, Nat ;
Bhakdisongkhram, Gun ;
Phan, Huy ;
Guan, Cuntai ;
Wilaiprasitporn, Theerawit .
IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2022, 69 (06) :2105-2118
[6]   Unsupervised Learning [J].
Barlow, H. B. .
NEURAL COMPUTATION, 1989, 1 (03) :295-311
[7]   Calibration free meta learning based approach for subject independent EEG emotion recognition [J].
Bhosale, Swapnil ;
Chakraborty, Rupayan ;
Kopparapu, Sunil Kumar .
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2022, 72
[8]   Multitask learning [J].
Caruana, R .
MACHINE LEARNING, 1997, 28 (01) :41-75
[9]  
Chung JY, 2014, Arxiv, DOI [arXiv:1412.3555, 10.48550/arXiv.1412.3555]
[10]   Where Does EEG Come From and What Does It Mean? [J].
Cohen, Michael X. .
TRENDS IN NEUROSCIENCES, 2017, 40 (04) :208-218