MTLFuseNet: A novel emotion recognition model based on deep latent feature fusion of EEG signals and multi-task learning

被引:22
作者
Li, Rui [1 ]
Ren, Chao [1 ]
Ge, Yiqing [1 ]
Zhao, Qiqi [1 ]
Yang, Yikun [1 ]
Shi, Yuhan [1 ]
Zhang, Xiaowei [1 ]
Hu, Bin [1 ]
机构
[1] Lanzhou Univ, Sch Informat Sci & Engn, Gansu Prov Key Lab Wearable Comp, Lanzhou 730000, Peoples R China
基金
中国国家自然科学基金;
关键词
Emotion recognition; EEG; Feature fusion; Multi -task learning; NEURAL-NETWORK; SYSTEM; LSTM;
D O I
10.1016/j.knosys.2023.110756
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
How to extract discriminative latent feature representations from electroencephalography (EEG) signals and build a generalized model is a topic in EEG-based emotion recognition research. This study proposed a novel emotion recognition model based on deep latent feature fusion of EEG signals and multi-task learning, referred to as MTLFuseNet. MTLFuseNet learned spatio-temporal latent features of EEG in an unsupervised manner by a variational autoencoder (VAE) and learned the spatio-spectral features of EEG in a supervised manner by a graph convolutional network (GCN) and gated recurrent unit (GRU) network. Afterward, the two latent features were fused to form more complementary and discriminative spatio-temporal-spectral fusion features for EEG signal representation. In addition, MTLFuseNet was constructed based on multi-task learning. The focal loss was introduced to solve the problem of unbalanced sample classes in an emotional dataset, and the triplet-center loss was introduced to make the fused latent feature vectors more discriminative. Finally, a subject-independent leave-one-subject-out cross-validation strategy was used to validate extensively on two public datasets, DEAP and DREAMER. On the DEAP dataset, the average accuracies of valence and arousal are 71.33% and 73.28%, respectively. On the DREAMER dataset, the average accuracies of valence and arousal are 80.43% and 83.33%, respectively. The experimental results show that the proposed MTLFuseNet model achieves excellent recognition performance, outperforming the state-of-the-art methods.& COPY; 2023 Elsevier B.V. All rights reserved.
引用
收藏
页数:16
相关论文
共 69 条
  • [1] Semi-Supervised EEG Signals Classification System for Epileptic Seizure Detection
    Abdelhameed, Ahmed M.
    Bayoumi, Magdy
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2019, 26 (12) : 1922 - 1926
  • [2] Emotions Recognition Using EEG Signals: A Survey
    Alarcao, Soraia M.
    Fonseca, Manuel J.
    [J]. IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2019, 10 (03) : 374 - 393
  • [3] Alhagry S, 2017, INT J ADV COMPUT SC, V8, P355, DOI 10.14569/IJACSA.2017.081046
  • [4] Anubhav, 2020, 2020 16TH IEEE INTERNATIONAL COLLOQUIUM ON SIGNAL PROCESSING & ITS APPLICATIONS (CSPA 2020), P88, DOI [10.1109/CSPA48992.2020.9068691, 10.1109/cspa48992.2020.9068691]
  • [5] MIN2Net: End-to-End Multi-Task Learning for Subject-Independent Motor Imagery EEG Classification
    Autthasan, Phairot
    Chaisaen, Rattanaphon
    Sudhawiyangkul, Thapanun
    Rangpong, Phurin
    Kiatthaveephong, Suktipol
    Dilokthanakul, Nat
    Bhakdisongkhram, Gun
    Phan, Huy
    Guan, Cuntai
    Wilaiprasitporn, Theerawit
    [J]. IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2022, 69 (06) : 2105 - 2118
  • [6] Unsupervised Learning
    Barlow, H. B.
    [J]. NEURAL COMPUTATION, 1989, 1 (03) : 295 - 311
  • [7] Calibration free meta learning based approach for subject independent EEG emotion recognition
    Bhosale, Swapnil
    Chakraborty, Rupayan
    Kopparapu, Sunil Kumar
    [J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2022, 72
  • [8] Multitask learning
    Caruana, R
    [J]. MACHINE LEARNING, 1997, 28 (01) : 41 - 75
  • [9] Chung JY, 2014, Arxiv, DOI [arXiv:1412.3555, 10.48550/arXiv.1412.3555]
  • [10] Where Does EEG Come From and What Does It Mean?
    Cohen, Michael X.
    [J]. TRENDS IN NEUROSCIENCES, 2017, 40 (04) : 208 - 218