MTLFuseNet: A novel emotion recognition model based on deep latent feature fusion of EEG signals and multi-task learning

被引：27

作者：

Li, Rui ^{[1
]}

Ren, Chao ^{[1
]}

Ge, Yiqing ^{[1
]}

Zhao, Qiqi ^{[1
]}

Yang, Yikun ^{[1
]}

Shi, Yuhan ^{[1
]}

Zhang, Xiaowei ^{[1
]}

Hu, Bin ^{[1
]}

机构：

[1] Lanzhou Univ, Sch Informat Sci & Engn, Gansu Prov Key Lab Wearable Comp, Lanzhou 730000, Peoples R China

来源：

KNOWLEDGE-BASED SYSTEMS | 2023年 / 276卷

基金：

中国国家自然科学基金;

关键词：

Emotion recognition; EEG; Feature fusion; Multi -task learning; NEURAL-NETWORK; SYSTEM; LSTM;

D O I：

10.1016/j.knosys.2023.110756

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

How to extract discriminative latent feature representations from electroencephalography (EEG) signals and build a generalized model is a topic in EEG-based emotion recognition research. This study proposed a novel emotion recognition model based on deep latent feature fusion of EEG signals and multi-task learning, referred to as MTLFuseNet. MTLFuseNet learned spatio-temporal latent features of EEG in an unsupervised manner by a variational autoencoder (VAE) and learned the spatio-spectral features of EEG in a supervised manner by a graph convolutional network (GCN) and gated recurrent unit (GRU) network. Afterward, the two latent features were fused to form more complementary and discriminative spatio-temporal-spectral fusion features for EEG signal representation. In addition, MTLFuseNet was constructed based on multi-task learning. The focal loss was introduced to solve the problem of unbalanced sample classes in an emotional dataset, and the triplet-center loss was introduced to make the fused latent feature vectors more discriminative. Finally, a subject-independent leave-one-subject-out cross-validation strategy was used to validate extensively on two public datasets, DEAP and DREAMER. On the DEAP dataset, the average accuracies of valence and arousal are 71.33% and 73.28%, respectively. On the DREAMER dataset, the average accuracies of valence and arousal are 80.43% and 83.33%, respectively. The experimental results show that the proposed MTLFuseNet model achieves excellent recognition performance, outperforming the state-of-the-art methods.& COPY; 2023 Elsevier B.V. All rights reserved.

引用

页数：16

共 69 条

[1] Semi-Supervised EEG Signals Classification System for Epileptic Seizure Detection [J].

Abdelhameed, Ahmed M. ;

Bayoumi, Magdy .

IEEE SIGNAL PROCESSING LETTERS, 2019, 26 (12) :1922-1926

[2] Emotions Recognition Using EEG Signals: A Survey [J].

Alarcao, Soraia M. ;

Fonseca, Manuel J. .

IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2019, 10 (03) :374-393

[3]

Alhagry S, 2017, INT J ADV COMPUT SC, V8, P355, DOI 10.14569/IJACSA.2017.081046

[4]

Anubhav, 2020, 2020 16TH IEEE INTERNATIONAL COLLOQUIUM ON SIGNAL PROCESSING & ITS APPLICATIONS (CSPA 2020), P88, DOI [10.1109/CSPA48992.2020.9068691, 10.1109/cspa48992.2020.9068691]

[5] MIN2Net: End-to-End Multi-Task Learning for Subject-Independent Motor Imagery EEG Classification [J].

Autthasan, Phairot ;

Chaisaen, Rattanaphon ;

Sudhawiyangkul, Thapanun ;

Rangpong, Phurin ;

Kiatthaveephong, Suktipol ;

Dilokthanakul, Nat ;

Bhakdisongkhram, Gun ;

Phan, Huy ;

Guan, Cuntai ;

Wilaiprasitporn, Theerawit .

IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2022, 69 (06) :2105-2118

[6] Unsupervised Learning [J].

Barlow, H. B. .

NEURAL COMPUTATION, 1989, 1 (03) :295-311

[7] Calibration free meta learning based approach for subject independent EEG emotion recognition [J].

Bhosale, Swapnil ;

Chakraborty, Rupayan ;

Kopparapu, Sunil Kumar .

BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2022, 72

[8] Multitask learning [J].

Caruana, R .

MACHINE LEARNING, 1997, 28 (01) :41-75

[9]

Chung JY, 2014, Arxiv, DOI [arXiv:1412.3555, 10.48550/arXiv.1412.3555]

[10] Where Does EEG Come From and What Does It Mean? [J].

Cohen, Michael X. .

TRENDS IN NEUROSCIENCES, 2017, 40 (04) :208-218

← 1 2 3 4 5 6 7 →