Emotion recognition using hierarchical spatial-temporal learning transformer from regional to global brain

被引:4
|
作者
Cheng, Cheng [1 ]
Liu, Wenzhe [2 ]
Feng, Lin [1 ,3 ]
Jia, Ziyu [4 ]
机构
[1] Dalian Univ Technol, Dept Comp Sci & Technol, Dalian, Peoples R China
[2] Huzhou Univ, Sch Informat Engn, Huzhou, Peoples R China
[3] Dalian Minzu Univ, Sch Informat & Commun Engn, Dlian, Peoples R China
[4] Univ Chinese Acad Sci, Chinese Acad Sci, Brainnetome Ctr, Inst Automat, Beijing, Peoples R China
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
Emotion recognition; Electroencephalogram (EEG); Transformer; Spatiotemporal features; EEG; FUSION;
D O I
10.1016/j.neunet.2024.106624
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Emotion recognition is an essential but challenging task in human-computer interaction systems due to the distinctive spatial structures and dynamic temporal dependencies associated with each emotion. However, current approaches fail to accurately capture the intricate effects of electroencephalogram (EEG) signals across different brain regions on emotion recognition. Therefore, this paper designs a transformer-based method, denoted by R2G-STLT, which relies on a spatial-temporal transformer encoder with regional to global hierarchical learning that learns the representative spatiotemporal features from the electrode level to the brain-region level. The regional spatial-temporal transformer (RST-Trans) encoder is designed to obtain spatial information and context dependence at the electrode level aiming to learn the regional spatiotemporal features. Then, the global spatial-temporal transformer (GST-Trans) encoder is utilized to extract reliable global spatiotemporal features, reflecting the impact of various brain regions on emotion recognition tasks. Moreover, the multi-head attention mechanism is placed into the GST-Trans encoder to empower it to capture the longrange spatial-temporal information among the brain regions. Finally, subject-independent experiments are conducted on each frequency band of the DEAP, SEED, and SEED-IV datasets to assess the performance of the proposed model. Results indicate that the R2G-STLT model surpasses several state-of-the-art approaches.
引用
收藏
页数:12
相关论文
共 50 条
  • [11] Feature hypergraph representation learning on spatial-temporal correlations for EEG emotion recognition
    Menghang Li
    Min Qiu
    Li Zhu
    Wanzeng Kong
    Cognitive Neurodynamics, 2023, 17 : 1271 - 1281
  • [12] Spatial-Temporal Transformer for Crime Recognition in Surveillance Videos
    Boekhoudt, Kayleigh
    Talavera, Estefania
    2022 18TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED VIDEO AND SIGNAL BASED SURVEILLANCE (AVSS 2022), 2022,
  • [13] Spatial-Temporal Recurrent Neural Network for Emotion Recognition
    Zhang, Tong
    Zheng, Wenming
    Cui, Zhen
    Zong, Yuan
    Li, Yang
    IEEE TRANSACTIONS ON CYBERNETICS, 2019, 49 (03) : 839 - 847
  • [14] Adaptive Spatial-Temporal Aware Graph Learning for EEG-Based Emotion Recognition
    Ye, Weishan
    Wang, Jiyuan
    Chen, Lin
    Dai, Lifei
    Sun, Zhe
    Liang, Zhen
    CYBORG AND BIONIC SYSTEMS, 2024, 5
  • [15] STGATE: Spatial-temporal graph attention network with a transformer encoder for EEG-based emotion recognition
    Li, Jingcong
    Pan, Weijian
    Huang, Haiyun
    Pan, Jiahui
    Wang, Fei
    FRONTIERS IN HUMAN NEUROSCIENCE, 2023, 17
  • [16] Hierarchy Spatial-Temporal Transformer for Action Recognition in Short Videos
    Cai, Guoyong
    Cai, Yumeng
    FUZZY SYSTEMS AND DATA MINING VI, 2020, 331 : 760 - 774
  • [17] GroupFormer: Group Activity Recognition with Clustered Spatial-Temporal Transformer
    Li, Shuaicheng
    Cao, Qianggang
    Liu, Lingbo
    Yang, Kunlin
    Liu, Shinan
    Hou, Jun
    Yi, Shuai
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 13648 - 13657
  • [18] Sparse Spatial-Temporal Emotion Graph Convolutional Network for Video Emotion Recognition
    Liu, Xiaodong
    Xu, Huating
    Wang, Miao
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [19] Convolution spatial-temporal attention network for EEG emotion recognition
    Cao, Lei
    Yu, Binlong
    Dong, Yilin
    Liu, Tianyu
    Li, Jie
    PHYSIOLOGICAL MEASUREMENT, 2024, 45 (12)
  • [20] Multimodal Fusion of Spatial-Temporal Features for Emotion Recognition in the Wild
    Wang, Zuchen
    Fang, Yuchun
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2017, PT I, 2018, 10735 : 205 - 214