Emotion recognition using hierarchical spatial-temporal learning transformer from regional to global brain

被引:4
|
作者
Cheng, Cheng [1 ]
Liu, Wenzhe [2 ]
Feng, Lin [1 ,3 ]
Jia, Ziyu [4 ]
机构
[1] Dalian Univ Technol, Dept Comp Sci & Technol, Dalian, Peoples R China
[2] Huzhou Univ, Sch Informat Engn, Huzhou, Peoples R China
[3] Dalian Minzu Univ, Sch Informat & Commun Engn, Dlian, Peoples R China
[4] Univ Chinese Acad Sci, Chinese Acad Sci, Brainnetome Ctr, Inst Automat, Beijing, Peoples R China
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
Emotion recognition; Electroencephalogram (EEG); Transformer; Spatiotemporal features; EEG; FUSION;
D O I
10.1016/j.neunet.2024.106624
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Emotion recognition is an essential but challenging task in human-computer interaction systems due to the distinctive spatial structures and dynamic temporal dependencies associated with each emotion. However, current approaches fail to accurately capture the intricate effects of electroencephalogram (EEG) signals across different brain regions on emotion recognition. Therefore, this paper designs a transformer-based method, denoted by R2G-STLT, which relies on a spatial-temporal transformer encoder with regional to global hierarchical learning that learns the representative spatiotemporal features from the electrode level to the brain-region level. The regional spatial-temporal transformer (RST-Trans) encoder is designed to obtain spatial information and context dependence at the electrode level aiming to learn the regional spatiotemporal features. Then, the global spatial-temporal transformer (GST-Trans) encoder is utilized to extract reliable global spatiotemporal features, reflecting the impact of various brain regions on emotion recognition tasks. Moreover, the multi-head attention mechanism is placed into the GST-Trans encoder to empower it to capture the longrange spatial-temporal information among the brain regions. Finally, subject-independent experiments are conducted on each frequency band of the DEAP, SEED, and SEED-IV datasets to assess the performance of the proposed model. Results indicate that the R2G-STLT model surpasses several state-of-the-art approaches.
引用
收藏
页数:12
相关论文
共 50 条
  • [21] Emotion Recognition Using Hierarchical Spatiotemporal Electroencephalogram Information from Local to Global Brain Regions
    Jeong, Dong-Ki
    Kim, Hyoung-Gook
    Kim, Jin-Young
    BIOENGINEERING-BASEL, 2023, 10 (09):
  • [22] Hierarchical Spatial-Temporal Masked Contrast for Skeleton Action Recognition
    Cao, Wenming
    Zhang, Aoyu
    He, Zhihai
    Zhang, Yicha
    Yin, Xinpeng
    IEEE Transactions on Artificial Intelligence, 2024, 5 (11): : 5801 - 5814
  • [23] STILN: A novel spatial-temporal information learning network for EEG-based emotion recognition
    Tang, Yiheng
    Wang, Yongxiong
    Zhang, Xiaoli
    Wang, Zhe
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 85
  • [24] Spatial-Temporal Constraint Learning for Cross-Subject EEG-Based Emotion Recognition
    Li, Wei
    Hou, Bowen
    Shao, Shitong
    Huan, Wei
    Tian, Ye
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [25] A Dual Attention Spatial-Temporal Graph Convolutional Network for Emotion Recognition from Gait
    Liu, Jiaqing
    Kisita, Shoji
    Chai, Shurong
    Tateyama, Tomoko
    Iwamoto, Yutaro
    Chen, Yen-Wei
    Journal of the Institute of Image Electronics Engineers of Japan, 2022, 51 (04): : 309 - 317
  • [26] Learning a spatial-temporal texture transformer network for video inpainting
    Ma, Pengsen
    Xue, Tao
    FRONTIERS IN NEUROROBOTICS, 2022, 16
  • [27] Spatial-temporal transformer for end-to-end sign language recognition
    Cui, Zhenchao
    Zhang, Wenbo
    Li, Zhaoxin
    Wang, Zhaoqi
    COMPLEX & INTELLIGENT SYSTEMS, 2023, 9 (04) : 4645 - 4656
  • [28] ASTT: acoustic spatial-temporal transformer for short utterance speaker recognition
    Wu, Xing
    Li, Ruixuan
    Deng, Bin
    Zhao, Ming
    Du, Xingyue
    Wang, Jianjia
    Ding, Kai
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (21) : 33039 - 33061
  • [29] Spatial-Temporal Transformer Network for Continuous Action Recognition in Industrial Assembly
    Huang, Jianfeng
    Liu, Xiang
    Hu, Huan
    Tang, Shanghua
    Li, Chenyang
    Zhao, Shaoan
    Lin, Yimin
    Wang, Kai
    Liu, Zhaoxiang
    Lian, Shiguo
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT X, ICIC 2024, 2024, 14871 : 114 - 130
  • [30] ASTT: acoustic spatial-temporal transformer for short utterance speaker recognition
    Xing Wu
    Ruixuan Li
    Bin Deng
    Ming Zhao
    Xingyue Du
    Jianjia Wang
    Kai Ding
    Multimedia Tools and Applications, 2023, 82 : 33039 - 33061