Spatial-Temporal Recurrent Neural Network for Emotion Recognition

被引:258
|
作者
Zhang, Tong [1 ,2 ]
Zheng, Wenming [3 ]
Cui, Zhen [4 ]
Zong, Yuan [3 ]
Li, Yang [1 ,2 ]
机构
[1] Southeast Univ, Key Lab Child Dev & Learning Sci, Minist Educ, Nanjing 210096, Jiangsu, Peoples R China
[2] Southeast Univ, Dept Informat Sci & Engn, Nanjing 210096, Jiangsu, Peoples R China
[3] Southeast Univ, Res Ctr Learning Sci, Minist Educ, Key Lab Child Dev & Learning Sci, Nanjing 210096, Jiangsu, Peoples R China
[4] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing 210094, Jiangsu, Peoples R China
基金
中国国家自然科学基金;
关键词
Electroencephalogram (EEG) emotion recognition; emotion recognition; facial expression recognition; spatial- temporal recurrent neural network (STRNN);
D O I
10.1109/TCYB.2017.2788081
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we propose a novel deep learning framework, called spatial-temporal recurrent neural network (STRNN), to integrate the feature learning from both spatial and temporal information of signal sources into a unified spatial-temporal dependency model. In STRNN, to capture those spatially co-occurrent variations of human emotions, a multidirectional recurrent neural network (RNN) layer is employed to capture long-range contextual cues by traversing the spatial regions of each temporal slice along different directions. Then a hi-directional temporal RNN layer is further used to learn the discriminative features characterizing the temporal dependencies of the sequences, where sequences are produced from the spatial RNN layer. To further select those salient regions with more discriminative ability for emotion recognition, we impose sparse projection onto those hidden states of spatial and temporal domains to improve the model discriminant ability. Consequently, the proposed two-layer RNN model provides an effective way to make use of both spatial and temporal dependencies of the input signals for emotion recognition. Experimental results on the public emotion datasets of electroencephalogram and facial expression demonstrate the proposed STRNN method is more competitive over those state-of-the-art methods.
引用
收藏
页码:839 / 847
页数:9
相关论文
共 50 条
  • [31] ASTDF-Net: Attention-Based Spatial-Temporal Dual-Stream Fusion Network for EEG-Based Emotion Recognition
    Gong, Peiliang
    Jia, Ziyu
    Wang, Pengpai
    Zhou, Yueying
    Zhang, Daoqiang
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 883 - 892
  • [32] Unsupervised Recurrent Neural Network with Parametric Bias Framework for Human Emotion Recognition with Multimodal Sensor Data Fusion
    Li, Jie
    Zhong, Junpei
    Wang, Min
    SENSORS AND MATERIALS, 2020, 32 (04) : 1261 - 1277
  • [33] Skeleton-Based Emotion Recognition Based on Two-Stream Self-Attention Enhanced Spatial-Temporal Graph Convolutional Network
    Shi, Jiaqi
    Liu, Chaoran
    Ishi, Carlos Toshinori
    Ishiguro, Hiroshi
    SENSORS, 2021, 21 (01) : 1 - 16
  • [34] Emotion Recognition with Spatial Attention and Temporal Softmax Pooling
    Aminbeidokhti, Masih
    Pedersoli, Marco
    Cardinal, Patrick
    Granger, Eric
    IMAGE ANALYSIS AND RECOGNITION, ICIAR 2019, PT I, 2019, 11662 : 323 - 331
  • [35] A randomized deep neural network for emotion recognition with landmarks detection
    Di Luzio, Francesco
    Rosato, Antonello
    Panella, Massimo
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 81
  • [36] EEG emotion recognition based on TQWT-features and hybrid convolutional recurrent neural network
    Zhong, Mei-yu
    Yang, Qing-yu
    Liu, Yi
    Zhen, Bo-yu
    Zhao, Feng-da
    Xie, Bei-bei
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 79
  • [37] A novel convolutional neural network with gated recurrent unit for automated speech emotion recognition and classification
    Prakash, P. Ravi
    Anuradha, D.
    Iqbal, Javid
    Galety, Mohammad Gouse
    Singh, Ruby
    Neelakandan, S.
    JOURNAL OF CONTROL AND DECISION, 2023, 10 (01) : 54 - 63
  • [38] Discretized Continuous Speech Emotion Recognition with Multi-Task Deep Recurrent Neural Network
    Duc Le
    Aldeneh, Zakaria
    Provost, Emily Mower
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 1108 - 1112
  • [39] EEG-based emotion recognition using a temporal-difference minimizing neural network
    Ju, Xiangyu
    Li, Ming
    Tian, Wenli
    Hu, Dewen
    COGNITIVE NEURODYNAMICS, 2024, 18 (02) : 405 - 416
  • [40] Facial Expression Recognition with Identity and Spatial-temporal Integrated Learning
    Teng, Halting
    Zhang, Dong
    Li, Ming
    Huang, Yudong
    2019 8TH INTERNATIONAL CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION WORKSHOPS AND DEMOS (ACIIW), 2019, : 100 - 104