CONTEXT-AWARE ATTENTION MECHANISM FOR SPEECH EMOTION RECOGNITION

被引:0
|
作者
Ramet, Gaetan [1 ,3 ]
Garner, Philip N. [2 ]
Baeriswyl, Michael [3 ]
Lazaridis, Alexandros [3 ]
机构
[1] Ecole Polytech Fed Lausanne, Lausanne, Switzerland
[2] Idiap Res Inst, Martigny, Switzerland
[3] Swisscom, Artificial Intelligence & Machine Learning Grp, Bern, Switzerland
来源
2018 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2018) | 2018年
基金
欧盟地平线“2020”;
关键词
Speech Emotion Recognition; Attention; Deep Learning; Neural Network;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this work, we study the use of attention mechanisms to enhance the performance of the state-of-the-art deep learning model in Speech Emotion Recognition (SER). We introduce a new Long Short-Term Memory (LSTM)-based neural network attention model which is able to take into account the temporal information in speech during the computation of the attention vector. The proposed LSTM-based model is evaluated on the IEMOCAP dataset using a 5-fold cross-validation scheme and achieved 68.8% weighted accuracy on 4 classes, which outperforms the state-of-the-art models.
引用
收藏
页码:126 / 131
页数:6
相关论文
共 50 条
  • [1] Context-Aware Attention Network for Human Emotion Recognition in Video
    Liu, Xiaodong
    Wang, Miao
    ADVANCES IN MULTIMEDIA, 2020, 2020
  • [2] Regional Attention Networks with Context-aware Fusion for Group Emotion Recognition
    Khan, Ahmed Shehab
    Li, Zhiyuan
    Cai, Jie
    Tong, Yan
    2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2021), 2021, : 1149 - 1158
  • [3] Speech Emotion Recognition using Context-Aware Dilated Convolution Network
    Kakuba, Samuel
    Han, Dong Seog
    2022 27TH ASIA PACIFIC CONFERENCE ON COMMUNICATIONS (APCC 2022): CREATING INNOVATIVE COMMUNICATION TECHNOLOGIES FOR POST-PANDEMIC ERA, 2022, : 601 - 604
  • [4] Context-Aware Emotion Recognition Networks
    Lee, Jiyoung
    Kim, Seungryong
    Kim, Sunok
    Park, Jungin
    Sohn, Kwanghoon
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 10142 - 10151
  • [5] Speech emotion recognition with embedded attention mechanism and hierarchical context
    Cheng Y.
    Chen Y.
    Chen Y.
    Yang Y.
    Harbin Gongye Daxue Xuebao/Journal of Harbin Institute of Technology, 2019, 51 (11): : 100 - 107
  • [6] Context-aware Cascade Attention-based RNN for Video Emotion Recognition
    Sun, Man-Chin
    Hsu, Shih-Huan
    Yang, Min-Chun
    Chien, Jen-Hsien
    2018 FIRST ASIAN CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION (ACII ASIA), 2018,
  • [7] Context-aware Multimodal Fusion for Emotion Recognition
    Li, Jinchao
    Wang, Shuai
    Chao, Yang
    Liu, Xunying
    Meng, Helen
    INTERSPEECH 2022, 2022, : 2013 - 2017
  • [8] VISUAL FEATURES FOR CONTEXT-AWARE SPEECH RECOGNITION
    Gupta, Abhinav
    Miao, Yajie
    Neves, Leonardo
    Metze, Florian
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 5020 - 5024
  • [9] Context-aware attention network for image recognition
    Jiaxu Leng
    Ying Liu
    Shang Chen
    Neural Computing and Applications, 2019, 31 : 9295 - 9305
  • [10] Context-aware attention network for image recognition
    Leng, Jiaxu
    Liu, Ying
    Chen, Shang
    NEURAL COMPUTING & APPLICATIONS, 2019, 31 (12): : 9295 - 9305