AN INTERACTION-AWARE ATTENTION NETWORK FOR SPEECH EMOTION RECOGNITION IN SPOKEN DIALOGS

被引:0
|
作者
Yeh, Sung-Lin [1 ]
Lin, Yun-Shao
Lee, Chi-Chun
机构
[1] Natl Tsing Hua Univ, Dept Elect Engn, Hsinchu, Taiwan
来源
2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2019年
关键词
speech emotion recognition; interaction; attention mechanism; spoken dialogs;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Obtaining robust speech emotion recognition (SER) in scenarios of spoken interactions is critical to the developments of next generation human-machine interface. Previous research has largely focused on performing SER by modeling each utterance of the dialog in isolation without considering the transactional and dependent nature of the human-human conversation. In this work, we propose an interaction-aware attention network (IAAN) that incorporate contextual information in the learned vocal representation through a novel attention mechanism. Our proposed method achieves 66.3% accuracy (7.9% over baseline methods) in four class emotion recognition and is also the current state-of-art recognition rates obtained on the benchmark database.
引用
收藏
页码:6685 / 6689
页数:5
相关论文
共 50 条
  • [41] Sparse Autoencoder with Attention Mechanism for Speech Emotion Recognition
    Sun, Ting-Wei
    Wu, An-Yeu
    2019 IEEE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE CIRCUITS AND SYSTEMS (AICAS 2019), 2019, : 146 - 149
  • [42] Multi-modal Attention for Speech Emotion Recognition
    Pan, Zexu
    Luo, Zhaojie
    Yang, Jichen
    Li, Haizhou
    INTERSPEECH 2020, 2020, : 364 - 368
  • [43] Efficient Interaction-Aware Interval Analysis of Neural Network Feedback Loops
    Jafarpour, Saber
    Harapanahalli, Akash
    Coogan, Samuel
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2024, 69 (12) : 8706 - 8721
  • [44] Speech emotion recognition for human–computer interaction
    D. Thiripurasundari
    Kishor Bhangale
    V. Aashritha
    Sisira Mondreti
    Mohanaprasad Kothandaraman
    International Journal of Speech Technology, 2024, 27 (3) : 817 - 830
  • [45] Goal-Guided and Interaction-Aware State Refinement Graph Attention Network for Multi-Agent Trajectory Prediction
    Chen, Xiaobo
    Luo, Fengbo
    Zhao, Feng
    Ye, Qiaolin
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (01): : 57 - 64
  • [46] Deep scattering network for speech emotion recognition
    Singh, Premjeet
    Saha, Goutam
    Sahidullah, Md
    29TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2021), 2021, : 131 - 135
  • [47] Speech Emotion Recognition with Hybrid Neural Network
    Wei, Chuanzheng
    Sun, Xiao
    Tian, Fang
    Ren, Fuji
    5TH INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING AND COMMUNICATIONS (BIGCOM 2019), 2019, : 298 - 302
  • [48] Graph Isomorphism Network for Speech Emotion Recognition
    Liu, Jiawang
    Wang, Haoxiang
    INTERSPEECH 2021, 2021, : 3405 - 3409
  • [49] Gender-Aware Speech Emotion Recognition in Multiple Languages
    Nicolini, Marco
    Ntalampiras, Stavros
    PATTERN RECOGNITION APPLICATIONS AND METHODS, ICPRAM 2023, 2024, 14547 : 111 - 123
  • [50] DILATED RESIDUAL NETWORK WITH MULTI-HEAD SELF-ATTENTION FOR SPEECH EMOTION RECOGNITION
    Li, Runnan
    Wu, Zhiyong
    Jia, Jia
    Zhao, Sheng
    Meng, Helen
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6675 - 6679