Speech Emotion Recognition using Context-Aware Dilated Convolution Network

被引:5
|
作者
Kakuba, Samuel [1 ]
Han, Dong Seog [2 ]
机构
[1] Kyungpook Natl Univ, Grad Sch Elect & Elect Engn, Daegu, South Korea
[2] Kyungpook Natl Univ, Sch Elect & Elect Engn, Daegu, South Korea
来源
2022 27TH ASIA PACIFIC CONFERENCE ON COMMUNICATIONS (APCC 2022): CREATING INNOVATIVE COMMUNICATION TECHNOLOGIES FOR POST-PANDEMIC ERA | 2022年
关键词
context-aware emotion recognition; multi-head attention; dilated convolution;
D O I
10.1109/APCC55198.2022.9943771
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Deep learning-based speech emotion recognition has been applied for social living assistance, health monitoring, authentication, and other human-to-machine interaction applications. Because of the ubiquitous nature of the applications, computationally efficient and robust speech emotion recognition models are required. The nature of the speech signal requires tracking of time steps, analyzing long-term dependencies and the contexts of the utterances as well as the spatial cues. Recurrent neural networks like long short-term memory and gated recurrent units coupled with attention mechanisms are often used to consider long-term dependencies and context in the speech signal. However, they do not take care of the spatial cues that may exist in the speech signal. Moreover, the operation of most of these systems is sequential which causes slow convergence, and sluggish training. Therefore, we propose a model that employs dilated convolutions layers in combination with hybrid attention mechanisms. The model uses multi-head attention to extract the global context in the feature representations which are fed into the bidirectional long short-term memory configured with self-attention to further handle the context and long-term dependencies. The model uses spectral and voice quality features extracted from the raw speech signals as input. The proposed model achieves comparable performance in terms of F1 score and accuracy. The proposed model's performance is also presented in terms of confusion matrices.
引用
收藏
页码:601 / 604
页数:4
相关论文
共 50 条
  • [41] Predicting length of stay in ICU and mortality with temporal dilated separable convolution and context-aware feature fusion
    Al-Dailami, Abdulrahman
    Kuang, Hulin
    Wang, Jianxin
    COMPUTERS IN BIOLOGY AND MEDICINE, 2022, 151
  • [42] A CONTEXT-AWARE SPEECH RECOGNITION AND UNDERSTANDING SYSTEM FOR AIR TRAFFIC CONTROL DOMAIN
    Oualil, Youssef
    Klakow, Dietrich
    Szaszak, Gyoergy
    Srinivasamurthy, Ajay
    Helmke, Hartmut
    Motlicek, Petr
    2017 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), 2017, : 404 - 408
  • [43] Context-aware recommender system using trust network
    Zeyneb El Yebdri
    Sidi Mohammed Benslimane
    Fedoua Lahfa
    Mahmoud Barhamgi
    Djamal Benslimane
    Computing, 2021, 103 : 1919 - 1937
  • [44] Context-aware handover using active network technology
    Wei, Qing
    Farkas, Karoly
    Prehofer, Christian
    Mendes, Paulo
    Plattner, Bernhard
    COMPUTER NETWORKS, 2006, 50 (15) : 2855 - 2872
  • [45] A Context-Aware Language Model to Improve the Speech Recognition in Air Traffic Control
    Guo, Dongyue
    Zhang, Zichen
    Fan, Peng
    Zhang, Jianwei
    Yang, Bo
    AEROSPACE, 2021, 8 (11)
  • [46] Context-aware recommender system using trust network
    El Yebdri, Zeyneb
    Benslimane, Sidi Mohammed
    Lahfa, Fedoua
    Barhamgi, Mahmoud
    Benslimane, Djamal
    COMPUTING, 2021, 103 (09) : 1919 - 1937
  • [47] Context-Aware Fall Detection Using A Bayesian Network
    Zhang, Mi
    Sawchuk, Alexander A.
    5TH ACM INTERNATIONAL WORKSHOP ON CONTEXT-AWARENESS FOR SELF-MANAGING SYSTEMS (CASEMANS 2011), 2011, : 10 - 16
  • [48] Object Recognition in a Context-Aware Application
    Vo, Tan
    Dat Tran
    Ma, Wanli
    2013 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2013,
  • [49] Context-Aware Personal Route Recognition
    Mazhelis, Oleksiy
    Zliobaite, Indre
    Pechenizkiy, Mykola
    DISCOVERY SCIENCE, 2011, 6926 : 221 - +
  • [50] Lightweight Context-Aware Activity Recognition
    Go, Byung Gill
    Khattak, Asad Masood
    Shah, Babar
    Khan, Adil Mehmood
    ADVANCED MULTIMEDIA AND UBIQUITOUS ENGINEERING: FUTURE INFORMATION TECHNOLOGY, VOL 2, 2016, 354 : 367 - 373