Reinforcement Learning based Data Augmentation for Noise Robust Speech Emotion Recognition

被引:0
作者
Ranjan, Sumit [1 ]
Chakraborty, Rupayan [1 ]
Kopparapu, Sunil Kumar [1 ]
机构
[1] Tata Consultancy Serv Ltd, TCS Res, Bengaluru, India
来源
INTERSPEECH 2024 | 2024年
关键词
speech emotion recognition; noise robustness; selective data augmentation; reinforcement learning;
D O I
10.21437/Interspeech.2024-921
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Speech emotion recognition (SER) is an indispensable component of any human machine interactions, and enables building empathetic voice user interfaces. Ability to accurately recognize emotion in noisy environments is important in practical scenarios when a person is interacting with a machine or an agent as in the case of a voice based call center. In this paper, we propose reinforcement learning (RL) based data augmentation technique to enable building a robust SER system. The reward function used in RL enables picking selective noises spread over different frequency bands for data augmentation. We show that the proposed RL based augmentation technique is superior to a recently proposed random selection based technique for the noise robust SER task. We use IEMOCAP dataset with four emotion classes for validating the proposed technique. Moreover, we test the noise robustness of SER system in both cross-corpus and cross-language scenarios.
引用
收藏
页码:1040 / 1044
页数:5
相关论文
共 23 条
  • [21] Dawn of the Transformer Era in Speech Emotion Recognition: Closing the Valence Gap
    Wagner, Johannes
    Triantafyllopoulos, Andreas
    Wierstorf, Hagen
    Schmitt, Maximilian
    Burkhardt, Felix
    Eyben, Florian
    Schuller, Bjoern W. W.
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (09) : 10745 - 10759
  • [22] Head Fusion: Improving the Accuracy and Robustness of Speech Emotion Recognition on the IEMOCAP and RAVDESS Dataset
    Xu, Mingke
    Zhang, Fan
    Zhang, Wei
    [J]. IEEE ACCESS, 2021, 9 : 74539 - 74549
  • [23] Yi L, 2019, ASIAPAC SIGN INFO PR, P529, DOI [10.1109/APSIPAASC47483.2019.9023347, 10.1109/apsipaasc47483.2019.9023347]