Spontaneous Speech Emotion Recognition using Prior Knowledge

Authors
Chakraborty, Rupayan [1]
Pandharipande, Meghna [1]
Kopparapu, Sunil Kumar [1]
Affiliations
[1] TCS Innovat Labs Mumbai, Yantra Pk, Thane West 400601, India
Keywords
Emotion recognition; knowledge-based framework; spontaneous speech; non-acted emotion; call center audio; CLASSIFICATION;
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
Automatic and spontaneous speech emotion recognition is an important part of a human-computer interactive system. However, emotion identification in spontaneous speech is difficult because the emotions expressed by the speaker are often not as prominent as in acted speech. In this paper, we propose a spontaneous speech emotion recognition framework that makes use of the associated knowledge. The framework is motivated by the observation that there is significant disagreement amongst human annotators when they annotate spontaneous speech; the disagreement is largely reduced when they are provided with additional knowledge related to the conversation. The proposed framework makes use of the contexts (derived from linguistic content) and knowledge of the time lapse of the spoken utterances within an audio call to reliably recognize the current emotion of the speaker in spontaneous audio conversations. Our experimental results demonstrate a significant improvement in the performance of spontaneous speech emotion recognition using the proposed framework.
Pages: 2866-2871
Page count: 6
Related papers
  • [1] Emotion Recognition in Spontaneous Speech Using GMMs
    Neiberg, Daniel
    Elenius, Kjell
    Laskowski, Kornel
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 809 - +
  • [2] Knowledge-based framework for intelligent emotion recognition in spontaneous speech
    Chakraborty, Rupayan
    Pandharipande, Meghna
    Kopparapu, Sunil Kumar
    KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS: PROCEEDINGS OF THE 20TH INTERNATIONAL CONFERENCE KES-2016, 2016, 96 : 587 - 596
  • [3] Pronunciation modeling for spontaneous speech recognition using latent pronunciation analysis (LPA) and prior knowledge
    Lin, Che-Kuang
    Lee, Lin-Shan
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 673 - +
  • [4] Emotion Recognition from Spontaneous Slavic Speech
    Atassi, Hicham
    Smekal, Zdenek
    Esposito, Anna
    3RD IEEE INTERNATIONAL CONFERENCE ON COGNITIVE INFOCOMMUNICATIONS (COGINFOCOM 2012), 2012, : 389 - 394
  • [5] Speech emotion recognition in acted and spontaneous context
    Chenchah, Farah
    Lachiri, Zied
    6TH INTERNATIONAL CONFERENCE ON INTELLIGENT HUMAN COMPUTER INTERACTION, IHCI 2014, 2014, 39 : 139 - 145
  • [6] Spontaneous Speech Emotion Recognition Using Multiscale Deep Convolutional LSTM
    Zhang, Shiqing
    Zhao, Xiaoming
    Tian, Qi
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2022, 13 (02) : 680 - 688
  • [7] CUBIC KNOWLEDGE DISTILLATION FOR SPEECH EMOTION RECOGNITION
    Lou, Zhibo
    Otake, Shinta
    Li, Zhengxiao
    Kawakami, Rei
    Inoue, Nakamasa
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 5705 - 5709
  • [8] Using pitch as prior knowledge in template-based speech recognition
    Aradilla, Guillermo
    Vepa, Jithendra
    Bourlard, Herve
    2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006, : 445 - 448
  • [9] Emotion Recognition from Spontaneous Tunisian Dialect Speech
    Nasr, Latifa Ibn
    Masmoudi, Abir
    Belguith, Lamia Hadrich
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2025, 24 (02)
  • [10] Emotion Recognition using Imperfect Speech Recognition
    Metze, Florian
    Batliner, Anton
    Eyben, Florian
    Polzehl, Tim
    Schuller, Bjoern
    Steidl, Stefan
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 478 - +