Application of Emotion Recognition and Modification for Emotional Telugu Speech Recognition

被引:0
作者
Vishnu Vidyadhara Raju Vegesna
Krishna Gurugubelli
Anil Kumar Vuppala
机构
[1] KCIS,Speech Processing Lab
[2] International Institute of Information Technology,undefined
[3] Hyderabad (IIIT-H),undefined
来源
Mobile Networks and Applications | 2019年 / 24卷
关键词
ASR; Emotion recognition; Emotive speech;
D O I
暂无
中图分类号
学科分类号
摘要
Majority of the automatic speech recognition systems (ASR) are trained with neutral speech and the performance of these systems are affected due to the presence of emotional content in the speech. The recognition of these emotions in human speech is considered to be the crucial aspect of human-machine interaction. The combined spectral and differenced prosody features are considered for the task of the emotion recognition in the first stage. The task of emotion recognition does not serve the sole purpose of improvement in the performance of an ASR system. Based on the recognized emotions from the input speech, the corresponding adapted emotive ASR model is selected for the evaluation in the second stage. This adapted emotive ASR model is built using the existing neutral and synthetically generated emotive speech using prosody modification method. In this work, the importance of emotion recognition block at the front-end along with the emotive speech adaptation to the ASR system models were studied. The speech samples from IIIT-H Telugu speech corpus were considered for building the large vocabulary ASR systems. The emotional speech samples from IITKGP-SESC Telugu corpus were used for the evaluation. The adapted emotive speech models have yielded better performance over the existing neutral speech models.
引用
收藏
页码:193 / 201
页数:8
相关论文
共 50 条
  • [31] Emotion recognition in speech using neural networks
    Nicholson, J
    Takahashi, K
    Nakatsu, R
    NEURAL COMPUTING & APPLICATIONS, 2000, 9 (04) : 290 - 296
  • [32] A Comprehensive Review of Speech Emotion Recognition Systems
    Wani, Taiba Majid
    Gunawan, Teddy Surya
    Qadri, Syed Asif Ahmad
    Kartiwi, Mira
    Ambikairajah, Eliathamby
    IEEE ACCESS, 2021, 9 : 47795 - 47814
  • [33] Emotion recognition from speech - Tools and Challenges
    Al-Talabani, Abdulbasit
    Sellahewa, Harin
    Jassim, Sabah A.
    MOBILE MULTIMEDIA/IMAGE PROCESSING, SECURITY, AND APPLICATIONS 2015, 2015, 9497
  • [34] Learning Transferable Features for Speech Emotion Recognition
    Marczewski, Alison
    Veloso, Adriano
    Ziviani, Nivio
    PROCEEDINGS OF THE THEMATIC WORKSHOPS OF ACM MULTIMEDIA 2017 (THEMATIC WORKSHOPS'17), 2017, : 529 - 536
  • [35] Speech Emotion Recognition Based on Dynamic Models
    Lv, Guoyun
    Hu, Shuixian
    Lu, Xipan
    2014 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING (ICALIP), VOLS 1-2, 2014, : 480 - 484
  • [36] COPYPASTE: AN AUGMENTATION METHOD FOR SPEECH EMOTION RECOGNITION
    Pappagari, Raghavendra
    Villalba, Jesus
    Zelasko, Piotr
    Moro-Velazquez, Laureano
    Dehak, Najim
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6324 - 6328
  • [37] Fuzzy emotion recognition in natural speech dialogue
    Austermann, A
    Esau, N
    Kleinjohann, L
    Kleinjohann, B
    2005 IEEE INTERNATIONAL WORKSHOP ON ROBOT AND HUMAN INTERACTIVE COMMUNICATION (RO-MAN), 2005, : 317 - 322
  • [38] Acoustic-Prosodic Recognition of Emotion in Speech
    Montenegro, Chuchi S.
    Maravillas, Elmer A.
    2015 INTERNATIONAL CONFERENCE ON HUMANOID, NANOTECHNOLOGY, INFORMATION TECHNOLOGY,COMMUNICATION AND CONTROL, ENVIRONMENT AND MANAGEMENT (HNICEM), 2015, : 527 - +
  • [39] Novel acoustic features for speech emotion recognition
    ROH Yong-Wan
    KIM Dong-Ju
    LEE Woo-Seok
    HONG Kwang-Seok
    Science in China(Series E:Technological Sciences), 2009, (07) : 1838 - 1848
  • [40] Biologically inspired emotion recognition from speech
    Caponetti, Laura
    Buscicchio, Cosimo Alessandro
    Castellano, Giovanna
    EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2011,