Application of Emotion Recognition and Modification for Emotional Telugu Speech Recognition

被引：0

作者：

Vishnu Vidyadhara Raju Vegesna

Krishna Gurugubelli

Anil Kumar Vuppala

机构：

[1] KCIS,Speech Processing Lab

[2] International Institute of Information Technology,undefined

[3] Hyderabad (IIIT-H),undefined

来源：

Mobile Networks and Applications | 2019年 / 24卷

关键词：

ASR; Emotion recognition; Emotive speech;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Majority of the automatic speech recognition systems (ASR) are trained with neutral speech and the performance of these systems are affected due to the presence of emotional content in the speech. The recognition of these emotions in human speech is considered to be the crucial aspect of human-machine interaction. The combined spectral and differenced prosody features are considered for the task of the emotion recognition in the first stage. The task of emotion recognition does not serve the sole purpose of improvement in the performance of an ASR system. Based on the recognized emotions from the input speech, the corresponding adapted emotive ASR model is selected for the evaluation in the second stage. This adapted emotive ASR model is built using the existing neutral and synthetically generated emotive speech using prosody modification method. In this work, the importance of emotion recognition block at the front-end along with the emotive speech adaptation to the ASR system models were studied. The speech samples from IIIT-H Telugu speech corpus were considered for building the large vocabulary ASR systems. The emotional speech samples from IITKGP-SESC Telugu corpus were used for the evaluation. The adapted emotive speech models have yielded better performance over the existing neutral speech models.

引用

页码：193 / 201

页数：8

共 50 条

[31] Emotion recognition in speech using neural networks
Nicholson, J
Takahashi, K
Nakatsu, R
NEURAL COMPUTING & APPLICATIONS, 2000, 9 (04) : 290 - 296
[32] A Comprehensive Review of Speech Emotion Recognition Systems
Wani, Taiba Majid
Gunawan, Teddy Surya
Qadri, Syed Asif Ahmad
Kartiwi, Mira
Ambikairajah, Eliathamby
IEEE ACCESS, 2021, 9 : 47795 - 47814
[33] Emotion recognition from speech - Tools and Challenges
Al-Talabani, Abdulbasit
Sellahewa, Harin
Jassim, Sabah A.
MOBILE MULTIMEDIA/IMAGE PROCESSING, SECURITY, AND APPLICATIONS 2015, 2015, 9497
[34] Learning Transferable Features for Speech Emotion Recognition
Marczewski, Alison
Veloso, Adriano
Ziviani, Nivio
PROCEEDINGS OF THE THEMATIC WORKSHOPS OF ACM MULTIMEDIA 2017 (THEMATIC WORKSHOPS'17), 2017, : 529 - 536
[35] Speech Emotion Recognition Based on Dynamic Models
Lv, Guoyun
Hu, Shuixian
Lu, Xipan
2014 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING (ICALIP), VOLS 1-2, 2014, : 480 - 484
[36] COPYPASTE: AN AUGMENTATION METHOD FOR SPEECH EMOTION RECOGNITION
Pappagari, Raghavendra
Villalba, Jesus
Zelasko, Piotr
Moro-Velazquez, Laureano
Dehak, Najim
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6324 - 6328
[37] Fuzzy emotion recognition in natural speech dialogue
Austermann, A
Esau, N
Kleinjohann, L
Kleinjohann, B
2005 IEEE INTERNATIONAL WORKSHOP ON ROBOT AND HUMAN INTERACTIVE COMMUNICATION (RO-MAN), 2005, : 317 - 322
[38] Acoustic-Prosodic Recognition of Emotion in Speech
Montenegro, Chuchi S.
Maravillas, Elmer A.
2015 INTERNATIONAL CONFERENCE ON HUMANOID, NANOTECHNOLOGY, INFORMATION TECHNOLOGY,COMMUNICATION AND CONTROL, ENVIRONMENT AND MANAGEMENT (HNICEM), 2015, : 527 - +
[39] Novel acoustic features for speech emotion recognition
ROH Yong-Wan
KIM Dong-Ju
LEE Woo-Seok
HONG Kwang-Seok
Science in China(Series E:Technological Sciences), 2009, (07) : 1838 - 1848
[40] Biologically inspired emotion recognition from speech
Caponetti, Laura
Buscicchio, Cosimo Alessandro
Castellano, Giovanna
EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2011,

← 1 2 3 4 5 →