Application of Emotion Recognition and Modification for Emotional Telugu Speech Recognition

被引：0

作者：

Vishnu Vidyadhara Raju Vegesna

Krishna Gurugubelli

Anil Kumar Vuppala

机构：

[1] KCIS,Speech Processing Lab

[2] International Institute of Information Technology,undefined

[3] Hyderabad (IIIT-H),undefined

来源：

Mobile Networks and Applications | 2019年 / 24卷

关键词：

ASR; Emotion recognition; Emotive speech;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Majority of the automatic speech recognition systems (ASR) are trained with neutral speech and the performance of these systems are affected due to the presence of emotional content in the speech. The recognition of these emotions in human speech is considered to be the crucial aspect of human-machine interaction. The combined spectral and differenced prosody features are considered for the task of the emotion recognition in the first stage. The task of emotion recognition does not serve the sole purpose of improvement in the performance of an ASR system. Based on the recognized emotions from the input speech, the corresponding adapted emotive ASR model is selected for the evaluation in the second stage. This adapted emotive ASR model is built using the existing neutral and synthetically generated emotive speech using prosody modification method. In this work, the importance of emotion recognition block at the front-end along with the emotive speech adaptation to the ASR system models were studied. The speech samples from IIIT-H Telugu speech corpus were considered for building the large vocabulary ASR systems. The emotional speech samples from IITKGP-SESC Telugu corpus were used for the evaluation. The adapted emotive speech models have yielded better performance over the existing neutral speech models.

引用

页码：193 / 201

页数：8

共 50 条

[41] Speech emotion recognition in acted and spontaneous context
Chenchah, Farah
Lachiri, Zied
6TH INTERNATIONAL CONFERENCE ON INTELLIGENT HUMAN COMPUTER INTERACTION, IHCI 2014, 2014, 39 : 139 - 145
[42] Speech emotion recognition: Features and classification models
Chen, Lijiang
Mao, Xia
Xue, Yuli
Cheng, Lee Lung
DIGITAL SIGNAL PROCESSING, 2012, 22 (06) : 1154 - 1160
[43] SPEECH EMOTION RECOGNITION WITH ACOUSTIC AND LEXICAL FEATURES
Jin, Qin
Li, Chengxin
Chen, Shizhe
Wu, Huimin
2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 4749 - 4753
[44] Statistical Evaluation of Speech Features for Emotion Recognition
Iliou, Theodoros
Anagnostopoulos, Christos-Nikolaos
ICDT: 2009 FOURTH INTERNATIONAL CONFERENCE ON DIGITAL TELECOMMUNICATIONS, 2009, : 121 - 126
[45] SPEECH EMOTION RECOGNITION USING SEMANTIC INFORMATION
Tzirakis, Panagiotis
Anh Nguyen
Zafeiriou, Stefanos
Schuller, Bjoern W.
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6279 - 6283
[46] On the use of speech parameter contours for emotion recognition
Vidhyasaharan Sethu
Eliathamby Ambikairajah
Julien Epps
EURASIP Journal on Audio, Speech, and Music Processing, 2013
[47] Emotion Recognition in Speech of Parents of Depressed Adolescents
He, Ling
Lech, Margaret
Maddage, Namunu
Allen, Nicholas
2009 3RD INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICAL ENGINEERING, VOLS 1-11, 2009, : 2696 - +
[48] Deep ganitrus algorithm for speech emotion recognition
Shukla, Shilpi
Jain, Madhu
JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2022, 43 (05) : 5353 - 5368
[49] Speech emotion recognition based on HMM and SVM
Lin, YL
Wei, G
PROCEEDINGS OF 2005 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-9, 2005, : 4898 - 4901
[50] Anchor Models for Emotion Recognition from Speech
Attabi, Yazid
Dumouchel, Pierre
IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2013, 4 (03) : 280 - 290

← 1 2 3 4 5 →