Application of Emotion Recognition and Modification for Emotional Telugu Speech Recognition

Cited by: 0
Authors
Vishnu Vidyadhara Raju Vegesna
Krishna Gurugubelli
Anil Kumar Vuppala
Affiliations
[1] KCIS, Speech Processing Lab
[2] International Institute of Information Technology
[3] Hyderabad (IIIT-H)
Source
Mobile Networks and Applications | 2019 / Vol. 24
Keywords
ASR; Emotion recognition; Emotive speech
DOI
Not available
Abstract
The majority of automatic speech recognition (ASR) systems are trained on neutral speech, and their performance degrades when the speech carries emotional content. Recognizing these emotions in human speech is considered a crucial aspect of human-machine interaction. In the first stage, combined spectral and differenced prosody features are used for emotion recognition. Emotion recognition by itself does not improve the performance of an ASR system. In the second stage, therefore, the adapted emotive ASR model corresponding to the emotion recognized in the input speech is selected for evaluation. This adapted emotive ASR model is built from existing neutral speech and from emotive speech generated synthetically with a prosody modification method. This work studies the importance of an emotion recognition block at the front end together with emotive speech adaptation of the ASR system models. Speech samples from the IIIT-H Telugu speech corpus were used to build the large-vocabulary ASR systems, and emotional speech samples from the IITKGP-SESC Telugu corpus were used for evaluation. The adapted emotive speech models yielded better performance than the existing neutral speech models.
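The two-stage flow described in the abstract (recognize the emotion, then pick the matching adapted ASR model) can be sketched roughly as follows. This is only an illustrative outline, not the authors' implementation: the class name `TwoStageRecognizer`, the emotion label "anger", the threshold-based classifier, and the string-returning decoders are all hypothetical placeholders standing in for a real emotion classifier and real ASR models.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Sequence, Tuple

Features = Sequence[float]           # placeholder for an utterance's feature vector
Decoder = Callable[[Features], str]  # placeholder for an ASR model's decode function


@dataclass
class TwoStageRecognizer:
    """Hypothetical wrapper: stage 1 picks an emotion, stage 2 picks a model."""
    classify_emotion: Callable[[Features], str]  # stage 1: emotion recognition
    emotive_models: Dict[str, Decoder]           # stage 2: one adapted model per emotion
    neutral_model: Decoder                       # fallback when no adapted model exists

    def transcribe(self, features: Features) -> Tuple[str, str]:
        emotion = self.classify_emotion(features)
        # Select the adapted emotive model; fall back to the neutral model.
        decoder = self.emotive_models.get(emotion, self.neutral_model)
        return emotion, decoder(features)


# Toy stand-ins (assumptions for illustration only, not real models).
recognizer = TwoStageRecognizer(
    classify_emotion=lambda f: "anger" if sum(f) / len(f) > 0.5 else "neutral",
    emotive_models={"anger": lambda f: "decoded-by-anger-model"},
    neutral_model=lambda f: "decoded-by-neutral-model",
)

print(recognizer.transcribe([0.9, 0.8]))
print(recognizer.transcribe([0.1, 0.2]))
```

The fallback to the neutral model is a design choice of this sketch, so that an utterance with an unrecognized or neutral label is still decoded; the abstract does not say how such cases are handled.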
Pages: 193-201
Number of pages: 8