Speaker normalisation for speech-based emotion detection

被引:32
|
作者
Sethu, Vidhyasaharan [1 ,2 ]
Ambikairajah, Eliathainby [1 ,2 ]
Epps, Julien [1 ,3 ]
机构
[1] Univ New S Wales, Sch Elect Engn & Telecommun, Sydney, NSW 2052, Australia
[2] NICTA, Sydney, NSW, Australia
[3] UNSW Asia, Singapore 248922, Singapore
来源
PROCEEDINGS OF THE 2007 15TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING | 2007年
关键词
feature warping; cumulative distribution mapping; emotion detection; hidden Markov model;
D O I
10.1109/ICDSP.2007.4288656
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The focus of this paper is on speech-based emotion detection utilising only acoustic data, i.e. without using any linguistic or semantic information. However, this approach in general Suffers from the fact that acoustic data is speaker-dependent, and can result in inefficient estimation of the statistics modelled by classifiers such as hidden Markov models (HMMs) and Gaussian mixture models (GMMs). We propose the use of speaker-specific feature warping as a means of normalising acoustic features to overcome the problem of speaker dependency. In this paper we compare the performance of a system that uses feature warping to one that does not, The back-end employs ail HMM-based classifier that captures the temporal variations of the feature vectors by modelling them as transitions between different states. Evaluations conducted oil the LDC Emotional Prosody speech corpus reveal a relative increase in classification accuracy of up to 20%.
引用
收藏
页码:611 / +
页数:2
相关论文
共 50 条
  • [1] Speech-Based Techniques for Emotion Detection in Natural Arabic Audio Files
    Kaloub, Ashraf
    Elgabar, Eltyeb Abed
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2025, 22 (01) : 139 - 157
  • [2] Speech-based Emotion Characterization using Postures and Gestures in CVEs
    Amarakeerthi, Senaka
    Ranaweera, Rasika
    Cohen, Michael
    2010 INTERNATIONAL CONFERENCE ON CYBERWORLDS (CW 2010), 2010, : 72 - 76
  • [3] Excitation Features of Speech for Speaker-Specific Emotion Detection
    Kadiri, Sudarsana Reddy
    Alku, Paavo
    IEEE ACCESS, 2020, 8 (08): : 60382 - 60391
  • [4] Utility indicator for emotion detection in a speaker authentication system
    van Rensburg, Ebenhaeser Otto Janse
    Botha, Reinhardt A.
    von Solms, Rossouw
    INFORMATION AND COMPUTER SECURITY, 2022, 30 (05) : 672 - 686
  • [5] Speech based emotion classification
    Nwe, TL
    Wei, FS
    De Silva, LC
    IEEE REGION 10 INTERNATIONAL CONFERENCE ON ELECTRICAL AND ELECTRONIC TECHNOLOGY, VOLS 1 AND 2, 2001, : 297 - 301
  • [6] Shape-based modeling of the fundamental frequency contour for emotion detection in speech
    Arias, Juan Pablo
    Busso, Carlos
    Yoma, Nestor Becerra
    COMPUTER SPEECH AND LANGUAGE, 2014, 28 (01) : 278 - 294
  • [7] Machine Learning in Human Emotion Detection from the Speech
    Qiu, Xiaoli
    Li, Wei
    Li, Yang
    Gu, Hongmei
    Song, Fei
    Sabitha, R.
    JOURNAL OF INTERCONNECTION NETWORKS, 2022, 22 (SUPP04)
  • [8] COMPREHENSIVE STUDY FOR EMOTION DETECTION USING SPEECH DIALOGUE
    Yadav, Rajat
    Mishra, Anurag
    ADVANCES AND APPLICATIONS IN MATHEMATICAL SCIENCES, 2021, 20 (03): : 437 - 444
  • [9] Speech Emotion Recognition Based on Dynamic Models
    Lv, Guoyun
    Hu, Shuixian
    Lu, Xipan
    2014 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING (ICALIP), VOLS 1-2, 2014, : 480 - 484
  • [10] Speech emotion recognition based on HMM and SVM
    Lin, YL
    Wei, G
    PROCEEDINGS OF 2005 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-9, 2005, : 4898 - 4901