Speaker normalisation for speech-based emotion detection

被引：32

作者：

Sethu, Vidhyasaharan ^{[1
,2
]}

Ambikairajah, Eliathainby ^{[1
,2
]}

Epps, Julien ^{[1
,3
]}

机构：

[1] Univ New S Wales, Sch Elect Engn & Telecommun, Sydney, NSW 2052, Australia

[2] NICTA, Sydney, NSW, Australia

[3] UNSW Asia, Singapore 248922, Singapore

来源：

PROCEEDINGS OF THE 2007 15TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING | 2007年

关键词：

feature warping; cumulative distribution mapping; emotion detection; hidden Markov model;

D O I：

10.1109/ICDSP.2007.4288656

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

The focus of this paper is on speech-based emotion detection utilising only acoustic data, i.e. without using any linguistic or semantic information. However, this approach in general Suffers from the fact that acoustic data is speaker-dependent, and can result in inefficient estimation of the statistics modelled by classifiers such as hidden Markov models (HMMs) and Gaussian mixture models (GMMs). We propose the use of speaker-specific feature warping as a means of normalising acoustic features to overcome the problem of speaker dependency. In this paper we compare the performance of a system that uses feature warping to one that does not, The back-end employs ail HMM-based classifier that captures the temporal variations of the feature vectors by modelling them as transitions between different states. Evaluations conducted oil the LDC Emotional Prosody speech corpus reveal a relative increase in classification accuracy of up to 20%.

引用

页码：611 / +

页数：2

共 50 条

[1] Speech-Based Techniques for Emotion Detection in Natural Arabic Audio Files
Kaloub, Ashraf
Elgabar, Eltyeb Abed
INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2025, 22 (01) : 139 - 157
[2] Speech-based Emotion Characterization using Postures and Gestures in CVEs
Amarakeerthi, Senaka
Ranaweera, Rasika
Cohen, Michael
2010 INTERNATIONAL CONFERENCE ON CYBERWORLDS (CW 2010), 2010, : 72 - 76
[3] Excitation Features of Speech for Speaker-Specific Emotion Detection
Kadiri, Sudarsana Reddy
Alku, Paavo
IEEE ACCESS, 2020, 8 (08): : 60382 - 60391
[4] Utility indicator for emotion detection in a speaker authentication system
van Rensburg, Ebenhaeser Otto Janse
Botha, Reinhardt A.
von Solms, Rossouw
INFORMATION AND COMPUTER SECURITY, 2022, 30 (05) : 672 - 686
[5] Speech based emotion classification
Nwe, TL
Wei, FS
De Silva, LC
IEEE REGION 10 INTERNATIONAL CONFERENCE ON ELECTRICAL AND ELECTRONIC TECHNOLOGY, VOLS 1 AND 2, 2001, : 297 - 301
[6] Shape-based modeling of the fundamental frequency contour for emotion detection in speech
Arias, Juan Pablo
Busso, Carlos
Yoma, Nestor Becerra
COMPUTER SPEECH AND LANGUAGE, 2014, 28 (01) : 278 - 294
[7] Machine Learning in Human Emotion Detection from the Speech
Qiu, Xiaoli
Li, Wei
Li, Yang
Gu, Hongmei
Song, Fei
Sabitha, R.
JOURNAL OF INTERCONNECTION NETWORKS, 2022, 22 (SUPP04)
[8] COMPREHENSIVE STUDY FOR EMOTION DETECTION USING SPEECH DIALOGUE
Yadav, Rajat
Mishra, Anurag
ADVANCES AND APPLICATIONS IN MATHEMATICAL SCIENCES, 2021, 20 (03): : 437 - 444
[9] Speech Emotion Recognition Based on Dynamic Models
Lv, Guoyun
Hu, Shuixian
Lu, Xipan
2014 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING (ICALIP), VOLS 1-2, 2014, : 480 - 484
[10] Speech emotion recognition based on HMM and SVM
Lin, YL
Wei, G
PROCEEDINGS OF 2005 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-9, 2005, : 4898 - 4901

← 1 2 3 4 5 →