Emotion Recognition using Imperfect Speech Recognition

Cited by: 0

Authors:

Metze, Florian [1]

Batliner, Anton [4]

Eyben, Florian [2]

Polzehl, Tim [3]

Schuller, Bjoern [2]

Steidl, Stefan [4]

Affiliations:

[1] Carnegie Mellon Univ, Language Technol Inst, Pittsburgh, PA 15213 USA

[2] Tech Univ Munich, Inst Human Machine Commun, Munich, Germany

[3] Tech Univ Berlin, Qual & Usabil Lab, Berlin, Germany

[4] Friedrich Alexander Univ, Lehrstuhl Mustererkennung, Erlangen, Germany

Source:

11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2 | 2010

Keywords:

speech-to-text; emotion detection; meta-data extraction; rich transcription; children's speech

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial Intelligence Theory]

Subject classification codes:

081104; 0812; 0835; 1405

Abstract:

This paper investigates the use of speech-to-text methods for assigning an emotion class to a given speech utterance. Previous work shows that emotion extracted from text can provide evidence complementary to the information extracted by classifiers based on spectral or other non-linguistic features. As speech-to-text usually requires significantly more computational effort, in this study we investigate the degree of speech-to-text accuracy needed for reliable detection of emotions from an automatically generated transcription of an utterance. We evaluate the use of hypotheses in both training and testing, and compare several classification approaches on the same task. Our results show that emotion recognition performance stays roughly constant as long as word accuracy does not fall below a reasonable value, making the use of speech-to-text viable for training emotion classifiers based on linguistics.

Pages: 478 / +

Page count: 2
