Recognition of Nonprototypical Emotions in Reverberated and Noisy Speech by Nonnegative Matrix Factorization

被引:18
作者
Weninger, Felix [1 ]
Schuller, Bjoern [1 ]
Batliner, Anton [2 ]
Steidl, Stefan [2 ]
Seppi, Dino [3 ]
机构
[1] Tech Univ Munich, Lehrstuhl Mensch Maschine Kommunikat, D-80290 Munich, Germany
[2] Univ Erlangen Nurnberg, Mustererkennung Lab, D-91058 Erlangen, Germany
[3] Katholieke Univ Leuven, ESAT, B-3001 Louvain, Belgium
关键词
Feature Extraction; Speech Recognition; Emotion Recognition; Automatic Speech Recognition; Test Scenario;
D O I
10.1155/2011/838790
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
We present a comprehensive study on the effect of reverberation and background noise on the recognition of nonprototypical emotions from speech. We carry out our evaluation on a single, well-defined task based on the FAU Aibo Emotion Corpus consisting of spontaneous children's speech, which was used in the INTERSPEECH 2009 Emotion Challenge, the first of its kind. Based on the challenge task, and relying on well-proven methodologies from the speech recognition domain, we derive test scenarios with realistic noise and reverberation conditions, including matched as well as mismatched condition training. As feature extraction based on supervised Nonnegative Matrix Factorization (NMF) has been proposed in automatic speech recognition for enhanced robustness, we introduce and evaluate different kinds of NMF-based features for emotion recognition. We conclude that NMF features can significantly contribute to the robustness of state-of-the-art emotion recognition engines in practical application scenarios where different noise and reverberation conditions have to be faced.
引用
收藏
页数:16
相关论文
共 51 条
[1]  
[Anonymous], SPEECH COMM IN PRESS
[2]  
[Anonymous], P SPEECH PROS DRESD
[3]  
[Anonymous], P EUR SIGN PROC C EU
[4]  
[Anonymous], THESIS FAU ERLANGEN
[5]  
[Anonymous], PROCC IEEE INT WORKS
[6]  
[Anonymous], 2000, P AUT SPEECH REC CHA
[7]  
[Anonymous], P 16 INT C NEUR INF
[8]  
[Anonymous], P IEEE INT C AC SPEE
[9]  
[Anonymous], P INTERSPEECH BRISB
[10]  
[Anonymous], P INT WORKSH IND COM