Multi-stage classification of emotional speech motivated by a dimensional emotion model

被引:21
作者
Xiao, Zhongzhe [1 ]
Dellandrea, Emmanuel [1 ]
Dou, Weibei [2 ]
Chen, Liming [1 ]
机构
[1] Univ Lyon, LIRIS Lab, UMR5205, CNRS,Ecole Cent Lyon, F-69134 Ecully, France
[2] Tsinghua Univ, Tsinghua Natl Lab Informat Sci & Technol, Dept Elect Engn, Beijing 100084, Peoples R China
关键词
Emotional speech; Harmonic feature; Zipf feature; Dimensional emotion model; Multi-stage classification;
D O I
10.1007/s11042-009-0319-3
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper deals with speech emotion analysis within the context of increasing awareness of the wide application potential of affective computing. Unlike most works in the literature which mainly rely on classical frequency and energy based features along with a single global classifier for emotion recognition, we propose in this paper some new harmonic and Zipf based features for better speech emotion characterization in the valence dimension and a multi-stage classification scheme driven by a dimensional emotion model for better emotional class discrimination. Experimented on the Berlin dataset with 68 features and six emotion states, our approach shows its effectiveness, displaying a 68.60% classification rate and reaching a 71.52% classification rate when a gender classification is first applied. Using the DES dataset with five emotion states, our approach achieves an 81% recognition rate when the best performance in the literature to our knowledge is 76.15% on the same dataset.
引用
收藏
页码:119 / 145
页数:27
相关论文
共 55 条
[1]  
[Anonymous], 2000, ENGLISHAND JAPANESE
[2]  
[Anonymous], P EUR SIGN PROC C
[3]  
[Anonymous], 2000, Proc. of the Speech-Emotion-2000
[4]  
[Anonymous], P ISCA WORKSH SPEECH
[5]  
[Anonymous], P ISCA WORKSH SPEECH
[6]  
[Anonymous], 1989, The Biopsychology of Mood and Arousal
[7]  
[Anonymous], 2006, Pattern recognition and machine learning
[8]   PATTERN-RECOGNITION APPROACH TO VOICED UNVOICED SILENCE CLASSIFICATION WITH APPLICATIONS TO SPEECH RECOGNITION [J].
ATAL, BS ;
RABINER, LR .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1976, 24 (03) :201-212
[9]   Acoustic profiles in vocal emotion expression [J].
Banse, R ;
Scherer, KR .
JOURNAL OF PERSONALITY AND SOCIAL PSYCHOLOGY, 1996, 70 (03) :614-636
[10]  
Bellman R., 1961, Adaptive Control Processes: A Guided Tour, DOI DOI 10.1515/9781400874668