Mandarin emotion recognition combining acoustic and emotional point information

被引:18
作者
Chen, Lijiang [1 ]
Mao, Xia [1 ]
Wei, Pengfei [1 ]
Xue, Yuli [1 ]
Ishizuka, Mitsuru [2 ]
机构
[1] Beihang Univ, Sch Elect & Informat Engn, Beijing, Peoples R China
[2] Univ Tokyo, Dept Informat & Commun Engn, Tokyo, Japan
关键词
Mandarin emotion recognition; Emotional point; Fisher rate; Support vector machine; Hidden Markov model; FEATURE VECTOR NORMALIZATION; SPEECH RECOGNITION; PROFILES; FEATURES; SYSTEMS;
D O I
10.1007/s10489-012-0352-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this contribution, we introduce a novel approach to combine acoustic information and emotional point information for a robust automatic recognition of a speaker's emotion. Six discrete emotional states are recognized in the work. Firstly, a multi-level model for emotion recognition by acoustic features is presented. The derived features are selected by fisher rate to distinguish different types of emotions. Secondly, a novel emotional point model for Mandarin is established by Support Vector Machine and Hidden Markov Model. This model contains 28 emotional syllables which reflect rich emotional information. Finally the acoustic information and emotional point information are integrated by a soft decision strategy. Experimental results show that the application of emotional point information in speech emotion recognition is effective.
引用
收藏
页码:602 / 612
页数:11
相关论文
共 43 条
[21]   Speech Emotion Recognition Based on Parametric Filter and Fractal Dimension [J].
Mao, Xia ;
Chen, Lijiang .
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2010, E93D (08) :2324-2326
[22]  
Mehrabian A., 1974, APPROACH ENV PSYCHOL, V12
[23]  
Nogueiras A., 2001, Proceedings of INTERSPEECH, P2679, DOI DOI 10.21437/EUROSPEECH.2001-627
[24]   PARALLEL ALGORITHMS FOR HIERARCHICAL-CLUSTERING [J].
OLSON, CF .
PARALLEL COMPUTING, 1995, 21 (08) :1313-1325
[25]  
Petrushin ValeryA., 2000, PROC ICSLP 2000, P222
[26]  
Picard R. W., 2000, Affective computing
[27]  
Plutchik R., 1980, EMOTION PSYCHOEVOLUT
[28]   FLOATING SEARCH METHODS IN FEATURE-SELECTION [J].
PUDIL, P ;
NOVOVICOVA, J ;
KITTLER, J .
PATTERN RECOGNITION LETTERS, 1994, 15 (11) :1119-1125
[29]   Vocal communication of emotion: A review of research paradigms [J].
Scherer, KR .
SPEECH COMMUNICATION, 2003, 40 (1-2) :227-256
[30]  
Schuller B, 2005, 2005 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), VOLS 1 AND 2, P865