Speech Emotion Recognition Using an Enhanced Kernel Isomap for Human-Robot Interaction

被引:11
作者
Zhang, Shiqing [1 ]
Zhao, Xiaoming [2 ]
Lei, Bicheng [1 ]
机构
[1] Taizhou Univ, Sch Phys & Elect Engn, Taizhou, Peoples R China
[2] Taizhou Univ, Dept Comp Sci, Taizhou, Peoples R China
基金
中国国家自然科学基金;
关键词
Speech Emotion Recognition; Nonlinear Dimensionality Reduction; Human-Robot Interaction; CLASSIFICATION;
D O I
10.5772/55403
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Speech emotion recognition is currently an active research subject and has attracted extensive interest in the science community due to its vital application to human-robot interaction. Most speech emotion recognition systems employ high-dimensional speech features, indicating human emotion expression, to improve emotion recognition performance. To effectively reduce the size of speech features, in this paper, a new nonlinear dimensionality reduction method, called 'enhanced kernel isometric mapping' (EKIsomap), is proposed and applied for speech emotion recognition in human-robot interaction. The proposed method is used to nonlinearly extract the low-dimensional discriminating embedded data representations from the original high-dimensional speech features with a striking improvement of performance on the speech emotion recognition tasks. Experimental results on the popular Berlin emotional speech corpus demonstrate the effectiveness of the proposed method.
引用
收藏
页数:7
相关论文
共 18 条
[1]  
Burkhardt F., 2005, INTERSPEECH, V5, P1517, DOI DOI 10.21437/INTERSPEECH.2005-446
[2]   Robust kernel Isomap [J].
Choi, Heeyoul ;
Choi, Seungjin .
PATTERN RECOGNITION, 2007, 40 (03) :853-862
[3]  
Jolliffe I., 2002, PRINCIPAL COMPONENT, DOI [10.1007/978-1-4757-1904-8_7, 10.1016/0169-7439(87)80084-9]
[4]   AN EXPLORATORY STUDY OF MANIFOLDS OF EMOTIONAL SPEECH [J].
Kim, Jangwon ;
Lee, Sungbok ;
Narayanan, Shrikanth S. .
2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, :5142-5145
[5]   Affective state estimation for human-robot interaction [J].
Kulic, Dana ;
Croft, Elizabeth A. .
IEEE TRANSACTIONS ON ROBOTICS, 2007, 23 (05) :991-1000
[6]   Toward detecting emotions in spoken dialogs [J].
Lee, CM ;
Narayanan, SS .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2005, 13 (02) :293-303
[7]   Feature Analysis and Evaluation for Automatic Emotion Identification in Speech [J].
Luengo, Iker ;
Navas, Eva ;
Hernaez, Inmaculada .
IEEE TRANSACTIONS ON MULTIMEDIA, 2010, 12 (06) :490-501
[8]   Feature Vector Classification based Speech Emotion Recognition for Service Robots [J].
Park, Jeong-Sik ;
Kim, Ji-Hwan ;
Oh, Yung-Hwan .
IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2009, 55 (03) :1590-1596
[9]  
Picard RW., 1997, AFFECTIVE COMPUTING
[10]   Efficient voice activity detection algorithms using long-term speech information [J].
Ramírez, J ;
Segura, JC ;
Benítez, C ;
de la Torre, A ;
Rubio, A .
SPEECH COMMUNICATION, 2004, 42 (3-4) :271-287