Speech Emotion Classification on a Riemannian Manifold

被引：0

作者：

Ye, Chengxi ^{[1
]}

Liu, Jia ^{[1
]}

Chen, Chun ^{[1
]}

Song, Mingli ^{[1
]}

Bu, Jiajun ^{[1
]}

机构：

[1] Zhejiang Univ, Coll Comp Sci, Hangzhou 310027, Peoples R China

来源：

ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2008, 9TH PACIFIC RIM CONFERENCE ON MULTIMEDIA | 2008年 / 5353卷

关键词：

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

We present a novel algorithm for speech emotion classification. In contrast to previous methods, we additionally consider the relations between simple features by incorporating covariance matrices as the new feature descriptors. Since non-singular covariance matrices do not lie on a linear space, we endow the space with an affine invariance metric and render it into a Riemannian manifold. After that we use the tangent space to approximate the manifold. Classification is performed in the tangent space and a generalized principal component analysis is presented. We test the algorithm on speech emotion classification and the experiment results show an improvement at around 13%(+3% with PCA) in recognition accuracy. Based on that we are able to train one simple model to accurately differentiate the emotions from both genders.

引用

页码：61 / 69

页数：9

共 22 条

[1] [Anonymous], P EUR
[2] BEZOOIJEN RV, 1984, CHARACTERISITCS RECO
[3] Carmo M. P. D., 1976, DIFFERENTIAL GEOMETR
[4] CHATEAU N, 2004, P INTERSPEECH ICSLP, P885
[5] Emotion recognition using acoustic features and textual content
Chuang, ZJ
Wu, CH
[J]. 2004 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXP (ICME), VOLS 1-3, 2004, : 53 - 56
[6] Emotion recognition in human-computer interaction
Cowie, R
Douglas-Cowie, E
Tsapatsoulis, N
Votsis, G
Kollias, S
Fellenz, W
Taylor, JG
[J]. IEEE SIGNAL PROCESSING MAGAZINE, 2001, 18 (01) : 32 - 80
[7] Riemannian geometry for the statistical analysis of diffusion tensor data
Fletcher, P. Thomas
Joshi, Sarang
[J]. SIGNAL PROCESSING, 2007, 87 (02) : 250 - 262
[8] PERCEPTUAL LINEAR PREDICTIVE (PLP) ANALYSIS OF SPEECH
HERMANSKY, H
[J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1990, 87 (04) : 1738 - 1752
[9] LINE SPECTRUM REPRESENTATION OF LINEAR PREDICTOR COEFFICIENTS OF SPEECH SIGNALS
ITAKURA, F
[J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1975, 57 : S35 - S35
[10] Speech recognition by machines and humans
Lippmann, RP
[J]. SPEECH COMMUNICATION, 1997, 22 (01) : 1 - 15

← 1 2 3 →