Speech Emotion Classification on a Riemannian Manifold

被引:0
作者
Ye, Chengxi [1 ]
Liu, Jia [1 ]
Chen, Chun [1 ]
Song, Mingli [1 ]
Bu, Jiajun [1 ]
机构
[1] Zhejiang Univ, Coll Comp Sci, Hangzhou 310027, Peoples R China
来源
ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2008, 9TH PACIFIC RIM CONFERENCE ON MULTIMEDIA | 2008年 / 5353卷
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We present a novel algorithm for speech emotion classification. In contrast to previous methods, we additionally consider the relations between simple features by incorporating covariance matrices as the new feature descriptors. Since non-singular covariance matrices do not lie on a linear space, we endow the space with an affine invariance metric and render it into a Riemannian manifold. After that we use the tangent space to approximate the manifold. Classification is performed in the tangent space and a generalized principal component analysis is presented. We test the algorithm on speech emotion classification and the experiment results show an improvement at around 13%(+3% with PCA) in recognition accuracy. Based on that we are able to train one simple model to accurately differentiate the emotions from both genders.
引用
收藏
页码:61 / 69
页数:9
相关论文
共 22 条
  • [1] [Anonymous], P EUR
  • [2] BEZOOIJEN RV, 1984, CHARACTERISITCS RECO
  • [3] Carmo M. P. D., 1976, DIFFERENTIAL GEOMETR
  • [4] CHATEAU N, 2004, P INTERSPEECH ICSLP, P885
  • [5] Emotion recognition using acoustic features and textual content
    Chuang, ZJ
    Wu, CH
    [J]. 2004 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXP (ICME), VOLS 1-3, 2004, : 53 - 56
  • [6] Emotion recognition in human-computer interaction
    Cowie, R
    Douglas-Cowie, E
    Tsapatsoulis, N
    Votsis, G
    Kollias, S
    Fellenz, W
    Taylor, JG
    [J]. IEEE SIGNAL PROCESSING MAGAZINE, 2001, 18 (01) : 32 - 80
  • [7] Riemannian geometry for the statistical analysis of diffusion tensor data
    Fletcher, P. Thomas
    Joshi, Sarang
    [J]. SIGNAL PROCESSING, 2007, 87 (02) : 250 - 262
  • [8] PERCEPTUAL LINEAR PREDICTIVE (PLP) ANALYSIS OF SPEECH
    HERMANSKY, H
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1990, 87 (04) : 1738 - 1752
  • [9] LINE SPECTRUM REPRESENTATION OF LINEAR PREDICTOR COEFFICIENTS OF SPEECH SIGNALS
    ITAKURA, F
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1975, 57 : S35 - S35
  • [10] Speech recognition by machines and humans
    Lippmann, RP
    [J]. SPEECH COMMUNICATION, 1997, 22 (01) : 1 - 15