Emotion recognition from speech signals using new harmony features

Cited by: 102
Authors
Yang, B. [1]
Lugger, M. [1]
Affiliations
[1] Univ Stuttgart, Chair Syst Theory & Signal Proc, D-70550 Stuttgart, Germany
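Keywords
Emotion recognition; Feature extraction; Harmony features; Pitch interval
DOI
10.1016/j.sigpro.2009.09.009
Chinese Library Classification
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification codes
0808; 0809
Abstract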
In this paper we propose a new set of harmony features for automatic emotion recognition from speech signals. They are based on the psychoacoustic perception of harmony known from music theory. Starting from the estimated pitch contour of an utterance, we calculate the circular autocorrelation of the pitch histogram on the logarithmic semitone scale. This autocorrelation measures how often different two-pitch intervals occur, each of which creates a consonant or dissonant impression. Emotion recognition experiments that use these harmony parameters in addition to state-of-the-art features show improved recognition performance. (C) 2009 Elsevier B.V. All rights reserved.
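The pipeline named in the abstract (pitch contour, semitone histogram, circular autocorrelation) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the reference frequency `f_ref`, the folding of all pitches into a single octave, and the one-bin-per-semitone resolution are assumptions made here for illustration, and the pitch contour is assumed to be already estimated, with unvoiced frames marked as 0.

```python
import numpy as np

def semitone_histogram(f0_hz, f_ref=55.0, bins_per_semitone=1):
    """Normalized pitch histogram on a log (semitone) scale.

    Assumptions (not from the paper): pitches are folded into one
    octave, f_ref is an arbitrary reference, frames with f0 <= 0
    are treated as unvoiced and skipped.
    """
    f0 = np.asarray(f0_hz, dtype=float)
    voiced = f0[f0 > 0]
    n_bins = 12 * bins_per_semitone
    if voiced.size == 0:
        return np.zeros(n_bins)
    # Distance from the reference in semitones, folded into [0, 12).
    semitones = np.mod(12.0 * np.log2(voiced / f_ref), 12.0)
    # Round to the nearest bin and wrap the top edge back to bin 0.
    idx = np.floor(semitones * bins_per_semitone + 0.5).astype(int) % n_bins
    hist = np.bincount(idx, minlength=n_bins).astype(float)
    return hist / hist.sum()

def circular_autocorrelation(hist):
    """r[k] = sum_n hist[n] * hist[(n + k) mod N]."""
    hist = np.asarray(hist, dtype=float)
    return np.array([np.dot(hist, np.roll(hist, -k)) for k in range(len(hist))])

# Toy contour: alternating 220 Hz and 330 Hz (roughly a perfect fifth),
# with a few unvoiced frames in between.
contour = [220.0, 330.0, 0.0, 220.0, 330.0, 0.0]
hist = semitone_histogram(contour)
r = circular_autocorrelation(hist)
```

In this toy case the autocorrelation has its off-zero mass at lags of 7 and 5 semitones (a fifth and its octave complement), which is the kind of interval-occurrence measure the abstract describes. Features derived from `r` would then be added to a conventional feature set; how the paper does that is not stated in this record.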
Pages: 1415-1423
Number of pages: 9