Automatic Recognition of Anger in Spontaneous Speech

被引:0
作者
Neiberg, Daniel [1 ]
Elenius, Kjell [1 ]
机构
[1] KTH, Royal Inst Technol, CSC, Ctr Speech Technol, Stockholm, Sweden
来源
INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5 | 2008年
关键词
spontaneous speech; natural emotions; anger;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Automatic detection of real life negative emotions in speech has been evaluated using Linear Discriminant Analysis, LDA, with "classic" emotion features and a classifier based on Gaussian Mixture Models, GMMs. The latter uses Mel-Frequency Cepstral Coefficients, MFCCs, from a filter bank covering the 300-3400 Hz region to capture spectral shape and formants, and another in the 20-600 Hz region to capture prosody. Both classifiers have been tested on an extensive corpus from Swedish voice controlled telephone services. The results indicate that it is possible to detect anger with reasonable accuracy (average recall 83%) in natural speech and that the GMM method performed better than the LDA one.
引用
收藏
页码:2755 / 2758
页数:4
相关论文
共 18 条
  • [1] [Anonymous], ICSLP
  • [2] [Anonymous], 2000, JOVONK, V2
  • [3] [Anonymous], P INT PITTSB PA US
  • [4] How to find trouble in communication
    Batliner, A
    Fischer, K
    Huber, R
    Spilker, J
    Nöth, E
    [J]. SPEECH COMMUNICATION, 2003, 40 (1-2) : 117 - 143
  • [5] Boersma P, Praat: doing phonetics by computer (Version 6.0.14)
  • [6] BURKHARDT F, 2006, P 9 INT C SPOK LANG
  • [7] EDLUND J, SPEECH COMM IN PRESS
  • [8] AN ARGUMENT FOR BASIC EMOTIONS
    EKMAN, P
    [J]. COGNITION & EMOTION, 1992, 6 (3-4) : 169 - 200
  • [9] FORSELL M, 2007, P FON TMH QPSR, V50, P37
  • [10] LAUKKA P, 2008, P LREC WORKSH CORP R