Recognition of Emotion Intensity Basing on Neutral Speech Model

被引：2

作者：

Kaminska, Dorota ^{[1
]}

Sapinski, Tomasz ^{[1
]}

Pelikant, Adam ^{[1
]}

机构：

[1] Tech Univ Lodz, Inst Mechatron & Informat Syst, PL-90924 Lodz, Poland

来源：

MAN-MACHINE INTERACTIONS 3 | 2014年 / 242卷

关键词：

emotion recognition; signal processing; Plutchik's model; emotion classication; EVOLUTION;

D O I：

10.1007/978-3-319-02309-0_49

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Research in emotional speech recognition is generally focused on analysis of a set of primary emotions. However it is clear that spontaneous speech, which is more intricate comparing to acted out utterances, carries information about emotional complexity or degree of their intensity. This paper refers to the theory of Robert Plutchik, who suggested the existence of eight primary emotions. All other states are derivatives and occur as combinations, mixtures or compounds of the primary emotions. During the analysis Polish spontaneous speech database containing manually created confidence labels was implemented as a training and testing set. Classification results of four primary emotions (anger, fear, joy, sadness) and their intensities have been presented. The level of intensity is determined basing on the similarity of particular emotion to neutral speech. Studies have been conducted using prosodic features and perceptual coefficients. Results have shown that the proposed measure is effective in recognition of intensity of the predicted emotion.

引用

页码：451 / 458

页数：8

共 20 条

[1]

[Anonymous], 2012, TENCON IEEE REG 10 C

[2]

[Anonymous], COMMUNICATIONS CONTR

[3]

[Anonymous], 2010, ROZPOZNAWANIE BIOMET

[4]

Attabi Y., 2012, 2012 11th International Conference on Information Sciences, Signal Processing and their Applications (ISSPA), P126, DOI 10.1109/ISSPA.2012.6310487

[5]

Bojanic M., 2012, 2012 11th Symposium on Neural Network Applications in Electrical Engineering (NEUREL 2012). Proceedings, P223, DOI 10.1109/NEUREL.2012.6420016

[6]

Burkhardt F., 2005, INTERSPEECH, V5, P1517, DOI DOI 10.21437/INTERSPEECH.2005-446

[7]

Christina I.J., 2012, P INT C COMP EL EL T, P723

[8]

DENG J, 2012, P SPEECH COMM 10 ITG, V10, P1

[9] Dimensionality Reduction for Emotional Speech Recognition [J].

Fewzee, Pouria ;

Karray, Fakhri .

PROCEEDINGS OF 2012 ASE/IEEE INTERNATIONAL CONFERENCE ON PRIVACY, SECURITY, RISK AND TRUST AND 2012 ASE/IEEE INTERNATIONAL CONFERENCE ON SOCIAL COMPUTING (SOCIALCOM/PASSAT 2012), 2012, :532-537

[10]

Garay Nestor., 2006, Human technology, V2, P55, DOI [10.17011/ht/urn.2006159, DOI 10.17011/ht/urn.2006159]

← 1 2 →