Mandarin emotion recognition combining acoustic and emotional point information

被引：18

作者：

Chen, Lijiang ^{[1
]}

Mao, Xia ^{[1
]}

Wei, Pengfei ^{[1
]}

Xue, Yuli ^{[1
]}

Ishizuka, Mitsuru ^{[2
]}

机构：

[1] Beihang Univ, Sch Elect & Informat Engn, Beijing, Peoples R China

[2] Univ Tokyo, Dept Informat & Commun Engn, Tokyo, Japan

来源：

APPLIED INTELLIGENCE | 2012年 / 37卷 / 04期

关键词：

Mandarin emotion recognition; Emotional point; Fisher rate; Support vector machine; Hidden Markov model; FEATURE VECTOR NORMALIZATION; SPEECH RECOGNITION; PROFILES; FEATURES; SYSTEMS;

D O I：

10.1007/s10489-012-0352-1

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this contribution, we introduce a novel approach to combine acoustic information and emotional point information for a robust automatic recognition of a speaker's emotion. Six discrete emotional states are recognized in the work. Firstly, a multi-level model for emotion recognition by acoustic features is presented. The derived features are selected by fisher rate to distinguish different types of emotions. Secondly, a novel emotional point model for Mandarin is established by Support Vector Machine and Hidden Markov Model. This model contains 28 emotional syllables which reflect rich emotional information. Finally the acoustic information and emotional point information are integrated by a soft decision strategy. Experimental results show that the application of emotional point information in speech emotion recognition is effective.

引用

页码：602 / 612

页数：11

共 43 条

[1] Robust polynomial classifier using L 1-norm minimization [J].

Assaleh, K. ;

Shanableh, T. .

APPLIED INTELLIGENCE, 2010, 33 (03) :330-339

[2] Acoustic profiles in vocal emotion expression [J].

Banse, R ;

Scherer, KR .

JOURNAL OF PERSONALITY AND SOCIAL PSYCHOLOGY, 1996, 70 (03) :614-636

[3] Eigenfaces vs. Fisherfaces: Recognition using class specific linear projection [J].

Belhumeur, PN ;

Hespanha, JP ;

Kriegman, DJ .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1997, 19 (07) :711-720

[4]

Chao Y. R., 1965, GRAMMAR SPOKEN CHINE

[5]

Chen Y, 1990, SPEECH SIGNAL PROCES

[6] Towards creative evolutionary systems with interactive genetic algorithm [J].

Cho, SB .

APPLIED INTELLIGENCE, 2002, 16 (02) :129-138

[7] Tracking affective components of satisfaction [J].

Coghlan, Alexandra ;

Pearce, Philip .

TOURISM AND HOSPITALITY RESEARCH, 2010, 10 (01) :42-58

[8] SUPPORT-VECTOR NETWORKS [J].

CORTES, C ;

VAPNIK, V .

MACHINE LEARNING, 1995, 20 (03) :273-297

[9] Extracting phonetic knowledge from learning systems: Perceptrons, support vector machines and linear discriminants [J].

Damper, RI ;

Gunn, SR ;

Gore, MO .

APPLIED INTELLIGENCE, 2000, 12 (1-2) :43-62

[10] AN ARGUMENT FOR BASIC EMOTIONS [J].

EKMAN, P .

COGNITION & EMOTION, 1992, 6 (3-4) :169-200

← 1 2 3 4 5 →