Spontaneous speech emotion recognition via multiple kernel learning

Cited by: 2

Authors:

Zha, Cheng [1,2]

Yang, Ping [2]

Zhang, Xinran [1]

Zhao, Li [1]

Affiliations:

[1] Southeast Univ, Key Lab Underwater Acoust Signal Proc, Minist Educ, Nanjing 210096, Jiangsu, Peoples R China

[2] Guizhou Univ, Coll Big Data & Informat Engn, Guiyang 550002, Peoples R China

Source:

PROCEEDINGS 2016 EIGHTH INTERNATIONAL CONFERENCE ON MEASURING TECHNOLOGY AND MECHATRONICS AUTOMATION ICMTMA 2016 | 2016

Keywords:

Speech emotion recognition; Multiple kernel learning; SVM; Nonlinear mapping function;

DOI:

10.1109/ICMTMA.2016.152

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812 ;

Abstract:

Speech emotion recognition has become an active topic in pattern recognition. In particular, the support vector machine (SVM) is an effective classifier because it applies a nonlinear mapping function that can map the data into a high- or even infinite-dimensional feature space. However, a single kernel function might not be sufficient to describe the different properties of spontaneous speech emotion data, and thus it cannot produce a satisfactory decision function. To address this issue, we apply a multiple kernel learning (MKL) algorithm to recognize spontaneous speech emotion. The experiments are evaluated on a spontaneous speech emotion database, the FAU Aibo database. Compared to SVM, MKL achieves better performance on spontaneous speech emotion recognition.
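
As a rough illustration of the idea behind MKL mentioned in the abstract, the sketch below builds two base kernels and forms a convex combination of their Gram matrices. The abstract does not say which kernels, weights, or solver the authors used, so the RBF and polynomial kernels, their parameters, the toy feature vectors, and the hand-picked weights are all assumptions for illustration only. An actual MKL method would learn the weights together with the SVM rather than fix them.

```python
import math

def rbf_kernel(x, y, gamma=0.5):
    """Gaussian (RBF) kernel: exp(-gamma * ||x - y||^2)."""
    sq_dist = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-gamma * sq_dist)

def poly_kernel(x, y, degree=2, coef0=1.0):
    """Polynomial kernel: (<x, y> + coef0)^degree."""
    dot = sum(a * b for a, b in zip(x, y))
    return (dot + coef0) ** degree

def gram_matrix(kernel, samples):
    """Pairwise kernel values for a list of feature vectors."""
    return [[kernel(a, b) for b in samples] for a in samples]

def combine_kernels(gram_matrices, weights):
    """Convex combination sum_m d_m * K_m with d_m >= 0 and sum_m d_m = 1."""
    if any(w < 0 for w in weights) or abs(sum(weights) - 1.0) > 1e-9:
        raise ValueError("weights must be non-negative and sum to 1")
    n = len(gram_matrices[0])
    return [[sum(w * K[i][j] for w, K in zip(weights, gram_matrices))
             for j in range(n)] for i in range(n)]

# Toy 2-D feature vectors (hypothetical; not taken from FAU Aibo).
samples = [[0.0, 1.0], [1.0, 0.0], [1.0, 1.0]]
K_rbf = gram_matrix(rbf_kernel, samples)
K_poly = gram_matrix(poly_kernel, samples)

# Hand-picked weights (an assumption); an MKL solver would learn these.
K_combined = combine_kernels([K_rbf, K_poly], [0.7, 0.3])
```

The combined matrix is still a valid kernel, because a non-negative weighted sum of positive semidefinite kernels is positive semidefinite. It could therefore be passed to any SVM implementation that accepts a precomputed Gram matrix.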

Pages: 621-623

Number of pages: 3

