Spontaneous speech emotion recognition via multiple kernel learning

Cited by: 2
Authors
Zha, Cheng [1, 2]
Yang, Ping [2 ]
Zhang, Xinran [1 ]
Zhao, Li [1 ]
Affiliations
[1] Southeast Univ, Key Lab Underwater Acoust Signal Proc, Minist Educ, Nanjing 210096, Jiangsu, Peoples R China
[2] Guizhou Univ, Coll Big Data & Informat Engn, Guiyang 550002, Peoples R China
Source
PROCEEDINGS 2016 EIGHTH INTERNATIONAL CONFERENCE ON MEASURING TECHNOLOGY AND MECHATRONICS AUTOMATION ICMTMA 2016 | 2016
Keywords
Speech emotion recognition; Multiple kernel learning; SVM; Nonlinear mapping function
DOI
10.1109/ICMTMA.2016.152
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Speech emotion recognition has become an active topic in pattern recognition. The support vector machine (SVM) is an effective classifier because its nonlinear mapping function can map the data into a high- or even infinite-dimensional feature space. However, a single kernel function might not be sufficient to describe the different properties of spontaneous speech emotion data, and thus it cannot produce a satisfactory decision function. To address this issue, we apply a multiple kernel learning (MKL) algorithm to recognize spontaneous speech emotion. The experiments are evaluated on the FAU Aibo spontaneous speech emotion database. Compared with SVM, MKL achieves better performance on spontaneous speech emotion recognition.
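The idea in the abstract, replacing a single kernel with a weighted combination of several kernels, can be sketched as follows. This is a minimal illustration and not the algorithm used in the paper: the toy feature vectors, the three base kernels, their parameters, and the kernel-target-alignment heuristic used to set the weights are all assumptions chosen for brevity.

```python
import math

# Base kernels (hypothetical choices; the paper's kernels are not stated here).
def linear_kernel(x, z):
    return sum(a * b for a, b in zip(x, z))

def rbf_kernel(x, z, gamma=0.5):
    sq_dist = sum((a - b) ** 2 for a, b in zip(x, z))
    return math.exp(-gamma * sq_dist)

def poly_kernel(x, z, degree=2, c=1.0):
    return (linear_kernel(x, z) + c) ** degree

def gram(kernel, X):
    """Gram matrix K[i][j] = kernel(X[i], X[j])."""
    return [[kernel(xi, xj) for xj in X] for xi in X]

def alignment(K, y):
    """Kernel-target alignment <K, yy^T>_F / (||K||_F * ||yy^T||_F).

    For labels in {-1, +1}, ||yy^T||_F equals the number of samples.
    """
    n = len(y)
    num = sum(K[i][j] * y[i] * y[j] for i in range(n) for j in range(n))
    norm_k = math.sqrt(sum(K[i][j] ** 2 for i in range(n) for j in range(n)))
    return num / (norm_k * n)

def combine(grams, y):
    """Weight each Gram matrix by its (clipped) alignment; weights sum to 1."""
    scores = [max(alignment(K, y), 0.0) for K in grams]
    total = sum(scores)
    if total == 0.0:
        weights = [1.0 / len(grams)] * len(grams)  # fall back to uniform weights
    else:
        weights = [s / total for s in scores]
    n = len(y)
    combined = [[sum(w * G[i][j] for w, G in zip(weights, grams))
                 for j in range(n)] for i in range(n)]
    return weights, combined

# Toy two-class data standing in for acoustic feature vectors (assumption).
X = [(0.0, 0.1), (0.2, 0.0), (1.0, 0.9), (0.9, 1.1)]
y = [-1, -1, 1, 1]

grams = [gram(k, X) for k in (linear_kernel, rbf_kernel, poly_kernel)]
weights, K_combined = combine(grams, y)
print("kernel weights:", weights)
```

A convex combination of valid kernels is itself a valid kernel, so `K_combined` could be handed to any SVM implementation that accepts a precomputed Gram matrix, for example scikit-learn's `SVC(kernel="precomputed")`. MKL algorithms proper learn the weights jointly with the SVM rather than fixing them in advance with a heuristic as this sketch does.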
Pages: 621-623
Number of pages: 3