Low-Order Multi-Level Features for Speech Emotion Recognition

被引:0
作者
Tamulevicius, Gintautas [1 ]
Liogiene, Tatjana [1 ]
机构
[1] Vilnius Univ, Inst Math & Informat, Vilnius, Lithuania
来源
BALTIC JOURNAL OF MODERN COMPUTING | 2015年 / 3卷 / 04期
关键词
speech emotion recognition; features; feature selection; classification;
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Various feature selection and classification schemes were proposed to improve efficiency of speech emotion classification and recognition. In this paper we propose multi-level organization of classification process and features. The main idea is to perform classification of speech emotions in step-by-step manner using different feature subsets for every step. We applied the maximal efficiency feature selection criterion for composition of feature subsets in different classification levels. The proposed multi-level organization of classification and features was tested experimentally in two emotions, three emotions, and four emotions recognition tasks and was compared with conventional feature combination techniques. Using the maximal efficiency feature selection criterion 2nd and 16th order multi-level feature sets were composed for three and four emotions recognition tasks respectively. Experimental results show the superiority of proposed multi-level classification scheme by 6,3-25,6 % against straightforward classification and conventional feature combination schemes.
引用
收藏
页码:234 / 247
页数:14
相关论文
共 21 条
  • [1] [Anonymous], 2009, 3 INT C BIO BIOM ENG
  • [2] Burkhardt F., 9 EUROPEAN C SPEECH, P1517, DOI [10.21437/ interspeech.2005-446, DOI 10.21437/INTERSPEECH.2005-446]
  • [3] Multistyle classification of speech under stress using feature subset selection based on genetic algorithms
    Casale, Salvatore
    Russo, Alessandra
    Serrano, Salvatore
    [J]. SPEECH COMMUNICATION, 2007, 49 (10-11) : 801 - 810
  • [4] Chiou B.C., 2013, SIGNAL INFORM PROCES, P1
  • [5] Dellaert F, 1996, ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, P1970, DOI 10.1109/ICSLP.1996.608022
  • [6] Survey on speech emotion recognition: Features, classification schemes, and databases
    El Ayadi, Moataz
    Kamel, Mohamed S.
    Karray, Fakhri
    [J]. PATTERN RECOGNITION, 2011, 44 (03) : 572 - 587
  • [7] Eyben F, 2009, PROC INT C AFFECT CO, P1, DOI [DOI 10.1109/ACII.2009.5349350, 10.1109/ACII.2009.5349350]
  • [8] Speech emotion recognition using FCBF feature selection method and GA-optimized fuzzy ARTMAP neural network
    Gharavian, Davood
    Sheikhan, Mansour
    Nazerieh, Alireza
    Garoucy, Sahar
    [J]. NEURAL COMPUTING & APPLICATIONS, 2012, 21 (08) : 2115 - 2126
  • [9] Giannoulis P, 2012, LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, P1203
  • [10] Koolagudi S.G., 2010, P 2010 INT C SIGN PR, P1, DOI [10.1109/SPCOM16513.2010, DOI 10.1109/SPCOM16513.2010]