Application of Neural Networks in Emotional Speech Recognition

被引:0
作者
Bojanic, Milana [1 ]
Crnojevic, Vladimir [1 ]
Delic, Vlado [1 ]
机构
[1] Univ Novi Sad, Fac Tech Sci, Novi Sad 21000, Serbia
来源
ELEVENTH SYMPOSIUM ON NEURAL NETWORK APPLICATIONS IN ELECTRICAL ENGINEERING (NEUREL 2012) | 2012年
关键词
emotional speech recognition; neural network;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Emotional speech recognition (ESR) from the aspect of human-machine interaction (HCI) is a prerequisite for the framework of interacting partners within the HCI. This paper addresses the application of neural network (NN) in ESR. The performance of NN is tested using three different feature sets which are basis for ESR: prosodic features, spectral features and a set of their combination. The results of these feature sets are compared using several network topologies and two training algorithms. It has been shown that using joint prosodic-spectral feature set as input to three layer feed-forward NN trained with back-propagation algorithm has the best performance in 5-class emotional speech recognition task.
引用
收藏
页数:4
相关论文
共 13 条
  • [1] Delic V., ELECT ELECT IN PRESS
  • [2] Survey on speech emotion recognition: Features, classification schemes, and databases
    El Ayadi, Moataz
    Kamel, Mohamed S.
    Karray, Fakhri
    [J]. PATTERN RECOGNITION, 2011, 44 (03) : 572 - 587
  • [3] Eyben F., 2010, P 18 ACM INT C MULT, p1459 1462
  • [4] Recognizing affect from speech prosody using hierarchical graphical models
    Fernandez, Raul
    Picard, Rosalind
    [J]. SPEECH COMMUNICATION, 2011, 53 (9-10) : 1088 - 1103
  • [5] Acoustical properties of speech as indicators of depression and suicidal risk
    France, DJ
    Shiavi, RG
    Silverman, S
    Silverman, M
    Wilkes, DM
    [J]. IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2000, 47 (07) : 829 - 837
  • [6] Speech under stress: Analysis, modeling and recognition
    Center for Robust Speech Systems, University of Texas at Dallas, Richardson, TX 75080, United States
    [J]. Lect. Notes Comput. Sci., 2007, (108-137): : 108 - 137
  • [7] Jovicic S.T., 2004, PROC SPECOM, P77
  • [8] Toward detecting emotions in spoken dialogs
    Lee, CM
    Narayanan, SS
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2005, 13 (02): : 293 - 303
  • [9] Emotion recognition in speech using neural networks
    Nicholson, J
    Takahashi, K
    Nakatsu, R
    [J]. NEURAL COMPUTING & APPLICATIONS, 2000, 9 (04) : 290 - 296
  • [10] Speech emotion recognition using hidden Markov models
    Nwe, TL
    Foo, SW
    De Silva, LC
    [J]. SPEECH COMMUNICATION, 2003, 41 (04) : 603 - 623