Application of Neural Networks in Emotional Speech Recognition

被引:0
作者
Bojanic, Milana [1 ]
Crnojevic, Vladimir [1 ]
Delic, Vlado [1 ]
机构
[1] Univ Novi Sad, Fac Tech Sci, Novi Sad 21000, Serbia
来源
ELEVENTH SYMPOSIUM ON NEURAL NETWORK APPLICATIONS IN ELECTRICAL ENGINEERING (NEUREL 2012) | 2012年
关键词
emotional speech recognition; neural network;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Emotional speech recognition (ESR) from the aspect of human-machine interaction (HCI) is a prerequisite for the framework of interacting partners within the HCI. This paper addresses the application of neural network (NN) in ESR. The performance of NN is tested using three different feature sets which are basis for ESR: prosodic features, spectral features and a set of their combination. The results of these feature sets are compared using several network topologies and two training algorithms. It has been shown that using joint prosodic-spectral feature set as input to three layer feed-forward NN trained with back-propagation algorithm has the best performance in 5-class emotional speech recognition task.
引用
收藏
页数:4
相关论文
共 13 条
  • [11] Petrushin V., 1999, Artificial Neural Networks In Engineering (ANNIE), P7
  • [12] Recognising realistic emotions and affect in speech: State of the art and lessons learnt from the first challenge
    Schuller, Bjorn
    Batliner, Anton
    Steidl, Stefan
    Seppi, Dino
    [J]. SPEECH COMMUNICATION, 2011, 53 (9-10) : 1062 - 1087
  • [13] EMOTIONS AND SPEECH - SOME ACOUSTICAL CORRELATES
    WILLIAMS, CE
    STEVENS, KN
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1972, 52 (04) : 1238 - &