An Effective Speech Emotion Recognition Using Artificial Neural Networks

Cited by: 0
Authors
Anoop, V. [1]
Rao, P. V. [2]
Aruna, S. [2]
Affiliations
[1] VTU Belgaum, Dept ECE, Belagavi, Karnataka, India
[2] Rajarajeswari Coll Engn, Bengaluru, India
Source
INTERNATIONAL PROCEEDINGS ON ADVANCES IN SOFT COMPUTING, INTELLIGENT SYSTEMS AND APPLICATIONS, ASISA 2016 | 2018 / Vol. 628
Keywords
Speech emotion recognition; Modified cuckoo search; SNR; ANN classifier
DOI
10.1007/978-981-10-5272-9_36
CLC number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Speech has emerged as the fastest and most natural mode of communication between human beings. As an indispensable means of understanding human emotional behaviour, speech emotion recognition (SER) has gained rapidly growing significance in human-centred signal processing. In this paper, the speech signal is recognized and classified in four distinct phases: feature extraction, modified cuckoo search-based generation of optimal weights, feature analysis, and artificial neural network (ANN) classification. The classification is implemented in MATLAB, and the results are evaluated to assess the performance of the system. Compared with the existing method, the performance metrics show improvements of 0.692% in sensitivity, 24% in specificity and 5.88% in accuracy.
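The abstract reports sensitivity, specificity and accuracy without defining them. As a minimal sketch, assuming the standard confusion-matrix definitions for a two-class problem, the three metrics can be computed as shown here. The function name binary_metrics and the sample labels are hypothetical and serve only as illustration; this is not the authors' MATLAB evaluation code, and the paper may well use a multi-class variant.

```python
def binary_metrics(y_true, y_pred, positive=1):
    """Return sensitivity, specificity and accuracy for a two-class problem.

    Assumes the standard definitions:
      sensitivity = TP / (TP + FN)
      specificity = TN / (TN + FP)
      accuracy    = (TP + TN) / N
    """
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    tn = sum(1 for t, p in pairs if t != positive and p != positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)

    # Guard against empty classes so the function never divides by zero.
    sensitivity = tp / (tp + fn) if (tp + fn) else 0.0
    specificity = tn / (tn + fp) if (tn + fp) else 0.0
    accuracy = (tp + tn) / len(pairs) if pairs else 0.0
    return {
        "sensitivity": sensitivity,
        "specificity": specificity,
        "accuracy": accuracy,
    }


# Hypothetical labels: 1 = target emotion, 0 = any other emotion.
y_true = [1, 1, 1, 0, 0, 0, 0, 1]
y_pred = [1, 0, 1, 0, 1, 0, 0, 1]
metrics = binary_metrics(y_true, y_pred)
```

For these sample labels there are 3 true positives, 1 false negative, 3 true negatives and 1 false positive, so all three metrics come out to 0.75.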
Pages: 393-401
Page count: 9