A novel speech emotion recognition algorithm based on wavelet kernel sparse classifier in stacked deep auto-encoder model

被引:2
作者
Pengcheng Wei
Yu Zhao
机构
[1] Chongqing University of Education,School of Mathematics and Information Engineering
来源
Personal and Ubiquitous Computing | 2019年 / 23卷
关键词
Contextual information; Emotion recognition; Auto-encoder; Kernel sparse; Sub-utterance-level; Support vector machine; Hidden feature; Deep learning;
D O I
暂无
中图分类号
学科分类号
摘要
Since the contextual information has an important impact on the speaker’s emotional state, how to use emotion-related context information to conduct feature learning is a key problem. The existing speech emotion recognition algorithms achieve the relatively high recognition rate; these algorithms are not very good application to the real-life speech emotion recognition systems. Therefore, in order to address the abovementioned issues, a novel speech emotion recognition algorithm based on improved stacked kernel sparse deep model is proposed in this paper, which is based on auto-encoder, denoising auto-encoder, and sparse auto-encoder to improve the Chinese speech emotion recognition. The first layer of the structure uses a denoising auto-encoder to learn a hidden feature with a larger dimension than the dimension of the input features, and the second layer employs a sparse auto-encoder to learn sparse features. Finally, a wavelet-kernel sparse SVM classifier is applied to classify the features. The proposed algorithm is evaluated on the testing dataset, which contains the speech emotion data of spontaneous, non-prototypical, and long-term. The experimental results show that the proposed algorithm outperforms the existing state-of-the-art algorithms in speech emotion recognition.
引用
收藏
页码:521 / 529
页数:8
相关论文
共 47 条
[1]  
Wang K(2015)Speech emotion recognition using Fourier parameters IEEE Trans Affect Comput 6 69-75
[2]  
An N(2017)Speech emotion recognition based on a modified brain emotional learning model Biol Inspired Cogn Architectures 19 32-38
[3]  
Li BN(2017)Hybrid BBO_PSO and higher order spectral features for emotion and stress recognition from natural speech Appl Soft Comput 56 217-232
[4]  
Motamed S(2018)Emotion recognition by assisted learning with convolutional neural networks Neurocomputing 291 187-194
[5]  
Setayeshi S(2018)Heterogeneous knowledge transfer in video emotion recognition, attribution and summarization IEEE Trans Affect Comput 9 255-270
[6]  
Rabiee A(2017)Hierarchical convolutional neural networks for EEG-based emotion recognition Cogn Comput 10 368-380
[7]  
Yogesh CK(2017)SVM-based feature selection methods for emotion recognition from multimodal data J Multimodal User Interfaces 11 9-23
[8]  
Hariharan M(2018)A novel feature set for video emotion recognition Neurocomputing 291 11-20
[9]  
Ngadiran R(2018)A hybrid deep learning neural approach for emotion recognition from facial expressions for socially assistive robots Neural Comput Applic 29 359-373
[10]  
He X(2017)Dynamic facial landmarking selection for emotion recognition using Gaussian processes J Multimodal User Interfaces 11 327-340