A novel speech emotion recognition algorithm based on wavelet kernel sparse classifier in stacked deep auto-encoder model

被引：2

作者：

Pengcheng Wei

Yu Zhao

机构：

[1] Chongqing University of Education,School of Mathematics and Information Engineering

来源：

Personal and Ubiquitous Computing | 2019年 / 23卷

关键词：

Contextual information; Emotion recognition; Auto-encoder; Kernel sparse; Sub-utterance-level; Support vector machine; Hidden feature; Deep learning;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Since the contextual information has an important impact on the speaker’s emotional state, how to use emotion-related context information to conduct feature learning is a key problem. The existing speech emotion recognition algorithms achieve the relatively high recognition rate; these algorithms are not very good application to the real-life speech emotion recognition systems. Therefore, in order to address the abovementioned issues, a novel speech emotion recognition algorithm based on improved stacked kernel sparse deep model is proposed in this paper, which is based on auto-encoder, denoising auto-encoder, and sparse auto-encoder to improve the Chinese speech emotion recognition. The first layer of the structure uses a denoising auto-encoder to learn a hidden feature with a larger dimension than the dimension of the input features, and the second layer employs a sparse auto-encoder to learn sparse features. Finally, a wavelet-kernel sparse SVM classifier is applied to classify the features. The proposed algorithm is evaluated on the testing dataset, which contains the speech emotion data of spontaneous, non-prototypical, and long-term. The experimental results show that the proposed algorithm outperforms the existing state-of-the-art algorithms in speech emotion recognition.

引用

页码：521 / 529

页数：8

共 47 条

[1]

Wang K(2015)Speech emotion recognition using Fourier parameters IEEE Trans Affect Comput 6 69-75

[2]

An N(2017)Speech emotion recognition based on a modified brain emotional learning model Biol Inspired Cogn Architectures 19 32-38

[3]

Li BN(2017)Hybrid BBO_PSO and higher order spectral features for emotion and stress recognition from natural speech Appl Soft Comput 56 217-232

[4]

Motamed S(2018)Emotion recognition by assisted learning with convolutional neural networks Neurocomputing 291 187-194

[5]

Setayeshi S(2018)Heterogeneous knowledge transfer in video emotion recognition, attribution and summarization IEEE Trans Affect Comput 9 255-270

[6]

Rabiee A(2017)Hierarchical convolutional neural networks for EEG-based emotion recognition Cogn Comput 10 368-380

[7]

Yogesh CK(2017)SVM-based feature selection methods for emotion recognition from multimodal data J Multimodal User Interfaces 11 9-23

[8]

Hariharan M(2018)A novel feature set for video emotion recognition Neurocomputing 291 11-20

[9]

Ngadiran R(2018)A hybrid deep learning neural approach for emotion recognition from facial expressions for socially assistive robots Neural Comput Applic 29 359-373

[10]

He X(2017)Dynamic facial landmarking selection for emotion recognition using Gaussian processes J Multimodal User Interfaces 11 327-340

← 1 2 3 4 5 →