COMBINING FEATURE SELECTION AND REPRESENTATION FOR SPEECH EMOTION RECOGNITION

被引：0

作者：

Han, Wenjing ^{[1
]}

Ruan, Huabin ^{[2
]}

Yu, Xiaojie ^{[1
]}

Zhu, Xuan ^{[1
]}

机构：

[1] Samsung R&D Inst China Beijing SRC B, Language Comp Lab, Beijing, Peoples R China

[2] Tsinghua Univ, Dept Comp Sci & Technol, Beijing, Peoples R China

来源：

2016 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW) | 2016年

关键词：

speech emotion recognition; multiple kernel learning; denoising autoencoder; feature selection; feature representation;

D O I：

暂无

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

In this paper, we propose a feature selection and representation combination method to generate discriminative features for speech emotion recognition. In feature selection stage, a Multiple Kernel Learning (MKL) based strategy is used to obtain the optimal feature subset. Specifically, features selected at least n times among 10-fold cross validation are collected to build a new feature subset named n-subset, then the n-subset resulting in the highest classification accuracy is viewed as the optimal one. In feature representation stage, the optimal feature subset is mapped to a hidden representation using a denoising autoencoder (DAE). The model parameters are learned by minimizing the squared error between the original and the reconstructed input. The hidden representation is then used as the final feature set in the MKL model for emotion recognition. Our experimental results show significant performance improvement compared to using the original features in both of the inner-corpus and cross-corpus scenarios.

引用

页数：5

共 50 条

[31] Speech feature selection and emotion recognition based on weighted binary cuckoo search [J].

Zhang, Zicheng .

ALEXANDRIA ENGINEERING JOURNAL, 2021, 60 (01) :1499-1507

[32] A modified feature selection method based on metaheuristic algorithms for speech emotion recognition [J].

Yildirim, Serdar ;

Kaya, Yasin ;

Kilic, Fatih .

APPLIED ACOUSTICS, 2021, 173

[33] Emotion Recognition from Speech using Extended Feature Selection and a Simple Classifier [J].

Hassan, Ali ;

Damper, Robert I. .

INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, :2011-2014

[34] Metric Learning Based Feature Representation with Gated Fusion Model for Speech Emotion Recognition [J].

Gao, Yuan ;

Liu, JiaXing ;

Wang, Longbiao ;

Dang, Jianwu .

INTERSPEECH 2021, 2021, :4503-4507

[35] Feature Selection for Music Emotion Recognition [J].

Widiyanti, Emilia ;

Endah, Sukmawati Nur .

2018 2ND INTERNATIONAL CONFERENCE ON INFORMATICS AND COMPUTATIONAL SCIENCES (ICICOS), 2018, :120-124

[36] SFS feature selection technique for multistage emotion recognition [J].

Liogiene, Tatjana ;

Tamulevicius, Gintautas .

PROCEEDINGS OF THE 2015 IEEE 3RD WORKSHOP ON ADVANCES IN INFORMATION, ELECTRONIC AND ELECTRICAL ENGINEERING (AIEEE 2015), 2015,

[37] Speech Emotion Recognition Based on Sparse Representation [J].

Yan, Jingjie ;

Wang, Xiaolan ;

Gu, Weiyi ;

Ma, Lili .

ARCHIVES OF ACOUSTICS, 2013, 38 (04) :465-470

[38] A Feature Fusion Model with Data Augmentation for Speech Emotion Recognition [J].

Tu, Zhongwen ;

Liu, Bin ;

Zhao, Wei ;

Yan, Raoxin ;

Zou, Yang .

APPLIED SCIENCES-BASEL, 2023, 13 (07)

[39] Speech emotion recognition with unsupervised feature learning [J].

Zheng-wei HUANG ;

Wen-tao XUE ;

Qi-rong MAO .

Frontiers of Information Technology & Electronic Engineering, 2015, 16 (05) :358-366

[40] Acoustic feature analysis and optimization for Bangla speech emotion recognition [J].

Sultana, Sadia ;

Rahman, Mohammad Shahidur .

ACOUSTICAL SCIENCE AND TECHNOLOGY, 2023, 44 (03) :157-166

← 1 2 3 4 5 →