COMBINING FEATURE SELECTION AND REPRESENTATION FOR SPEECH EMOTION RECOGNITION

Cited by: 0
Authors
Han, Wenjing [1]
Ruan, Huabin [2]
Yu, Xiaojie [1]
Zhu, Xuan [1]
Affiliations
[1] Samsung R&D Inst China Beijing SRC B, Language Comp Lab, Beijing, Peoples R China
[2] Tsinghua Univ, Dept Comp Sci & Technol, Beijing, Peoples R China
Source
2016 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW) | 2016
Keywords
speech emotion recognition; multiple kernel learning; denoising autoencoder; feature selection; feature representation;
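DOI
Not available
Chinese Library Classification number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract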
In this paper, we propose a method that combines feature selection and feature representation to generate discriminative features for speech emotion recognition. In the feature selection stage, a Multiple Kernel Learning (MKL) based strategy is used to obtain the optimal feature subset. Specifically, features selected at least n times across 10-fold cross-validation are collected to build a new feature subset, named the n-subset; the n-subset that yields the highest classification accuracy is taken as the optimal one. In the feature representation stage, the optimal feature subset is mapped to a hidden representation using a denoising autoencoder (DAE). The model parameters are learned by minimizing the squared error between the original and the reconstructed input. The hidden representation is then used as the final feature set in the MKL model for emotion recognition. Our experimental results show significant performance improvement over the original features in both the inner-corpus and cross-corpus scenarios.
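The n-subset construction described in the abstract can be illustrated with a minimal sketch. It assumes the per-fold selections are already available as lists of feature names; how MKL produces those selections is not shown here. The names `build_n_subsets`, `pick_optimal_subset`, and the `evaluate` callback are hypothetical and are not taken from the paper.

```python
from collections import Counter


def build_n_subsets(selected_per_fold):
    """Map each threshold n to the features selected in at least n folds.

    selected_per_fold: one list of feature names per cross-validation fold
    (assumed to come from an MKL-based selector, which is not shown here).
    """
    # Count in how many folds each feature was selected (once per fold).
    counts = Counter(f for fold in selected_per_fold for f in set(fold))
    n_folds = len(selected_per_fold)
    return {
        n: sorted(f for f, c in counts.items() if c >= n)
        for n in range(1, n_folds + 1)
    }


def pick_optimal_subset(n_subsets, evaluate):
    """Return (n, subset) for the non-empty n-subset with the best score.

    evaluate: hypothetical caller-supplied function mapping a feature
    subset to a classification accuracy.
    """
    candidates = {n: s for n, s in n_subsets.items() if s}
    best_n = max(candidates, key=lambda n: evaluate(candidates[n]))
    return best_n, candidates[best_n]


# Toy example with 3 folds instead of 10.
folds = [["f0", "f1", "f2"], ["f1", "f2"], ["f2", "f3"]]
subsets = build_n_subsets(folds)

# Stand-in scorer for illustration only: prefers subsets of size 2.
best_n, best_subset = pick_optimal_subset(subsets, lambda s: -abs(len(s) - 2))
```

In a real pipeline, `evaluate` would train and score a classifier on the candidate subset; the stand-in scorer above only makes the example self-contained.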
Pages: 5
Related papers
50 in total
[21]   Decision tree SVM model with Fisher feature selection for speech emotion recognition [J].
Linhui Sun ;
Sheng Fu ;
Fu Wang .
EURASIP Journal on Audio, Speech, and Music Processing, 2019
[22]   Speech emotion recognition based on feature selection and extreme learning machine decision tree [J].
Liu, Zhen-Tao ;
Wu, Min ;
Cao, Wei-Hua ;
Mao, Jun-Wei ;
Xu, Jian-Ping ;
Tan, Guan-Zheng .
NEUROCOMPUTING, 2018, 273 :271-280
[23]   SPEECH EMOTION RECOGNITION METHOD BASED ON IMPROVED DECISION TREE AND LAYERED FEATURE SELECTION [J].
Mao, Qirong ;
Wang, Xiaojia ;
Zhan, Yongzhao .
INTERNATIONAL JOURNAL OF HUMANOID ROBOTICS, 2010, 7 (02) :245-261
[24]   Improving of Feature Selection in Speech Emotion Recognition Based-on Hybrid Evolutionary Algorithms [J].
Langari, Shadi ;
Marvi, Hossein ;
Zahedi, Morteza .
INTERNATIONAL JOURNAL OF NONLINEAR ANALYSIS AND APPLICATIONS, 2020, 11 (01) :81-92
[25]   Joint subspace learning and feature selection method for speech emotion recognition [J].
Song P. ;
Zheng W. ;
Zhao L. .
Qinghua Daxue Xuebao/Journal of Tsinghua University, 2018, 58 (04) :347-351
[26]   Pertinent feature selection techniques for automatic emotion recognition in stressed speech [J].
Tiwari P. ;
Darji A.D. .
International Journal of Speech Technology, 2022, 25 (02) :511-526
[27]   Investigation on Joint Representation Learning for Robust Feature Extraction in Speech Emotion Recognition [J].
Luo, Danqing ;
Zou, Yuexian ;
Huang, Dongyan .
19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, :152-156
[28]   Bi-Feature Selection Deep Learning-Based Techniques for Speech Emotion Recognition [J].
Akinpelu, Samson ;
Viriri, Serestina .
ADVANCES IN VISUAL COMPUTING, ISVC 2024, PT I, 2025, 15046 :345-356
[29]   Selective Acoustic Feature Enhancement for Speech Emotion Recognition With Noisy Speech [J].
Leem, Seong-Gyun ;
Fulford, Daniel ;
Onnela, Jukka-Pekka ;
Gard, David ;
Busso, Carlos .
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 :917-929
[30]   KOLMOGOROV-SMIRNOV TEST FOR FEATURE SELECTION IN EMOTION RECOGNITION FROM SPEECH [J].
Ivanov, Alexei ;
Riccardi, Giuseppe .
2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, :5125-5128