Dimensionality Reduction for Emotional Speech Recognition

Cited by: 17
Authors
Fewzee, Pouria [1]
Karray, Fakhri [1]
Affiliations
[1] Univ Waterloo, Ctr Pattern Anal & Machine Intelligence, Waterloo, ON N2L 3G1, Canada
Source
PROCEEDINGS OF 2012 ASE/IEEE INTERNATIONAL CONFERENCE ON PRIVACY, SECURITY, RISK AND TRUST AND 2012 ASE/IEEE INTERNATIONAL CONFERENCE ON SOCIAL COMPUTING (SOCIALCOM/PASSAT 2012) | 2012
Keywords
DOI
10.1109/SocialCom-PASSAT.2012.83
Chinese Library Classification (CLC) number
TP301 [Theory, Methods];
Subject classification code
081202;
Abstract
The number of speech features proposed for emotional speech recognition runs into the thousands, which makes dimensionality reduction an inevitable part of an emotional speech recognition system. The elastic net, greedy feature selection, and supervised principal component analysis are three recently developed dimensionality reduction algorithms whose application to this problem we consider. Together with PCA, these four methods cover both supervised and unsupervised approaches, as well as both filter-type and projection-type dimensionality reduction. For our experiments, we chose the VAM corpus. We extracted two sets of features and investigated how well the four dimensionality reduction methods perform on each set individually and on the combination of the two. The experimental results show that, even with a dimensionality reduction stage, a longer vector of speech features does not necessarily result in a more accurate prediction of emotion.
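To make the distinction between projection-type and filter-type reduction concrete, here is a minimal sketch in Python with NumPy. It is not the authors' implementation. The PCA projection is standard, but the correlation-based filter is only a simple stand-in for a supervised filter and is not the elastic net, greedy feature selection, or supervised PCA studied in the paper. The function names and the synthetic data are assumptions made for illustration.

```python
import numpy as np

def pca_project(X, k):
    """Unsupervised projection: map X (n_samples x n_features) onto its top-k principal components."""
    Xc = X - X.mean(axis=0)  # center each feature
    # Rows of Vt are the principal directions, ordered by decreasing singular value.
    _, _, Vt = np.linalg.svd(Xc, full_matrices=False)
    return Xc @ Vt[:k].T

def correlation_filter(X, y, k):
    """Supervised filter (illustrative stand-in): indices of the k features
    with the largest absolute Pearson correlation with the target y."""
    Xc = X - X.mean(axis=0)
    yc = y - y.mean()
    denom = np.linalg.norm(Xc, axis=0) * np.linalg.norm(yc)
    denom[denom == 0] = np.inf  # constant features (or constant y) score 0
    scores = np.abs(Xc.T @ yc) / denom
    top = np.argsort(scores)[::-1][:k]
    return np.sort(top)

# Synthetic data (hypothetical): 200 utterances, 50 features, and a target
# that depends on only two of the features plus a little noise.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 50))
y = 2.0 * X[:, 3] - 1.5 * X[:, 17] + 0.1 * rng.normal(size=200)

Z = pca_project(X, 5)                  # new features: linear mixes of all 50 inputs
selected = correlation_filter(X, y, 2) # subset of the original feature indices
X_reduced = X[:, selected]
```

A projection such as PCA builds new features from all the original ones without looking at the labels, whereas a filter keeps a subset of the original features, here chosen with the labels in view. Either way, the regressor or classifier that follows sees a shorter vector.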
Pages: 532-537
Page count: 6