Feature Selection Based Transfer Subspace Learning for Speech Emotion Recognition

被引:52
作者
Song, Peng [1 ]
Zheng, Wenming [2 ]
机构
[1] Yantai Univ, Sch Comp & Control Engn, Yantai 264005, Peoples R China
[2] Southeast Univ, Res Ctr Learning Sci, Minist Educ, Key Lab Child Dev & Learning Sci, Nanjing 210096, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature selection; transfer learning; subspace learning; speech emotion recognition; FRAMEWORK;
D O I
10.1109/TAFFC.2018.2800046
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Cross-corpus speech emotion recognition has recently received considerable attention due to the widespread existence of various emotional speech. It takes one corpus as the training data aiming to recognize emotions of another corpus, and generally involves two basic problems, i.e., feature matching and feature selection. Many previous works study these two problems independently, or just focus on solving the first problem. In this paper, we propose a novel algorithm, called feature selection based transfer subspace learning (FSTSL), to address these two problems. To deal with the first problem, a latent common subspace is learnt by reducing the difference of different corpora and preserving the important properties. Meanwhile, we adopt the l(2,1)-norm on the projection matrix to deal with the second problem. Besides, to guarantee the subspace to be robust and discriminative, the geometric information of data is exploited simultaneously in the proposed FSTSL framework. Empirical experiments on cross-corpus speech emotion recognition tasks demonstrate that our proposed method can achieve encouraging results in comparison with state-of-the-art algorithms.
引用
收藏
页码:373 / 382
页数:10
相关论文
共 71 条
[1]  
Aldeneh Z, 2017, INT CONF ACOUST SPEE, P2741, DOI 10.1109/ICASSP.2017.7952655
[2]  
[Anonymous], 2019, IEEE T AFFECT COMPUT, DOI DOI 10.1109/TAFFC.2017.2684799
[3]  
[Anonymous], 2016, EUR SIGNAL PR CONF
[4]  
[Anonymous], 2019, IEEE T AFFECT COMPUT, DOI DOI 10.1109/TAFFC.2017.2705696
[5]  
[Anonymous], 2014, 2014 9 INT S CHIN
[6]  
[Anonymous], 2012, PROC CVPR IEEE
[7]  
[Anonymous], 2017, IEEE ACM T AUDIO SPE, DOI DOI 10.1109/TASLP.2017.2694704
[8]   Feature Selection with Annealing for Computer Vision and Big Data Learning [J].
Barbu, Adrian ;
She, Yiyuan ;
Ding, Liangjing ;
Gramajo, Gary .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (02) :272-286
[9]   Eigenfaces vs. Fisherfaces: Recognition using class specific linear projection [J].
Belhumeur, PN ;
Hespanha, JP ;
Kriegman, DJ .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1997, 19 (07) :711-720
[10]  
Belkin M, 2002, ADV NEUR IN, V14, P585