COMPARISON OF ADAPTATION METHODS FOR GMM-SVM BASED SPEECH EMOTION RECOGNITION

被引:0
作者
Jiang, Jianbo [1 ]
Wu, Zhiyong [1 ]
Xu, Mingxing
Jia, Jia
Cai, Lianhong [1 ]
机构
[1] Tsinghua Univ, Technol & Syst Grad Sch Shenzhen, Tsinghua CUHK Joint Res Ctr Media Sci, Shenzhen 518055, Peoples R China
来源
2012 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2012) | 2012年
关键词
emotion recognition; GMM supervector based SVM; MAP adaptation; MLLR adaptation;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The required length of the utterance is one of the key factors affecting the performance of automatic emotion recognition. To gain the accuracy rate of emotion distinction, adaptation algorithms that can be manipulated on short utterances are highly essential. Regarding this, this paper compares two classical model adaptation methods, maximum a posteriori (MAP) and maximum likelihood linear regression (MLLR), in GMM-SVM based emotion recognition, and tries to find which method can perform better on different length of the enrollment of the utterances. Experiment results show that MLLR adaptation performs better for very short enrollment utterances (with the length shorter than 2s) while MAP adaptation is more effective for longer utterances.
引用
收藏
页码:269 / 273
页数:5
相关论文
共 13 条
[1]  
[Anonymous], 2005, P INTERSPEECH
[2]  
Campbell W.D., 2006, P ICASSP
[3]   LIBSVM: A Library for Support Vector Machines [J].
Chang, Chih-Chung ;
Lin, Chih-Jen .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
[4]  
Dai M., 2011, P NCMMSC
[5]  
Hu H, 2007, INT CONF ACOUST SPEE, P413
[6]  
Lee Chin-Hui., 2004, P ICSLP
[7]  
Luengo I., 2005, INTERSPEECH 2005, P493
[8]  
Mak B., 2007, P INTERSPEECH, P266
[9]  
Mak MW, 2006, INT CONF ACOUST SPEE, P929
[10]  
Mariethoz J., 2002, P ICSLP