Speech emotion recognition based on statistical pitch model

被引:3
作者
WANG Zhiping ZHAO Li ZOU Cairong (Department of Radio Engineering
机构
基金
中国国家自然科学基金;
关键词
Speech emotion recognition based on statistical pitch model;
D O I
10.15949/j.cnki.0217-9776.2006.01.009
中图分类号
O429 [应用声学];
学科分类号
070206 ; 082403 ;
摘要
A modified Parzen-window method, which keep high resolution in low frequencies and keep smoothness in high frequencies, is proposed to obtain statistical model. Then, a gender classification method utilizing the statistical model is proposed, which have a 98% accuracy of gender classification while long sentence is dealt with. By separation the male voice and female voice, the mean and standard deviation of speech training samples with different emotion are used to create the corresponding emotion models. Then the Bhattacharyya distance between the test sample and statistical models of pitch, are utilized for emotion recognition in speech. The normalization of pitch for the male voice and female voice are also considered, in order to illustrate them into a uniform space. Finally, the speech emotion recognition experiment based on K Nearest Neighbor shows that, the correct rate of 81% is achieved, where it is only 73.85% if the traditional parameters are utilized.
引用
收藏
页码:87 / 96
页数:10
相关论文
共 1 条
  • [1] Regulation and Entrainment in Human-Robot Interaction .2 Breazeal C. International Journal of Robotic Research . 2002