Selection of Fractal Dimension Features for Speech Emotion Classification

被引:0
作者
Tamulevicius, Gintautas [1 ]
Karbauskaite, Rasa [2 ]
Dzemvda, Gintautas [2 ]
机构
[1] Vilnius Gediminas Tech Univ, Fac Elect, Vilnius, Lithuania
[2] Vilnius Univ, Inst Math & Informat, Vilnius, Lithuania
来源
2017 OPEN CONFERENCE OF ELECTRICAL, ELECTRONIC AND INFORMATION SCIENCES (ESTREAM) | 2017年
关键词
fractals; feature selection; emotion recognition; MAXIMUM-LIKELIHOOD ESTIMATOR; INTRINSIC DIMENSIONALITY; GEODESIC DISTANCES; RECOGNITION;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Despite numerous studies during the last decade speech emotion recognition is still the task of limited success. Great efforts were made for extending emotional speech feature sets and selecting the most effective ones, proposing multi-stage and multiple classifier based classification schemes, and developing multi-modal speech emotion recognition technique. Nevertheless, the reported emotion recognition rates vary from 70 % up to 90 % depending on the analyzed language, the number of recognized emotions, the speaker mode, and other important factors. Considering the nonlinear and fluctuating nature of the spoken language, we present a feature set, based on a fractal dimension (FD) for emotion classification. Katz, Castiglioni, Higuchi, and Hurst exponent-based FD features were employed in 2-7 emotion classification tasks. The experimental results show a clear superiority of FD based feature sets against acoustic ones. The feature selection enabled us to reduce the initial feature set down to 2-7 order sets and to improve thereby the accuracy of speech emotion classification by 11.4 % The obtained average classification accuracy for all tasks was 96.6 %.
引用
收藏
页数:4
相关论文
共 25 条