Emotion Recognition from Speech Signals using Excitation Source and Spectral Features

被引:0
作者
Choudhury, Akash Roy [1 ]
Ghosh, Anik [1 ]
Pandey, Rahul [1 ]
Barman, Subhas [1 ]
机构
[1] Jalpaiguri Govt Engn Coll, Dept Comp Sci & Engn, Jalpaiguri, W Bengal, India
来源
PROCEEDINGS OF 2018 IEEE APPLIED SIGNAL PROCESSING CONFERENCE (ASPCON) | 2018年
关键词
Emotion Recognition; Spectral Features; Prosodic Features; Excitation Source Features; SMO; Random Forest; LINEAR PREDICTION; SPEAKER; CLASSIFICATION;
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The task of recognition of emotions from speech signals is one that has been going on for a long time. In the previous works, the dominance of prosodic and spectral features have been observed when it comes to recognition of emotions. But a speech signal also consists of Source level information which gets lost during this process. In this work, we have combined several spectral features with several excitation source features to see how well the model can perform the emotion recognition task. For the task in hand we have taken 3 databases namely, Berlin Emotional Database (Berlin Emo-DB), Surrey Audio-Visual Expressed Emotion (SAVEE) Database and Toronto emotional speech set (TESS) Database. The reason behind taking these databases is that the variation they offer is effective to judge the robustness of the recognition model. We chose Sequential Minimal Optimization (SMO)and Random Forest to perform classification.
引用
收藏
页码:257 / 261
页数:5
相关论文
共 26 条
[11]   Bagging predictors [J].
Breiman, L .
MACHINE LEARNING, 1996, 24 (02) :123-140
[12]   Support vector machines for speaker and language recognition [J].
Campbell, WM ;
Campbell, JP ;
Reynolds, DA ;
Singer, E ;
Torres-Carrasquillo, PA .
COMPUTER SPEECH AND LANGUAGE, 2006, 20 (2-3) :210-229
[13]   Predicting the Types of Metabolic Pathway of Compounds Using Molecular Fragments and Sequential Minimal Optimization [J].
Chen, Lei ;
Chu, Chen ;
Feng, Kaiyan .
COMBINATORIAL CHEMISTRY & HIGH THROUGHPUT SCREENING, 2016, 19 (02) :136-143
[14]  
CORTES C, 1995, MACH LEARN, V20, P273, DOI 10.1023/A:1022627411411
[15]   Gene selection and classification of microarray data using random forest -: art. no. 3 [J].
Díaz-Uriarte, R ;
de Andrés, SA .
BMC BIOINFORMATICS, 2006, 7 (1)
[16]  
Hastie T, 1998, ADV NEUR IN, V10, P507
[17]  
Jackson P, 2014, SURREY AUDIO VISUAL
[18]   Emotion recognition from speech using sub-syllabic and pitch synchronous spectral features [J].
Koolagudi, Shashidhar ;
Krothapalli, Sreenivasa .
INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2012, 15 (04) :495-511
[19]  
Koolagudi SG, 2010, INT CO SIG PROC COMM
[20]   LINEAR PREDICTION - TUTORIAL REVIEW [J].
MAKHOUL, J .
PROCEEDINGS OF THE IEEE, 1975, 63 (04) :561-580