Detection and Analysis of Emotion From Speech Signals

被引:35
作者
Davletcharova, Assel [1 ]
Sugathan, Sherin [2 ]
Abraham, Bibia [3 ]
James, Alex Pappachen [1 ,2 ]
机构
[1] Nazarbayev Univ, Dept Elect & Elect Engn, Astana, Kazakhstan
[2] Enview Res & Dev Labs, Trivandrum, Kerala, India
[3] Med Coll Hosp, Kannur, India
来源
SECOND INTERNATIONAL SYMPOSIUM ON COMPUTER VISION AND THE INTERNET (VISIONNET'15) | 2015年 / 58卷
关键词
Emotion Analysis; Emotion Classification; Speech Processing; Mel-Frequency Cepstral Coefficients; RECOGNITION;
D O I
10.1016/j.procs.2015.08.032
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Recognizing emotion from speech has become one the active research themes in speech processing and in applications based on human-computer interaction. This paper conducts an experimental study on recognizing emotions from human speech. The emotions considered for the experiments include neutral, anger, joy and sadness. The distinuishability of emotional features in speech were studied first followed by emotion classification performed on a custom dataset. The classification was performed for different classifiers. One of the main feature attribute considered in the prepared dataset was the peak-to-peak distance obtained from the graphical representation of the speech signals. After performing the classification tests on a dataset formed from 30 different subjects, it was found that for getting better accuracy, one should consider the data collected from one person rather than considering the data from a group of people. (C) 2015 The Authors. Published by Elsevier B.V. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
引用
收藏
页码:91 / 96
页数:6
相关论文
共 15 条
[1]  
Abernethy M., DATA MINING WEKA 2
[2]  
Albu F., 9 INT TECHN ED DEV C, V1
[3]  
[Anonymous], 2006, PLP RASTA MFCC INVER
[4]  
[Anonymous], 2000, ISMIR
[5]  
Anttonen J., 2005, CHI 2005 TECHNOLOGY, P491, DOI [10.1145/1054972.1055040, DOI 10.1145/1054972.1055040]
[6]   A large-scale evaluation of acoustic and subjective music-similarity measures [J].
Berenzweig, A ;
Logan, B ;
Ellis, DPW ;
Whitman, B .
COMPUTER MUSIC JOURNAL, 2004, 28 (02) :63-76
[7]   Noninvasive technique for detecting hypernasal speech using a nonlinear operator [J].
Cairns, DA ;
Hansen, JHL ;
Riski, JE .
IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 1996, 43 (01) :35-45
[8]  
Gamasu R.., 2014, INT J BIOSCI BIOTECH, V6, P83, DOI [10.14257/ijbsbt.2014.6. 1.09, DOI 10.14257/IJBSBT.2014.6.1.09]
[9]   Direct speech feature estimation using an iterative EM algorithm for vocal fold pathology detection [J].
GavidiaCeballos, L ;
Hansen, JHL .
IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 1996, 43 (04) :373-383
[10]   Emotion recognition system using short-term monitoring of physiological signals [J].
Kim, KH ;
Bang, SW ;
Kim, SR .
MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 2004, 42 (03) :419-427