Speech emotion recognition of Hindi speech using statistical and machine learning techniques

被引:6
作者
Agrawal, Akshat [1 ]
Jain, Anurag [1 ]
机构
[1] Guru Gobind Singh Indraprastha Univ, Univ Sch Informat Commun & Technol, Sect 16C, New Delhi 110078, India
关键词
Speech Emotion Recognition; Statistical approach; KNN; Naive Bayes; Prosodic features; CLASSIFICATION; FEATURES;
D O I
10.1080/09720502.2020.1721926
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Now a days speech is fastest medium of giving instruction to machines to do any task. When a person uttered a word at that time machine can understand the semantic of the utterance but not the emotion related with that utterance. This study mainly focused on combining on different types of speech features together and then this paper used various statistical techniques for reducing the dimensionality of data and then applying machine learning algorithm to train dataset and predicting the emotional state of the person so that while receiving the instruction form humans, machines can provide better response for recognizing emotion.
引用
收藏
页码:311 / 319
页数:9
相关论文
共 25 条
[1]  
Agrawal A., 2019, DEEP LEARNING BASED
[2]  
Agrawal SS, 2010, INT J COMPUT SCI NET, V10, P198
[3]  
[Anonymous], P WORKSH APPL INF TE
[4]  
[Anonymous], 1990, Journal of the American Voice I/O Society
[5]  
Benzooijen R., 1984, MOTIV EMOTION, V15, P123
[6]   NONLINEAR-ANALYSIS AND CLASSIFICATION OF SPEECH UNDER STRESSED CONDITIONS [J].
CAIRNS, DA ;
HANSEN, JHL .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1994, 96 (06) :3392-3400
[7]   A Study on Paraphrase Recognition Using Radial Basis Function Neural Network [J].
Chitra, A. ;
Rajkumar, Anupriya .
IETE JOURNAL OF RESEARCH, 2012, 58 (01) :50-56
[8]   Describing the emotional states that are expressed in speech [J].
Cowie, R ;
Cornelius, RR .
SPEECH COMMUNICATION, 2003, 40 (1-2) :5-32
[9]  
Cowie R, 1996, ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, P1989, DOI 10.1109/ICSLP.1996.608027
[10]  
Cowie R., 2000, ISCA ITRW SPEECH EMO, P224