Speech Emotion Recognition: A Comprehensive Survey

被引:0
作者
Mohammed Jawad Al-Dujaili
Abbas Ebrahimi-Moghadam
机构
[1] University of Kufa,Departement of Electronic and Communication, Faculty of Engineering
[2] Ferdowsi University of Mashhad,Electrical Engineering Department Faculty of Engineering
来源
Wireless Personal Communications | 2023年 / 129卷
关键词
Speech; Emotion recognition; Feature extraction; Feature reduction; Classification; Classification composition;
D O I
暂无
中图分类号
学科分类号
摘要
Speech emotion recognition could be considered a new topic in speech processing where he plays that plays an essential role in human interaction. Emotions are a king of speech that recognizes the three significant aspects of designing the speech emotion recognition system. This article reviews the work on speech emotion recognition and is helpful for further research. Firstly, speech emotion recognition databases are described for evaluating system performance. Secondly, the choice of feature is presented in the speech representation. And third is the design of a suitable class. While the section fourth explains the multiple classifier system and its impact on system. In the fifth part of the article, we review the most important challenges in the system speech emotion recognition. The final results obtained from the system function and its constraints are discussed, and we provide directions to improve speech emotion recognition systems.
引用
收藏
页码:2525 / 2561
页数:36
相关论文
共 252 条
[1]  
Nicholson J(2000)Emotion recognition in speech using neural networks Neural Computing & Applications 9 290-296
[2]  
Takahashi K(2014)Emotion recognition in human computer interaction systems Elektronika ir Elektrotechnika 20 51-56
[3]  
Nakatsu R(2008)Acoustical properties of speech as indicators of depression and suicidal risk Vanderbilt Undergraduate Research Journal 4 66-100
[4]  
Mikuckas A(2010)Modulation spectral features for robust far-field speaker identification IEEE Transactions on Audio, Speech, and Language Processing 18 90-76
[5]  
Mikuckiene I(2020)Speech emotion recognition: Emotional models, databases, features, preprocessing methods, supporting modalities, and classifiers Speech Communication 116 56-1156
[6]  
Venckauskas A(2013)Sex differences in the timing of identification among children and adults with autism spectrum disorders Journal of Autism and Developmental Disorders 43 1151-1629
[7]  
Kazanavicius E(2019)Fusing MFCC and LPC features using 1D triplet CNN for speaker recognition in severely degraded audio signals IEEE Transactions on Information Forensics and Security 15 1616-13
[8]  
Lukas R(2020)Call redistribution for a call center based on speech emotion recognition Applied Sciences 10 4653-924
[9]  
Plauska I(2009)Vocal emotion recognition in five native languages of Assam using new wavelet features International Journal of Speech Technology 12 1-313
[10]  
Landau MJ(2017)CHEAVD: A Chinese natural emotional audio–visual database Journal of Ambient Intelligence and Humanized Computing 8 913-76