Speech Emotion Recognition Systems: A Comprehensive Review on Different Methodologies

Cited by: 11
Authors
Anthony, Audre Arlene [1]
Patil, Chandreshekar Mohan [1]
Affiliations
[1] Vidyavardhaka Coll Engn, Dept Elect & Commun Engn, Mysuru, India
Keywords
Classification; Emotions; Emotion Recognition; MFCC; Neural Networks; Speech Emotion Recognition
DOI
10.1007/s11277-023-10296-5
Chinese Library Classification number
TN [Electronic technology, communication technology]
Discipline classification code
0809
Abstract
Speech is a common and natural way for humans to express themselves. Speech Emotion Recognition (SER) systems can be defined as a collection of methods that process and classify speech signals to detect the emotions they carry. Automatic emotion recognition is the identification of human emotions from signals such as speech, facial expressions, and text. Collecting and labelling such signals is often tedious and requires expert knowledge. This paper surveys open-source speech emotion datasets in various languages and recent literature on speech emotion recognition that applies a range of machine learning approaches with the objective of improving classification accuracy. It aims to identify and synthesize current, pertinent literature on SER systems with different methodologies or design components, giving researchers an up-to-date understanding of the field.
Pages: 515-525
Page count: 11
Related papers
16 in total
[1]  
Absa AHA, 2018, INT MULTICONF SYST, P1241, DOI 10.1109/SSD.2018.8570407
[2]   Deep-Net: A Lightweight CNN-Based Speech Emotion Recognition System Using Deep Frequency Features [J].
Anvarjon, Tursunov ;
Mustaqeem ;
Kwon, Soonil .
SENSORS, 2020, 20 (18) :1-16
[3]  
Aouani Hadhami, 2020, Procedia Computer Science, V176, P251, DOI 10.1016/j.procs.2020.08.027
[4]   Tomato Leaf Disease Classification using Multiple Feature Extraction Techniques [J].
Basavaiah, Jagadeesh ;
Arlene Anthony, Audre .
WIRELESS PERSONAL COMMUNICATIONS, 2020, 115 (01) :633-651
[5]  
Bharti Deepak, 2020, 2020 International Conference on Smart Electronics and Communication (ICOSEC), P491, DOI 10.1109/ICOSEC49089.2020.9215376
[6]   Multi-Modal Emotion Recognition From Speech and Facial Expression Based on Deep Learning [J].
Cai, Linqin ;
Dong, Jiangong ;
Wei, Min .
2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, :5726-5729
[7]  
Cheng HH, 2020, 2020 IEEE 3RD INTERNATIONAL CONFERENCE ON INFORMATION COMMUNICATION AND SIGNAL PROCESSING (ICICSP 2020), P163, DOI 10.1109/ICICSP50920.2020.9232071
[8]  
Cornejo JYR, 2019, IEEE SYS MAN CYBERN, P3396, DOI 10.1109/SMC.2019.8914193
[9]   Speech Emotion Recognition Using Convolutional Neural Network and Long-Short Term Memory [J].
Dangol, Ranjana ;
Alsadoon, Abeer ;
Prasad, P. W. C. ;
Seher, Indra ;
Alsadoon, Omar Hisham .
MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (43-44) :32917-32934
[10]  
Hasan H. M. Mahmudul, 2020, 2020 Third International Conference on Smart Systems and Inventive Technology (ICSSIT), P1131, DOI 10.1109/ICSSIT48917.2020.9214196