Speech emotion recognition: Emotional models, databases, features, preprocessing methods, supporting modalities, and classifiers

Cited by: 382
Authors
Akcay, Mehmet Berkehan [1]
Oguz, Kaya [2]
Affiliations
[1] Izmir Univ Econ, Dept Software Engn, Izmir, Turkey
[2] Izmir Univ Econ, Dept Comp Engn, Izmir, Turkey
Keywords
Speech emotion recognition; Survey; Speech features; Classification; Speech databases; COMMUNICATING EMOTION; SPECTRAL FEATURES; NEURAL-NETWORKS; VOICE QUALITY; CLASSIFICATION; EXPRESSION; VALENCE; ADVERSARIAL; AROUSAL; AUDIO
DOI
10.1016/j.specom.2019.12.001
Chinese Library Classification
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
Speech is the most natural way of expressing ourselves as humans. It is only natural then to extend this communication medium to computer applications. We define speech emotion recognition (SER) systems as a collection of methodologies that process and classify speech signals to detect the embedded emotions. SER is not a new field; it has been around for over two decades and has regained attention thanks to recent advancements. These novel studies make use of advances in all fields of computing and technology, making it necessary to have an update on the current methodologies and techniques that make SER possible. We have identified and discussed distinct areas of SER, provided a detailed survey of the current literature on each, and listed the current challenges.
Pages: 56-76
Page count: 21