Emotion recognition from speech: a review

被引:183
|
作者
Koolagudi, Shashidhar G. [1 ]
Rao, K. Sreenivasa [1 ]
机构
[1] Indian Inst Technol Kharagpur, Sch Informat Technol, Kharagpur 721302, W Bengal, India
关键词
Emotion recognition; Simulated emotional speech corpus; Elicited speech corpus; Natural speech corpus; Excitation source features; System features; Prosodic features; Classification models;
D O I
10.1007/s10772-011-9125-1
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Emotion recognition from speech has emerged as an important research area in the recent past. In this regard, review of existing work on emotional speech processing is useful for carrying out further research. In this paper, the recent literature on speech emotion recognition has been presented considering the issues related to emotional speech corpora, different types of speech features and models used for recognition of emotions from speech. Thirty two representative speech databases are reviewed in this work from point of view of their language, number of speakers, number of emotions, and purpose of collection. The issues related to emotional speech databases used in emotional speech recognition are also briefly discussed. Literature on different features used in the task of emotion recognition from speech is presented. The importance of choosing different classification models has been discussed along with the review. The important issues to be considered for further emotion recognition research in general and in specific to the Indian context have been highlighted where ever necessary.
引用
收藏
页码:99 / 117
页数:19
相关论文
共 50 条
  • [1] Databases, features and classifiers for speech emotion recognition: a review
    Swain, Monorama
    Routray, Aurobinda
    Kabisatpathy, P.
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2018, 21 (01) : 93 - 120
  • [2] Emotion recognition from speech using source, system, and prosodic features
    Koolagudi, Shashidhar G.
    Rao, K. Sreenivasa
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2012, 15 (02) : 265 - 289
  • [3] A review on emotion recognition from dialect speech using feature optimization and classification techniques
    Thimmaiah, Sunil
    Vinay, N. A.
    Ravikumar, M. G.
    Prasad, S. R.
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (29) : 73793 - 73793
  • [4] A Comprehensive Review of Speech Emotion Recognition Systems
    Wani, Taiba Majid
    Gunawan, Teddy Surya
    Qadri, Syed Asif Ahmad
    Kartiwi, Mira
    Ambikairajah, Eliathamby
    IEEE ACCESS, 2021, 9 : 47795 - 47814
  • [5] Emotion Recognition Through Analysis of Speech - A Review
    Poyraz, Rasim Atakan
    Suvarna, Prajyot
    Iliev, Alexander I.
    DIGITAL PRESENTATION AND PRESERVATION OF CULTURAL AND SCIENTIFIC HERITAGE, 2024, 14 : 227 - 238
  • [6] Robust recognition of emotion from speech
    Hoque, Mohammed E.
    Yeasin, Mohammed
    Louwerse, Max M.
    INTELLIGENT VIRTUAL AGENTS, PROCEEDINGS, 2006, 4133 : 42 - 53
  • [7] Emotion Recognition from Speech: A Survey
    Drakopoulos, Georgios
    Pikramenos, George
    Spyrou, Evaggelos
    Perantonis, Stavros J.
    WEBIST: PROCEEDINGS OF THE 15TH INTERNATIONAL CONFERENCE ON WEB INFORMATION SYSTEMS AND TECHNOLOGIES, 2019, : 432 - 439
  • [8] Emotion Recognition from Speech Signal
    Ramdinmawii, Esther
    Mohanta, Abhijit
    Mittal, Vinay Kumar
    TENCON 2017 - 2017 IEEE REGION 10 CONFERENCE, 2017, : 1562 - 1567
  • [9] Emotion Recognition from Speech Signals using Excitation Source and Spectral Features
    Choudhury, Akash Roy
    Ghosh, Anik
    Pandey, Rahul
    Barman, Subhas
    PROCEEDINGS OF 2018 IEEE APPLIED SIGNAL PROCESSING CONFERENCE (ASPCON), 2018, : 257 - 261
  • [10] Biologically inspired emotion recognition from speech
    Laura Caponetti
    Cosimo Alessandro Buscicchio
    Giovanna Castellano
    EURASIP Journal on Advances in Signal Processing, 2011