Emotion recognition from speech: a review

被引:183
|
作者
Koolagudi, Shashidhar G. [1 ]
Rao, K. Sreenivasa [1 ]
机构
[1] Indian Inst Technol Kharagpur, Sch Informat Technol, Kharagpur 721302, W Bengal, India
关键词
Emotion recognition; Simulated emotional speech corpus; Elicited speech corpus; Natural speech corpus; Excitation source features; System features; Prosodic features; Classification models;
D O I
10.1007/s10772-011-9125-1
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Emotion recognition from speech has emerged as an important research area in the recent past. In this regard, review of existing work on emotional speech processing is useful for carrying out further research. In this paper, the recent literature on speech emotion recognition has been presented considering the issues related to emotional speech corpora, different types of speech features and models used for recognition of emotions from speech. Thirty two representative speech databases are reviewed in this work from point of view of their language, number of speakers, number of emotions, and purpose of collection. The issues related to emotional speech databases used in emotional speech recognition are also briefly discussed. Literature on different features used in the task of emotion recognition from speech is presented. The importance of choosing different classification models has been discussed along with the review. The important issues to be considered for further emotion recognition research in general and in specific to the Indian context have been highlighted where ever necessary.
引用
收藏
页码:99 / 117
页数:19
相关论文
共 50 条
  • [41] A Hierarchical Approach with Feature Selection for Emotion Recognition from Speech
    Giannoulis, Panagiotis
    Potamianos, Gerasimos
    LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2012, : 1203 - 1206
  • [42] Application of Vector Quantization in Emotion Recognition from Human Speech
    Khanna, Preeti
    Kumar, M. Sasi
    INFORMATION INTELLIGENCE, SYSTEMS, TECHNOLOGY AND MANAGEMENT, 2011, 141 : 118 - +
  • [43] Emotion Recognition from Speech using Prosodic and Linguistic Features
    Pervaiz, Mahwish
    Khan, Tamim Ahmed
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2016, 7 (08) : 84 - 90
  • [44] Arabic Speech Emotion Recognition From Saudi Dialect Corpus
    Aljuhani, Reem Hamed
    Alshutayri, Areej
    Alahdal, Shahd
    IEEE ACCESS, 2021, 9 : 127081 - 127085
  • [45] Acoustic feature selection for automatic emotion recognition from speech
    Rong, Jia
    Li, Gang
    Chen, Yi-Ping Phoebe
    INFORMATION PROCESSING & MANAGEMENT, 2009, 45 (03) : 315 - 328
  • [46] Enhancing Emotion Recognition from Speech through Feature Selection
    Kostoulas, Theodoros
    Ganchev, Todor
    Lazaridis, Alexandros
    Fakotakis, Nikos
    TEXT, SPEECH AND DIALOGUE, 2010, 6231 : 338 - 344
  • [47] Emotion recognition and school violence detection from children speech
    Han, Tian
    Zhang, Jincheng
    Zhang, Zhu
    Sun, Guobing
    Ye, Liang
    Ferdinando, Hany
    Alasaarela, Esko
    Seppanen, Tapio
    Yu, Xiaoyang
    Yang, Shuchang
    EURASIP JOURNAL ON WIRELESS COMMUNICATIONS AND NETWORKING, 2018,
  • [48] Speech Emotion Recognition using DWT
    Lalitha, S.
    Mudupu, Anoop
    Nandyala, Bala Visali
    Munagala, Renuka
    2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMPUTING RESEARCH (ICCIC), 2015, : 20 - 23
  • [49] Speech Emotion Recognition: A Comprehensive Survey
    Mohammed Jawad Al-Dujaili
    Abbas Ebrahimi-Moghadam
    Wireless Personal Communications, 2023, 129 : 2525 - 2561
  • [50] The Performance of the Speaking Rate Parameter in Emotion Recognition from Speech
    Philippou-Huebner, David
    Vlasenko, Bogdan
    Boeck, Ronald
    Wendemuth, Andreas
    2012 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO WORKSHOPS (ICMEW), 2012, : 296 - 301