Deep Learning Framework for Speech Emotion Classification: A Survey of the State-of-the-Art

Cited by: 1
Authors
Akinpelu, Samson [1]
Viriri, Serestina [1]
Affiliations
[1] Univ KwaZulu Natal, Sch Math Stat & Comp Sci, ZA-4041 Durban, South Africa
Source
IEEE ACCESS | 2024 / Vol. 12
Keywords
Deep learning; Feature extraction; Convolutional neural networks; Accuracy; Surveys; Reviews; Neurons; Hidden Markov models; Computer architecture; Emotion recognition; Human computer interaction; Speech recognition; Human-computer interaction; deep learning; speech emotion recognition; convolutional neural networks; vision transformer; mel spectrogram; RECOGNITION;
DOI
10.1109/ACCESS.2024.3474553
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
The intricate landscape of speech emotion classification poses a captivating yet challenging realm due to emotions being fundamental to human communication. In recent years, deep learning frameworks have emerged as powerful tools, shedding light on the elusive domain of emotion recognition, revolutionizing human-computer interactions, and enhancing the emotional intelligence of artificial intelligence (AI). This survey embarks on an exploratory journey into the forefront of deep learning approaches dedicated to speech emotion classification. Deep learning has become the standard approach due to the scarcity of extensive speech corpora and the need for high accuracy at low computational cost. The reason lies in its potency to extract important emotional features from large or medium-sized spectrogram images. Deep learning has been applied to speech emotion classification by many researchers, leading to significant improvements in performance and accuracy. Modern deep learning methods designed for human auditory speech emotion classification are carefully examined in this work. A thorough examination of various deep learning framework designs used in emotion classification is provided, illuminating unique characteristics that capture essential features from speech signals for accurate emotion prediction. The research critically analyzes selected deep models using well-established emotion corpora, highlighting their effectiveness. This research analyses typical performance evaluation metrics used to evaluate speech emotion classification models. With this review, we hope to offer a comprehensive overview of the state-of-the-art, potential directions for further investigation, and developing approaches that further the field of speech emotion classification with deep learning frameworks.
Pages: 152152 - 152182
Page count: 31
Related papers
50 in total
  • [21] Kidney Tumor Semantic Segmentation Using Deep Learning: A Survey of State-of-the-Art
    Abdelrahman, Abubaker
    Viriri, Serestina
    JOURNAL OF IMAGING, 2022, 8 (03)
  • [22] Speech Emotion Recognition Using Deep Neural Networks, Transfer Learning, and Ensemble Classification Techniques
    Mihalache, Serban
    Burileanu, Dragos
    ROMANIAN JOURNAL OF INFORMATION SCIENCE AND TECHNOLOGY, 2023, 26 (3-4) : 375 - 387
  • [23] Deep learning-based classification, detection, and segmentation of tomato leaf diseases: A state-of-the-art review
    Das, Aritra
    Pathan, Fahad
    Jim, Jamin Rahman
    Kabir, Md Mohsin
    Mridha, M. F.
    ARTIFICIAL INTELLIGENCE IN AGRICULTURE, 2025, 15 (02) : 192 - 220
  • [24] Speech Emotion Classification Using Deep Learning
    Mishra, Siba Prasad
    Warule, Pankaj
    Deb, Suman
    PROCEEDINGS OF 27TH INTERNATIONAL SYMPOSIUM ON FRONTIERS OF RESEARCH IN SPEECH AND MUSIC, FRSM 2023, 2024, 1455 : 19 - 31
  • [25] State-of-the-Art Versus Deep Learning: A Comparative Study of Motor Imagery Decoding Techniques
    George, Olawunmi
    Dabas, Sarthak
    Sikder, Abdur
    Smith, Roger O.
    Madiraju, Praveen
    Yahyasoltani, Nasim
    Ahamed, Sheikh Iqbal
    IEEE ACCESS, 2022, 10 : 45605 - 45619
  • [26] State-of-the-Art Analysis of Deep Learning-Based Monaural Speech Source Separation Techniques
    Soni, Swati
    Yadav, Ram Narayan
    Gupta, Lalita
    IEEE ACCESS, 2023, 11 : 4242 - 4269
  • [27] Deep Learning for Visual Speech Analysis: A Survey
    Sheng, Changchong
    Kuang, Gangyao
    Bai, Liang
    Hou, Chenping
    Guo, Yulan
    Xu, Xin
    Pietikainen, Matti
    Liu, Li
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (09) : 6001 - 6022
  • [28] Review the state-of-the-art technologies of semantic segmentation based on deep learning
    Mo, Yujian
    Wu, Yan
    Yang, Xinneng
    Liu, Feilin
    Liao, Yujun
    NEUROCOMPUTING, 2022, 493 : 626 - 646
  • [29] A Survey of Deep Learning for Lung Disease Detection on Medical Images: State-of-the-Art, Taxonomy, Issues and Future Directions
    Kieu, Stefanus Tao Hwa
    Bade, Abdullah
    Hijazi, Mohd Hanafi Ahmad
    Kolivand, Hoshang
    JOURNAL OF IMAGING, 2020, 6 (12)
  • [30] State-of-the-Art Deep Learning Algorithms for Internet of Things-Based Detection of Crop Pests and Diseases: A Comprehensive Review
    Nyakuri, Jean Pierre
    Nkundineza, Celestin
    Gatera, Omar
    Nkurikiyeyezu, Kizito
    IEEE ACCESS, 2024, 12 : 169824 - 169849