Deep Learning Framework for Speech Emotion Classification: A Survey of the State-of-the-Art

Cited by: 1
Authors
Akinpelu, Samson [1]
Viriri, Serestina [1]
Affiliations
[1] Univ KwaZulu Natal, Sch Math Stat & Comp Sci, ZA-4041 Durban, South Africa
Source
IEEE ACCESS | 2024 / Vol. 12
Keywords
Deep learning; Feature extraction; Convolutional neural networks; Accuracy; Surveys; Reviews; Neurons; Hidden Markov models; Computer architecture; Emotion recognition; Human computer interaction; Speech recognition; Human-computer interaction; deep learning; speech emotion recognition; convolutional neural networks; vision transformer; mel spectrogram; RECOGNITION;
DOI
10.1109/ACCESS.2024.3474553
CLC number
TP [Automation Technology, Computer Technology];
Subject classification code
0812;
Abstract
The intricate landscape of speech emotion classification poses a captivating yet challenging realm due to emotions being fundamental to human communication. In recent years, deep learning frameworks have emerged as powerful tools, shedding light on the elusive domain of emotion recognition, revolutionizing human-computer interactions, and enhancing the emotional intelligence of artificial intelligence (AI). This survey embarks on an exploratory journey into the forefront of deep learning approaches dedicated to speech emotion classification. Deep learning has become the standard approach due to the scarcity of extensive speech corpora and the need for high accuracy at low computational cost. The reason lies in its potency to extract important emotional features from large or medium-sized spectrogram images. Deep learning has been applied to speech emotion classification by many researchers, leading to significant improvements in performance and accuracy. Modern deep learning methods designed for human auditory speech emotion classification are carefully examined in this work. A thorough examination of various deep learning framework designs used in emotion classification is provided, illuminating unique characteristics that capture essential features from speech signals for accurate emotion prediction. The research critically analyzes selected deep models using well-established emotion corpora, highlighting their effectiveness. This research analyzes typical performance evaluation metrics used to evaluate speech emotion classification models. With this review, we hope to offer a comprehensive overview of the state-of-the-art, potential directions for further investigation, and developing approaches that further the field of speech emotion classification with deep learning frameworks.
Pages: 152152 - 152182
Page count: 31
Related papers
50 in total
  • [31] Music Deep Learning: Deep Learning Methods for Music Signal Processing-A Review of the State-of-the-Art
    Moysis, Lazaros
    Iliadis, Lazaros Alexios
    Sotiroudis, Sotirios P.
    Boursianis, Achilles D.
    Papadopoulou, Maria S.
    Kokkinidis, Konstantinos-Iraklis D.
    Volos, Christos
    Sarigiannidis, Panagiotis
    Nikolaidis, Spiridon
    Goudos, Sotirios K.
    IEEE ACCESS, 2023, 11 : 17031 - 17052
  • [32] Deep Learning Approaches Applied to Remote Sensing Datasets for Road Extraction: A State-Of-The-Art Review
    Abdollahi, Abolfazl
    Pradhan, Biswajeet
    Shukla, Nagesh
    Chakraborty, Subrata
    Alamri, Abdullah
    REMOTE SENSING, 2020, 12 (09)
  • [33] A comparative study of state-of-the-art deep learning architectures for rice grain classification
    Farahnakian, Farshad
    Sheikh, Javad
    Farahnakian, Fahimeh
    Heikkonen, Jukka
    JOURNAL OF AGRICULTURE AND FOOD RESEARCH, 2024, 15
  • [34] Bin Picking Approaches Based on Deep Learning Techniques: A State-of-the-Art Survey
    Cordeiro, Artur
    Rocha, Luis F.
    Costa, Carlos
    Costa, Pedro
    Silva, Manuel F.
    2022 IEEE INTERNATIONAL CONFERENCE ON AUTONOMOUS ROBOT SYSTEMS AND COMPETITIONS (ICARSC), 2022 : 110 - 117
  • [35] Deep Learning Applications for COVID-19 Analysis: A State-of-the-Art Survey
    Li, Wenqian
    Deng, Xing
    Shao, Haijian
    Wang, Xia
    CMES-COMPUTER MODELING IN ENGINEERING & SCIENCES, 2021, 129 (01) : 65 - 98
  • [36] CGBNet: A Deep Learning Framework for Compost Classification
    Gangopadhyay, Suchisrit
    Zhai, Anthony
    IEEE ACCESS, 2022, 10 : 90068 - 90078
  • [37] A state-of-the-art survey of deep learning models for automated pavement crack segmentation
    Gong, Hongren
    Liu, Liming
    Liang, Haimei
    Zhou, Yuhui
    Cong, Lin
    INTERNATIONAL JOURNAL OF TRANSPORTATION SCIENCE AND TECHNOLOGY, 2024, 13 : 44 - 57
  • [38] TSception: A Deep Learning Framework for Emotion Detection Using EEG
    Ding, Yi
    Robinson, Neethu
    Zeng, Qiuhao
    Chen, Duo
    Wai, Aung Aung Phyo
    Lee, Tih-Shih
    Guan, Cuntai
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020
  • [39] State-of-the-Art Survey on Deep Learning-Based Recommender Systems for E-Learning
    Salau, Latifat
    Hamada, Mohamed
    Prasad, Rajesh
    Hassan, Mohammed
    Mahendran, Anand
    Watanobe, Yutaka
    APPLIED SCIENCES-BASEL, 2022, 12 (23)
  • [40] Evaluating deep learning architectures for Speech Emotion Recognition
    Fayek, Haytham M.
    Lech, Margaret
    Cavedon, Lawrence
    NEURAL NETWORKS, 2017, 92 : 60 - 68