Deep ensemble models for speech emotion classification

Cited by: 4
Authors
Pravin, Sheena Christabel [1]
Sivaraman, Vishal Balaji [2]
Saranya, J. [3]
Affiliations
[1] Vellore Inst Technol, Sch Elect Engn SENSE, Chennai, India
[2] Univ Florida, Gainesville, FL USA
[3] Rajalakshmi Engn Coll, Thandalam, India
Keywords
Deep cascaded ensemble; Deep parallel ensemble; Speech emotion classification; Memory consumption and run time complexity; RECOGNITION;
DOI
10.1016/j.micpro.2023.104790
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
This research article proposes two deep ensemble models, the Deep Cascaded Ensemble (DCE) and the Deep Parallel Ensemble (DPE), for automatic speech emotion classification. Emotion classification has long relied on machine learning and deep learning networks. The proposed models blend different machine learning and deep learning models in a cascaded or parallel architecture. In a Google Colab environment, they exhibited a considerable reduction in memory consumption. The proposed deep ensemble models also overcome two issues of deep learning algorithms: the tuning of numerous hyper-parameters and the demand for large amounts of data. The proposed DCE and DPE yielded optimal classification accuracy with reduced memory consumption on less data. Experimentally, the proposed deep cascaded ensemble outperformed the deep parallel ensemble built from the same combination of deep learning and machine learning networks. The proposed models and the baseline models were evaluated on performance metrics including Cohen's kappa coefficient, classification accuracy, and space and time complexity.
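As an illustration of one metric named in the abstract, the following is a minimal sketch of Cohen's kappa coefficient, which corrects raw agreement between true and predicted labels for the agreement expected by chance: kappa = (p_o - p_e) / (1 - p_e). The function name `cohens_kappa` and the emotion labels in the usage note are hypothetical and are not taken from the article; this is not the authors' evaluation code.

```python
from collections import Counter


def cohens_kappa(y_true, y_pred):
    """Cohen's kappa between two equally long label sequences (illustrative helper)."""
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("label sequences must be non-empty and of equal length")
    n = len(y_true)
    # Observed agreement p_o: fraction of samples where both labels match.
    observed = sum(t == p for t, p in zip(y_true, y_pred)) / n
    # Chance agreement p_e: sum over classes of the product of marginal frequencies.
    true_counts = Counter(y_true)
    pred_counts = Counter(y_pred)
    expected = sum(true_counts[c] * pred_counts[c] for c in true_counts) / (n * n)
    if expected == 1.0:
        # Both sequences contain a single identical class; kappa is undefined.
        return float("nan")
    return (observed - expected) / (1.0 - expected)
```

For example, with hypothetical true labels `["angry", "happy", "sad", "happy", "angry", "sad"]` and predictions `["angry", "happy", "sad", "sad", "angry", "sad"]`, the observed agreement is 5/6, the chance agreement is 1/3, and the function returns 0.75.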
Pages: 9
Related papers
50 in total
  • [1] Speech Emotion Classification Using Deep Learning
    Mishra, Siba Prasad
    Warule, Pankaj
    Deb, Suman
    PROCEEDINGS OF 27TH INTERNATIONAL SYMPOSIUM ON FRONTIERS OF RESEARCH IN SPEECH AND MUSIC, FRSM 2023, 2024, 1455 : 19 - 31
  • [2] A Hybrid Deep Ensemble for Speech Disfluency Classification
    Pravin, Sheena Christabel
    Palanivelan, M.
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2021, 40 (08) : 3968 - 3995
  • [3] Speech Based Multiple Emotion Classification Model Using Deep Learning
    Patneedi, Shakti Swaroop
    Kumari, Nandini
    ADVANCES IN COMPUTING AND DATA SCIENCES, PT I, 2021, 1440 : 648 - 659
  • [5] Emotion Profile Refinery for Speech Emotion Classification
    Mao, Shuiyang
    Ching, P. C.
    Lee, Tan
    INTERSPEECH 2020, 2020, : 531 - 535
  • [6] Exploration of Phase Information for Speech Emotion Classification
    Deb, Suman
    Dandapat, S.
    2017 TWENTY-THIRD NATIONAL CONFERENCE ON COMMUNICATIONS (NCC), 2017,
  • [7] Loss-Scaled Large-Margin Gaussian Mixture Models for Speech Emotion Classification
    Yun, Sungrack
    Yoo, Chang D.
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (02) : 585 - 598
  • [8] Citrus pests classification using an ensemble of deep learning models
    Khanramaki, Morteza
    Asli-Ardeh, Ezzatollah Askari
    Kozegar, Ehsan
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2021, 186
  • [9] Deep Learning Framework for Speech Emotion Classification: A Survey of the State-of-the-Art
    Akinpelu, Samson
    Viriri, Serestina
    IEEE ACCESS, 2024, 12 : 152152 - 152182
  • [10] Real Life Emotion Classification from Speech Using Gaussian Mixture Models
    Koolagudi, Shashidhar G.
    Barthwal, Anurag
    Devliyal, Swati
    Rao, K. Sreenivasa
    CONTEMPORARY COMPUTING, 2012, 306 : 250 - +