Deep ensemble models for speech emotion classification

被引:4
作者
Pravin, Sheena Christabel [1 ]
Sivaraman, Vishal Balaji [2 ]
Saranya, J. [3 ]
机构
[1] Vellore Inst Technol, Sch Elect Engn SENSE, Chennai, India
[2] Univ Florida, Gainesville, FL USA
[3] Rajalakshmi Engn Coll, Thandalam, India
关键词
Deep cascaded ensemble; Deep parallel ensemble; Speech emotion classification; Memory consumption and run time complexity; RECOGNITION;
D O I
10.1016/j.micpro.2023.104790
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
This research article proposes two deep ensemble models, namely the Deep Cascaded Ensemble (DCE) and Deep Parallel Ensemble (DPE) for automatic speech emotion classification. Classification of emotions into their respective classes has so long relied on machine learning and deep learning networks. The proposed models are a blend of different machine learning and deep learning models in a cascaded/parallel architecture. The proposed models have exhibited a considerable reduction in the consumption of memory in a Google Colab environment. Furthermore, the issues of tuning numerous hyper-parameters and the huge data demand of the deep learning algorithms are overcome by the proposed deep ensemble models. The proposed DCE and DPE have yielded optimal classification accuracy with reduced consumption of memory over less data. Experimentally, the pro-posed deep cascaded ensemble has superior performance compared to the deep parallel ensemble of the same combination of deep learning and machine learning networks as well. The proposed models and the baseline models were evaluated in terms of possible performance metrics including Cohen's kappa coefficient, accuracy of classification accuracy, space and time complexity.
引用
收藏
页数:9
相关论文
共 50 条
  • [41] Speech emotion classification using fractal dimension-based features
    Tamulevicius, Gintautas
    Karbauskaite, Rasa
    Dzemyda, Gintautas
    NONLINEAR ANALYSIS-MODELLING AND CONTROL, 2019, 24 (05): : 679 - 695
  • [42] Speech Emotion Classification via a Modified Gaussian Mixture Model Approach
    Hosseini, Zeinab
    Ahadi, Seyed Mohammad
    Faraji, Neda
    2014 7TH INTERNATIONAL SYMPOSIUM ON TELECOMMUNICATIONS (IST), 2014, : 487 - 491
  • [43] Negative emotion speech classification for a six-leg rescue robot
    Han, Xiaolei
    CIVIL, ARCHITECTURE AND ENVIRONMENTAL ENGINEERING, VOLS 1 AND 2, 2017, : 1373 - 1378
  • [44] Empirical evaluation of emotion classification accuracy for non-acted speech
    Deshpande, Gauri
    Viraraghavan, Venkata Subramanian
    Duggirala, Mayuri
    Reddy, V. Ramu
    Patel, Sachin
    2017 IEEE 19TH INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2017,
  • [45] A Distributed Ensemble Machine Learning Technique for Emotion Classification from Vocal Cues
    Vijayan, Bineetha
    Soman, Gayathri
    Vivek, M. V.
    Judy, M. V.
    BIG DATA ANALYTICS, BDA 2022, 2022, 13773 : 136 - 145
  • [46] Explaining deep learning models for speech enhancement
    Sivasankaran, Sunit
    Vincent, Emmanuel
    Fohr, Dominique
    INTERSPEECH 2021, 2021, : 696 - 700
  • [47] Ensemble classification from deep predictions with test data augmentation
    Calvo-Zaragoza, Jorge
    Rico-Juan, Juan R.
    Gallego, Antonio-Javier
    SOFT COMPUTING, 2020, 24 (02) : 1423 - 1433
  • [48] Emotion Classification Using Deep Neural Networks and Emotional Patches
    Huang, Jungming
    Xu, Xiangmin
    Zhang, Tong
    2017 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2017, : 958 - 962
  • [49] Complementary models for audio-visual speech classification
    Sad, Gonzalo D.
    Terissi, Lucas D.
    Gomez, Juan C.
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2022, 25 (01) : 231 - 249
  • [50] 3DACRNN Model Based on Residual Network for Speech Emotion Classification
    Hu, Zhangfang
    Tang, Shanshan
    Luo, Yuan
    Jian, Fang
    Si, Xingtong
    ENGINEERING LETTERS, 2021, 29 (02) : 400 - 407