Deep ensemble models for speech emotion classification

被引：4

作者：

Pravin, Sheena Christabel ^{[1
]}

Sivaraman, Vishal Balaji ^{[2
]}

Saranya, J. ^{[3
]}

机构：

[1] Vellore Inst Technol, Sch Elect Engn SENSE, Chennai, India

[2] Univ Florida, Gainesville, FL USA

[3] Rajalakshmi Engn Coll, Thandalam, India

来源：

MICROPROCESSORS AND MICROSYSTEMS | 2023年 / 98卷

关键词：

Deep cascaded ensemble; Deep parallel ensemble; Speech emotion classification; Memory consumption and run time complexity; RECOGNITION;

D O I：

10.1016/j.micpro.2023.104790

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

This research article proposes two deep ensemble models, namely the Deep Cascaded Ensemble (DCE) and Deep Parallel Ensemble (DPE) for automatic speech emotion classification. Classification of emotions into their respective classes has so long relied on machine learning and deep learning networks. The proposed models are a blend of different machine learning and deep learning models in a cascaded/parallel architecture. The proposed models have exhibited a considerable reduction in the consumption of memory in a Google Colab environment. Furthermore, the issues of tuning numerous hyper-parameters and the huge data demand of the deep learning algorithms are overcome by the proposed deep ensemble models. The proposed DCE and DPE have yielded optimal classification accuracy with reduced consumption of memory over less data. Experimentally, the pro-posed deep cascaded ensemble has superior performance compared to the deep parallel ensemble of the same combination of deep learning and machine learning networks as well. The proposed models and the baseline models were evaluated in terms of possible performance metrics including Cohen's kappa coefficient, accuracy of classification accuracy, space and time complexity.

引用

页数：9

共 50 条

[41] Speech emotion classification using fractal dimension-based features
Tamulevicius, Gintautas
Karbauskaite, Rasa
Dzemyda, Gintautas
NONLINEAR ANALYSIS-MODELLING AND CONTROL, 2019, 24 (05): : 679 - 695
[42] Speech Emotion Classification via a Modified Gaussian Mixture Model Approach
Hosseini, Zeinab
Ahadi, Seyed Mohammad
Faraji, Neda
2014 7TH INTERNATIONAL SYMPOSIUM ON TELECOMMUNICATIONS (IST), 2014, : 487 - 491
[43] Negative emotion speech classification for a six-leg rescue robot
Han, Xiaolei
CIVIL, ARCHITECTURE AND ENVIRONMENTAL ENGINEERING, VOLS 1 AND 2, 2017, : 1373 - 1378
[44] Empirical evaluation of emotion classification accuracy for non-acted speech
Deshpande, Gauri
Viraraghavan, Venkata Subramanian
Duggirala, Mayuri
Reddy, V. Ramu
Patel, Sachin
2017 IEEE 19TH INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2017,
[45] A Distributed Ensemble Machine Learning Technique for Emotion Classification from Vocal Cues
Vijayan, Bineetha
Soman, Gayathri
Vivek, M. V.
Judy, M. V.
BIG DATA ANALYTICS, BDA 2022, 2022, 13773 : 136 - 145
[46] Explaining deep learning models for speech enhancement
Sivasankaran, Sunit
Vincent, Emmanuel
Fohr, Dominique
INTERSPEECH 2021, 2021, : 696 - 700
[47] Ensemble classification from deep predictions with test data augmentation
Calvo-Zaragoza, Jorge
Rico-Juan, Juan R.
Gallego, Antonio-Javier
SOFT COMPUTING, 2020, 24 (02) : 1423 - 1433
[48] Emotion Classification Using Deep Neural Networks and Emotional Patches
Huang, Jungming
Xu, Xiangmin
Zhang, Tong
2017 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2017, : 958 - 962
[49] Complementary models for audio-visual speech classification
Sad, Gonzalo D.
Terissi, Lucas D.
Gomez, Juan C.
INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2022, 25 (01) : 231 - 249
[50] 3DACRNN Model Based on Residual Network for Speech Emotion Classification
Hu, Zhangfang
Tang, Shanshan
Luo, Yuan
Jian, Fang
Si, Xingtong
ENGINEERING LETTERS, 2021, 29 (02) : 400 - 407

← 1 2 3 4 5 →