Deep ensemble models for speech emotion classification

Cited by: 4

Authors:

Pravin, Sheena Christabel [1]

Sivaraman, Vishal Balaji [2]

Saranya, J. [3]

Affiliations:

[1] Vellore Inst Technol, Sch Elect Engn SENSE, Chennai, India

[2] Univ Florida, Gainesville, FL USA

[3] Rajalakshmi Engn Coll, Thandalam, India

Source:

MICROPROCESSORS AND MICROSYSTEMS | 2023 / Vol. 98

Keywords:

Deep cascaded ensemble; Deep parallel ensemble; Speech emotion classification; Memory consumption and run time complexity; RECOGNITION;

DOI:

10.1016/j.micpro.2023.104790

Chinese Library Classification (CLC) number:

TP3 [Computing technology, computer technology];

Discipline classification code:

0812 ;

Abstract:

This research article proposes two deep ensemble models, the Deep Cascaded Ensemble (DCE) and the Deep Parallel Ensemble (DPE), for automatic speech emotion classification. Classification of emotions into their respective classes has long relied on machine learning and deep learning networks. The proposed models blend different machine learning and deep learning models in a cascaded or parallel architecture. They exhibited a considerable reduction in memory consumption in a Google Colab environment. Furthermore, the proposed deep ensemble models overcome two issues of deep learning algorithms: the tuning of numerous hyper-parameters and the demand for huge amounts of data. The proposed DCE and DPE yielded optimal classification accuracy with reduced memory consumption on less data. Experimentally, the proposed deep cascaded ensemble outperformed the deep parallel ensemble built from the same combination of deep learning and machine learning networks. The proposed models and the baseline models were evaluated on performance metrics including Cohen's kappa coefficient, classification accuracy, and space and time complexity.
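Cohen's kappa, one of the metrics named in the abstract, corrects raw accuracy for the agreement expected by chance: kappa = (p_o - p_e) / (1 - p_e), where p_o is the observed agreement and p_e the chance agreement computed from the label frequencies. The sketch below is a minimal illustration of that standard formula, not the authors' evaluation code; the function name `cohens_kappa` and the emotion labels are hypothetical and are not taken from the paper.

```python
from collections import Counter
import math


def cohens_kappa(y_true, y_pred):
    """Cohen's kappa between reference labels and predicted labels."""
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("label sequences must be non-empty and of equal length")
    n = len(y_true)
    # Observed agreement: fraction of items where both label sequences match.
    p_o = sum(t == p for t, p in zip(y_true, y_pred)) / n
    # Chance agreement: product of the marginal label frequencies, summed over classes.
    true_counts = Counter(y_true)
    pred_counts = Counter(y_pred)
    p_e = sum(true_counts[c] * pred_counts[c] for c in true_counts) / (n * n)
    if p_e == 1.0:
        # Both sequences use a single identical class; kappa is undefined here.
        return math.nan
    return (p_o - p_e) / (1.0 - p_e)


# Hypothetical labels, for illustration only.
reference = ["angry", "angry", "happy", "happy"]
predicted = ["angry", "happy", "happy", "happy"]
kappa = cohens_kappa(reference, predicted)
```

In this example the accuracy is 0.75 and the chance agreement is 0.5, so kappa is 0.5. A model can score high accuracy on imbalanced emotion classes while its kappa stays low.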


Pages: 9
