Emotion recognition from speech signals using digital features optimization by diversity measure fusion

被引：0

作者：

Konduru, Ashok Kumar ^{[1
]}

Iqbal, J. L. Mazher ^{[2
]}

机构：

[1] Veltech Rangarajan Dr Sagunthala R&D Inst Sci & T, Chennai, Tamil Nadu, India

[2] Veltech Rangarajan Dr Sagunthala R&D Inst Sci & T, ECE, Chennai, Tamil Nadu, India

来源：

JOURNAL OF INTELLIGENT & FUZZY SYSTEMS | 2024年 / 46卷 / 01期

关键词：

Hidden markov model; emotion detection; speech signal; artificial intelligence; cuckoo search; distributed diversity measures; FEATURE-SELECTION; ALGORITHM; NETWORKS;

D O I：

10.3233/JIFS-231263

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Emotion recognition from speech signals serves a crucial role in human-computer interaction and behavioral studies. The task, however, presents significant challenges due to the high dimensionality and noisy nature of speech data. This article presents a comprehensive study and analysis of a novel approach, "Digital Features Optimization by Diversity Measure Fusion (DFOFDM)", aimed at addressing these challenges. The paper begins by elucidating the necessity for improved emotion recognition methods, followed by a detailed introduction to DFOFDM. This approach employs acoustic and spectral features from speech signals, coupled with an optimized feature selection process using a fusion of diversity measures. The study's central method involves a Cuckoo Search-based classification strategy, which is tailored for this multi-label problem. The performance of the proposed DFOFDM approach is evaluated extensively. Emotion labels such as 'Angry', 'Happy', and 'Neutral' showed a precision rate over 92%, while other emotions fell within the range of 87% to 90%. Similar performance was observed in terms of recall, with most emotions falling within the 90% to 95% range. The F-Score, another crucial metric, also reflected comparable statistics for each label. Notably, the DFOFDM model showed resilience to label imbalances and noise in speech data, crucial for real-world applications. When compared with a contemporary model, "Transfer Subspace Learning by Least Square Loss (TSLSL)", DFOFDM displayed superior results across various evaluation metrics, indicating a promising improvement in the field of speech emotion recognition. In terms of computational complexity, DFOFDM demonstrated effective scalability, providing a feasible solution for large-scale applications. Despite its effectiveness, the study acknowledges the potential limitations of the DFOFDM, which might influence its performance on certain types of real-world data. The findings underline the potential of DFOFDM in advancing emotion recognition techniques, indicating the necessity for further research.

引用

页码：2547 / 2572

页数：26

共 50 条

[41] Speech Emotion Recognition by Late Fusion for Bidirectional Reservoir Computing With Random Projection
Ibrahim, Hemin
Loo, Chu Kiong
Alnajjar, Fady
IEEE ACCESS, 2021, 9 : 122855 - 122871
[42] Emotion Recognition from Speech: An Unsupervised Learning Approach
Rovetta, Stefano
Mnasri, Zied
Masulli, Francesco
Cabri, Alberto
INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2021, 14 (01) : 23 - 35
[43] Fine-grained emotion recognition: fusion of physiological signals and facial expressions on spontaneous emotion corpus
Setiawan, Feri
Prabono, Aria Ghora
Khowaja, Sunder Ali
Kim, Wangsoo
Park, Kyoungsoo
Yahya, Bernardo Nugroho
Lee, Seok-Lyong
Hong, Jin Pyo
INTERNATIONAL JOURNAL OF AD HOC AND UBIQUITOUS COMPUTING, 2020, 35 (03) : 162 - 178
[44] A robust feature selection method based on meta-heuristic optimization for speech emotion recognition
Bagadi, Kesava Rao
Sivappagari, Chandra Mohan Reddy
EVOLUTIONARY INTELLIGENCE, 2024, 17 (02) : 993 - 1004
[45] A Hybrid Meta-Heuristic Feature Selection Method Using Golden Ratio and Equilibrium Optimization Algorithms for Speech Emotion Recognition
Dey, Arijit
Chattopadhyay, Soham
Singh, Pawan Kumar
Ahmadian, Ali
Ferrara, Massimiliano
Sarkar, Ram
IEEE ACCESS, 2020, 8 : 200953 - 200970
[46] In-depth investigation of speech emotion recognition studies from past to present -The importance of emotion recognition from speech signal for AI-
Sonmez, Yesimim uLGEN
Varol, Asaf
INTELLIGENT SYSTEMS WITH APPLICATIONS, 2024, 22
[47] Utilizing Computer Vision Algorithms to Detect and Describe Local Features in Images for Emotion Recognition from Speech
Weisskirchen, Norman
Reddy, Mainampati Vasudeva
Wendemuth, Andreas
Siegert, Ingo
PROCEEDINGS OF THE 2020 IEEE INTERNATIONAL CONFERENCE ON HUMAN-MACHINE SYSTEMS (ICHMS), 2020, : 428 - 433
[48] Equilibrium Optimizer for Emotion Classification From English Speech Signals
Yue, Liya
Hu, Pei
Chu, Shu-Chuan
Pan, Jeng-Shyang
IEEE ACCESS, 2023, 11 : 134217 - 134229
[49] Low-Order Multi-Level Features for Speech Emotion Recognition
Tamulevicius, Gintautas
Liogiene, Tatjana
BALTIC JOURNAL OF MODERN COMPUTING, 2015, 3 (04): : 234 - 247
[50] Recognition of Human Emotion from a Speech Signal Based on Plutchik's Model
Kaminska, Dorota
Pelikant, Adam
INTERNATIONAL JOURNAL OF ELECTRONICS AND TELECOMMUNICATIONS, 2012, 58 (02) : 165 - 170

← 1 2 3 4 5 →