Emotion recognition from speech signals using digital features optimization by diversity measure fusion

Cited by: 0

Authors:

Konduru, Ashok Kumar [1]

Iqbal, J. L. Mazher [2]

Affiliations:

[1] Veltech Rangarajan Dr Sagunthala R&D Inst Sci & T, Chennai, Tamil Nadu, India

[2] Veltech Rangarajan Dr Sagunthala R&D Inst Sci & T, ECE, Chennai, Tamil Nadu, India

Source:

JOURNAL OF INTELLIGENT & FUZZY SYSTEMS | 2024, Vol. 46, No. 1

Keywords:

Hidden Markov model; emotion detection; speech signal; artificial intelligence; cuckoo search; distributed diversity measures; feature selection; algorithm; networks

DOI:

10.3233/JIFS-231263

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

Emotion recognition from speech signals plays a crucial role in human-computer interaction and behavioral studies. The task, however, presents significant challenges due to the high dimensionality and noisy nature of speech data. This article presents a comprehensive study and analysis of a novel approach, "Digital Features Optimization by Diversity Measure Fusion (DFOFDM)", aimed at addressing these challenges. The paper begins by explaining the need for improved emotion recognition methods, followed by a detailed introduction to DFOFDM. This approach employs acoustic and spectral features from speech signals, coupled with an optimized feature selection process using a fusion of diversity measures. The study's central method involves a Cuckoo Search-based classification strategy tailored to this multi-label problem. The performance of the proposed DFOFDM approach is evaluated extensively. Emotion labels such as 'Angry', 'Happy', and 'Neutral' showed precision above 92%, while other emotions fell within the range of 87% to 90%. Recall was similar, with most emotions falling within the 90% to 95% range, and the F-score reflected comparable statistics for each label. Notably, the DFOFDM model showed resilience to label imbalance and noise in speech data, which is crucial for real-world applications. Compared with a contemporary model, "Transfer Subspace Learning by Least Square Loss (TSLSL)", DFOFDM displayed superior results across various evaluation metrics, indicating a promising improvement in speech emotion recognition. In terms of computational complexity, DFOFDM scaled effectively, providing a feasible solution for large-scale applications. Despite its effectiveness, the study acknowledges potential limitations of DFOFDM that might affect its performance on certain types of real-world data. The findings underline the potential of DFOFDM for advancing emotion recognition techniques and indicate the need for further research.
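
The per-label precision, recall, and F-score figures quoted in the abstract can be illustrated with a minimal sketch. This is not the authors' evaluation code: the function name `per_label_scores`, the toy labels, and the simplification to one predicted emotion per utterance are all assumptions made for illustration, and the numbers it produces have no relation to the paper's results.

```python
from collections import Counter


def per_label_scores(y_true, y_pred):
    """Return {label: (precision, recall, f_score)}.

    Assumes exactly one true and one predicted label per utterance
    (an illustrative simplification, not the paper's protocol).
    """
    tp, fp, fn = Counter(), Counter(), Counter()
    for truth, guess in zip(y_true, y_pred):
        if truth == guess:
            tp[truth] += 1   # correct prediction for this label
        else:
            fp[guess] += 1   # predicted label was wrong
            fn[truth] += 1   # true label was missed

    scores = {}
    for label in sorted(set(y_true) | set(y_pred)):
        predicted = tp[label] + fp[label]
        actual = tp[label] + fn[label]
        precision = tp[label] / predicted if predicted else 0.0
        recall = tp[label] / actual if actual else 0.0
        denom = precision + recall
        f_score = 2 * precision * recall / denom if denom else 0.0
        scores[label] = (precision, recall, f_score)
    return scores


# Hypothetical toy data, chosen only to exercise the function.
y_true = ["Angry", "Angry", "Happy", "Happy", "Neutral", "Neutral"]
y_pred = ["Angry", "Happy", "Happy", "Happy", "Neutral", "Angry"]
scores = per_label_scores(y_true, y_pred)

for label, (p, r, f) in scores.items():
    print(f"{label}: precision={p:.2f} recall={r:.2f} F={f:.2f}")
```

Reporting the three metrics per label, rather than as a single average, is what makes label imbalance visible: a rare emotion with poor recall cannot be hidden by strong results on frequent ones.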

Pages: 2547-2572

Page count: 26
