Emotion recognition from speech signals using digital features optimization by diversity measure fusion

Authors
Konduru, Ashok Kumar [1]
Iqbal, J. L. Mazher [2]
Affiliations
[1] Veltech Rangarajan Dr Sagunthala R&D Inst Sci & T, Chennai, Tamil Nadu, India
[2] Veltech Rangarajan Dr Sagunthala R&D Inst Sci & T, ECE, Chennai, Tamil Nadu, India
Keywords
Hidden Markov model; emotion detection; speech signal; artificial intelligence; cuckoo search; distributed diversity measures; FEATURE-SELECTION; ALGORITHM; NETWORKS
DOI
10.3233/JIFS-231263
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Emotion recognition from speech signals plays a crucial role in human-computer interaction and behavioral studies. The task, however, presents significant challenges due to the high dimensionality and noisy nature of speech data. This article presents a comprehensive study and analysis of a novel approach, "Digital Features Optimization by Diversity Measure Fusion (DFOFDM)", aimed at addressing these challenges. The paper begins by elucidating the necessity for improved emotion recognition methods, followed by a detailed introduction to DFOFDM. This approach employs acoustic and spectral features from speech signals, coupled with an optimized feature selection process using a fusion of diversity measures. The study's central method involves a Cuckoo Search-based classification strategy, which is tailored for this multi-label problem. The performance of the proposed DFOFDM approach is evaluated extensively. Emotion labels such as 'Angry', 'Happy', and 'Neutral' showed a precision rate over 92%, while other emotions fell within the range of 87% to 90%. Similar performance was observed in terms of recall, with most emotions falling within the 90% to 95% range. The F-Score, another crucial metric, also reflected comparable statistics for each label. Notably, the DFOFDM model showed resilience to label imbalances and noise in speech data, which is crucial for real-world applications. When compared with a contemporary model, "Transfer Subspace Learning by Least Square Loss (TSLSL)", DFOFDM displayed superior results across various evaluation metrics, indicating a promising improvement in the field of speech emotion recognition. In terms of computational complexity, DFOFDM demonstrated effective scalability, providing a feasible solution for large-scale applications. Despite its effectiveness, the study acknowledges the potential limitations of DFOFDM, which might influence its performance on certain types of real-world data. The findings underline the potential of DFOFDM in advancing emotion recognition techniques, indicating the necessity for further research.
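For readers unfamiliar with the per-label metrics quoted in the abstract, the following is a minimal sketch of how precision, recall, and F-score can be computed for each emotion label. It is not the authors' evaluation code: the function name `per_label_scores` and the toy labels are assumptions made for illustration, and it treats each utterance as carrying a single label.

```python
from collections import Counter


def per_label_scores(y_true, y_pred):
    """Return {label: (precision, recall, f_score)} for every label seen.

    Hypothetical helper for illustration; assumes one label per sample.
    """
    tp, fp, fn = Counter(), Counter(), Counter()
    for truth, guess in zip(y_true, y_pred):
        if truth == guess:
            tp[truth] += 1   # correct prediction for this label
        else:
            fp[guess] += 1   # predicted label was wrong
            fn[truth] += 1   # true label was missed
    scores = {}
    for label in sorted(set(y_true) | set(y_pred)):
        predicted = tp[label] + fp[label]
        actual = tp[label] + fn[label]
        precision = tp[label] / predicted if predicted else 0.0
        recall = tp[label] / actual if actual else 0.0
        denom = precision + recall
        f_score = 2 * precision * recall / denom if denom else 0.0
        scores[label] = (precision, recall, f_score)
    return scores


# Toy data (made up, not taken from the paper).
y_true = ["Angry", "Angry", "Happy", "Happy", "Neutral", "Neutral"]
y_pred = ["Angry", "Happy", "Happy", "Happy", "Neutral", "Angry"]

for label, (p, r, f) in per_label_scores(y_true, y_pred).items():
    print(f"{label}: precision={p:.2f} recall={r:.2f} f_score={f:.2f}")
```

The F-score here is the harmonic mean of precision and recall (often written F1); the abstract does not state which F-score variant the paper reports.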
Pages: 2547-2572
Number of pages: 26