Emotion recognition from speech signals using digital features optimization by diversity measure fusion

被引:0
|
作者
Konduru, Ashok Kumar [1 ]
Iqbal, J. L. Mazher [2 ]
机构
[1] Veltech Rangarajan Dr Sagunthala R&D Inst Sci & T, Chennai, Tamil Nadu, India
[2] Veltech Rangarajan Dr Sagunthala R&D Inst Sci & T, ECE, Chennai, Tamil Nadu, India
关键词
Hidden markov model; emotion detection; speech signal; artificial intelligence; cuckoo search; distributed diversity measures; FEATURE-SELECTION; ALGORITHM; NETWORKS;
D O I
10.3233/JIFS-231263
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Emotion recognition from speech signals serves a crucial role in human-computer interaction and behavioral studies. The task, however, presents significant challenges due to the high dimensionality and noisy nature of speech data. This article presents a comprehensive study and analysis of a novel approach, "Digital Features Optimization by Diversity Measure Fusion (DFOFDM)", aimed at addressing these challenges. The paper begins by elucidating the necessity for improved emotion recognition methods, followed by a detailed introduction to DFOFDM. This approach employs acoustic and spectral features from speech signals, coupled with an optimized feature selection process using a fusion of diversity measures. The study's central method involves a Cuckoo Search-based classification strategy, which is tailored for this multi-label problem. The performance of the proposed DFOFDM approach is evaluated extensively. Emotion labels such as 'Angry', 'Happy', and 'Neutral' showed a precision rate over 92%, while other emotions fell within the range of 87% to 90%. Similar performance was observed in terms of recall, with most emotions falling within the 90% to 95% range. The F-Score, another crucial metric, also reflected comparable statistics for each label. Notably, the DFOFDM model showed resilience to label imbalances and noise in speech data, crucial for real-world applications. When compared with a contemporary model, "Transfer Subspace Learning by Least Square Loss (TSLSL)", DFOFDM displayed superior results across various evaluation metrics, indicating a promising improvement in the field of speech emotion recognition. In terms of computational complexity, DFOFDM demonstrated effective scalability, providing a feasible solution for large-scale applications. Despite its effectiveness, the study acknowledges the potential limitations of the DFOFDM, which might influence its performance on certain types of real-world data. The findings underline the potential of DFOFDM in advancing emotion recognition techniques, indicating the necessity for further research.
引用
收藏
页码:2547 / 2572
页数:26
相关论文
共 50 条
  • [41] Speech Emotion Recognition by Late Fusion for Bidirectional Reservoir Computing With Random Projection
    Ibrahim, Hemin
    Loo, Chu Kiong
    Alnajjar, Fady
    IEEE ACCESS, 2021, 9 : 122855 - 122871
  • [42] Emotion Recognition from Speech: An Unsupervised Learning Approach
    Rovetta, Stefano
    Mnasri, Zied
    Masulli, Francesco
    Cabri, Alberto
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2021, 14 (01) : 23 - 35
  • [43] Fine-grained emotion recognition: fusion of physiological signals and facial expressions on spontaneous emotion corpus
    Setiawan, Feri
    Prabono, Aria Ghora
    Khowaja, Sunder Ali
    Kim, Wangsoo
    Park, Kyoungsoo
    Yahya, Bernardo Nugroho
    Lee, Seok-Lyong
    Hong, Jin Pyo
    INTERNATIONAL JOURNAL OF AD HOC AND UBIQUITOUS COMPUTING, 2020, 35 (03) : 162 - 178
  • [44] A robust feature selection method based on meta-heuristic optimization for speech emotion recognition
    Bagadi, Kesava Rao
    Sivappagari, Chandra Mohan Reddy
    EVOLUTIONARY INTELLIGENCE, 2024, 17 (02) : 993 - 1004
  • [45] A Hybrid Meta-Heuristic Feature Selection Method Using Golden Ratio and Equilibrium Optimization Algorithms for Speech Emotion Recognition
    Dey, Arijit
    Chattopadhyay, Soham
    Singh, Pawan Kumar
    Ahmadian, Ali
    Ferrara, Massimiliano
    Sarkar, Ram
    IEEE ACCESS, 2020, 8 : 200953 - 200970
  • [46] In-depth investigation of speech emotion recognition studies from past to present -The importance of emotion recognition from speech signal for AI-
    Sonmez, Yesimim uLGEN
    Varol, Asaf
    INTELLIGENT SYSTEMS WITH APPLICATIONS, 2024, 22
  • [47] Utilizing Computer Vision Algorithms to Detect and Describe Local Features in Images for Emotion Recognition from Speech
    Weisskirchen, Norman
    Reddy, Mainampati Vasudeva
    Wendemuth, Andreas
    Siegert, Ingo
    PROCEEDINGS OF THE 2020 IEEE INTERNATIONAL CONFERENCE ON HUMAN-MACHINE SYSTEMS (ICHMS), 2020, : 428 - 433
  • [48] Equilibrium Optimizer for Emotion Classification From English Speech Signals
    Yue, Liya
    Hu, Pei
    Chu, Shu-Chuan
    Pan, Jeng-Shyang
    IEEE ACCESS, 2023, 11 : 134217 - 134229
  • [49] Low-Order Multi-Level Features for Speech Emotion Recognition
    Tamulevicius, Gintautas
    Liogiene, Tatjana
    BALTIC JOURNAL OF MODERN COMPUTING, 2015, 3 (04): : 234 - 247
  • [50] Recognition of Human Emotion from a Speech Signal Based on Plutchik's Model
    Kaminska, Dorota
    Pelikant, Adam
    INTERNATIONAL JOURNAL OF ELECTRONICS AND TELECOMMUNICATIONS, 2012, 58 (02) : 165 - 170