Speech emotion recognition using multimodal feature fusion with machine learning approach

被引:0
|
作者
Sandeep Kumar Panda
Ajay Kumar Jena
Mohit Ranjan Panda
Susmita Panda
机构
[1] ICFAI Foundation for Higher Education (Deemed to Be University),Department of Data Science and Artificial Intelligence, Faculty of Science and Technology (IcfaiTech)
[2] KIIT Deemed to Be University,School of Computer Engineering
[3] SOA (Deemed to Be University),Department of Computer Science and Engineering
来源
Multimedia Tools and Applications | 2023年 / 82卷
关键词
Feature fusion (FF); Speech emotion recognition (SER); Mel frequency cepstral coefficients; Zero Crossing Rate; Support vector machine; XGBoost;
D O I
暂无
中图分类号
学科分类号
摘要
Speech-based emotional state recognition must have a significant impact on artificial intelligence as machine learning advances. When it comes to emotion recognition, proper feature selection is critical. As a result, feature fusion technology is offered in this work as a means of achieving high prediction accuracy by emphasizing the extraction of sole features. Mel Frequency Cepstral Coefficients (MFCC), Zero Crossing Rate (ZCR), Mel Spectrogram, Short-time Fourier transform (STFT) and Root Mean Square (RMS) are extracted, and four different feature fusion techniques are used on five standard machine learning classifiers: XGBoost, Support Vector Machine (SVM), Random Forest, Decision-Tree (D-Tree), and K Nearest Neighbor (KNN). The successful use of feature fusion techniques on our suggested classifier yields a satisfactory recognition rate of 99.64% on the female only dataset (TESS), 91% on SAVEE (male only dataset) and 86% on CREMA-D (both male and female) dataset. The proposed model shows that effective feature fusion improves the accuracy and applicability of emotion detection systems.
引用
收藏
页码:42763 / 42781
页数:18
相关论文
共 50 条
  • [1] Speech emotion recognition using multimodal feature fusion with machine learning approach
    Panda, Sandeep Kumar
    Jena, Ajay Kumar
    Panda, Mohit Ranjan
    Panda, Susmita
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (27) : 42763 - 42781
  • [2] Speech emotion recognition using feature fusion: a hybrid approach to deep learning
    Khan, Waleed Akram
    ul Qudous, Hamad
    Farhan, Asma Ahmad
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (31) : 75557 - 75584
  • [3] Machine learning approach of speech emotions recognition using feature fusion technique
    Paul, Bachchu
    Bera, Somnath
    Dey, Tanushree
    Phadikar, Santanu
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (03) : 8663 - 8688
  • [4] Machine learning approach of speech emotions recognition using feature fusion technique
    Bachchu Paul
    Somnath Bera
    Tanushree Dey
    Santanu Phadikar
    Multimedia Tools and Applications, 2024, 83 : 8663 - 8688
  • [5] Speech emotion recognition based on multimodal and multiscale feature fusion
    Hu, Huangshui
    Wei, Jie
    Sun, Hongyu
    Wang, Chuhang
    Tao, Shuo
    SIGNAL IMAGE AND VIDEO PROCESSING, 2025, 19 (01)
  • [6] A FEATURE FUSION METHOD BASED ON EXTREME LEARNING MACHINE FOR SPEECH EMOTION RECOGNITION
    Guo, Lili
    Wang, Longbiao
    Dang, Jianwu
    Zhang, Linjuan
    Guan, Haotian
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 2666 - 2670
  • [7] Distinctive Approach for Speech Emotion Recognition Using Machine Learning
    Singh, Yogyata
    Neetu
    Rani, Shikha
    MACHINE LEARNING, IMAGE PROCESSING, NETWORK SECURITY AND DATA SCIENCES, MIND 2022, PT I, 2022, 1762 : 39 - 51
  • [8] Multimodal Emotion Recognition Using Feature Fusion: An LLM-Based Approach
    Chandraumakantham, Omkumar
    Gowtham, N.
    Zakariah, Mohammed
    Almazyad, Abdulaziz
    IEEE ACCESS, 2024, 12 : 108052 - 108071
  • [9] Machine Learning Approach for Emotion Recognition in Speech
    Gjoreski, Martin
    Gjoreski, Hristijan
    Kulakov, Andrea
    INFORMATICA-JOURNAL OF COMPUTING AND INFORMATICS, 2014, 38 (04): : 377 - 383
  • [10] Multimodal machine learning approach for emotion recognition using physiological signals
    Ramadan, Mohamad A.
    Salem, Nancy M.
    Mahmoud, Lamees N.
    Sadek, Ibrahim
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 96