An Effective Speech Emotion Recognition Model for Multi-Regional Languages Using Threshold-based Feature Selection Algorithm

被引:0
作者
Radhika Subramanian
Prasanth Aruchamy
机构
[1] Sri Venkateswara College of Engineering,Department of Electronics and Communication Engineering
来源
Circuits, Systems, and Signal Processing | 2024年 / 43卷
关键词
Speech emotion recognition; Feature selection; Machine learning; Indian regional languages; Feature extraction;
D O I
暂无
中图分类号
学科分类号
摘要
At present, there are several communication tools employed to express human emotions. Among the numerous modes of communication, speech is the most predominant one for communicating with people effectively and efficiently. Speech emotion recognition (SER) plays a significant role in several signal processing applications. However, in both feature selection (FS) methods and reliable classifiers, determining their appropriate features has emerged as challenges in identifying the emotions expressed in Indian regional languages. In this work, a novel SER framework has been proposed to classify different speech emotions. Primarily, the proposed framework utilizes a preprocessing phase so as to alleviate the background noise and the artifacts present in input speech signal. Later on, the two new speech attributes related to energy and phase have been integrated with state-of-the-art attributes for examining speech emotion characteristics. The threshold-based feature selection (TFS) algorithm has been introduced to determine the optimal features by applying a statistical approach. An Indian regional language called Tamil Emotional dataset has been created for examining the proposed framework with the aid of standard machine learning and deep learning classifiers. The proposed TFS technique has been more suitable for Indian regional languages since it exhibits a superior performance with 97.96% accuracy compared to Indian English and Malayalam datasets.
引用
收藏
页码:2477 / 2506
页数:29
相关论文
共 63 条
  • [1] Abdel-Hamid L(2020)Egyptian Arabic speech emotion recognition using prosodic, spectral and wavelet features Speech Commun. 122 9-30
  • [2] Agarwal G(2021)Performance of deer hunting optimization based deep learning algorithm for speech emotion recognition Multimed. Tools Appl. 80 9961-9992
  • [3] Om H(2021)Identification/segmentation of Indian regional languages with singular value decomposition based feature embedding Appl. Acoust. 176 1-15
  • [4] Bhowmick A(2022)A feature selection model for speech emotion recognition using clustering-based population generation with hybrid of equilibrium optimizer and atom search optimization algorithm Multimed. Tools Appl. 24 1-34
  • [5] Biswas A(2022)Effect of vocal tract dynamics on neural network-based speech recognition: A Bengali language-based study Expert. Syst. 39 1-22
  • [6] Chattopadhyay S(2021)Multi-feature analysis for automated brain stroke classification using weighted Gaussian Naïve Baye’s classifier J. Circuits Syst. Comp. 30 1-26
  • [7] Dey A(2021)An effective motion object detection using adaptive background modeling mechanism in video surveillance system J. Intell. Fuzzy Syst. 41 777-1789
  • [8] Singh PK(2022)Enhanced depression detection from speech using Quantum Whale Optimization Algorithm for feature selection Comput. Biol. Med. 150 1-15
  • [9] Ahmadian A(2022)Impact of feature extraction and feature selection algorithms on Punjabi speech emotion recognition using convolutional neural network Trans. Asian Low-Resour. Lang. Inform. Process. 21 1-23
  • [10] Sarkar R(2020)Feature extraction algorithms to improve the speech emotion recognition rate Int. J. Speech Technol. 23 45-55