An Effective Speech Emotion Recognition Model for Multi-Regional Languages Using Threshold-based Feature Selection Algorithm

被引:7
作者
Subramanian, Radhika [1 ]
Aruchamy, Prasanth [1 ]
机构
[1] Sri Venkateswara Coll Engn, Dept Elect & Commun Engn, Sriperumpudur, India
关键词
Speech emotion recognition; Feature selection; Machine learning; Indian regional languages; Feature extraction;
D O I
10.1007/s00034-023-02571-4
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
At present, there are several communication tools employed to express human emotions. Among the numerous modes of communication, speech is the most predominant one for communicating with people effectively and efficiently. Speech emotion recognition (SER) plays a significant role in several signal processing applications. However, in both feature selection (FS) methods and reliable classifiers, determining their appropriate features has emerged as challenges in identifying the emotions expressed in Indian regional languages. In this work, a novel SER framework has been proposed to classify different speech emotions. Primarily, the proposed framework utilizes a preprocessing phase so as to alleviate the background noise and the artifacts present in input speech signal. Later on, the two new speech attributes related to energy and phase have been integrated with state-of-the-art attributes for examining speech emotion characteristics. The threshold-based feature selection (TFS) algorithm has been introduced to determine the optimal features by applying a statistical approach. An Indian regional language called Tamil Emotional dataset has been created for examining the proposed framework with the aid of standard machine learning and deep learning classifiers. The proposed TFS technique has been more suitable for Indian regional languages since it exhibits a superior performance with 97.96% accuracy compared to Indian English and Malayalam datasets.
引用
收藏
页码:2477 / 2506
页数:30
相关论文
共 34 条
  • [2] Performance of deer hunting optimization based deep learning algorithm for speech emotion recognition
    Agarwal, Gaurav
    Om, Hari
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (07) : 9961 - 9992
  • [3] [Anonymous], About us
  • [4] Identification/segmentation of indian regional languages with singular value decomposition based feature embedding
    Bhowmick, Anirban
    Biswas, Astik
    AnveshKumar, Nella
    Kottath, Rahul
    [J]. APPLIED ACOUSTICS, 2021, 176
  • [5] Chattopadhyay S, 2022, MULTIMED TOOLS APPL, P1
  • [6] A Hybrid Meta-Heuristic Feature Selection Method Using Golden Ratio and Equilibrium Optimization Algorithms for Speech Emotion Recognition
    Dey, Arijit
    Chattopadhyay, Soham
    Singh, Pawan Kumar
    Ahmadian, Ali
    Ferrara, Massimiliano
    Sarkar, Ram
    [J]. IEEE ACCESS, 2020, 8 : 200953 - 200970
  • [7] Effect of vocal tract dynamics on neural network-based speech recognition: A Bengali language-based study
    Hasan, Md Rakibul
    Hasan, Md Mahbub
    Hossain, Md Zakir
    [J]. EXPERT SYSTEMS, 2022, 39 (09)
  • [8] Multi-Feature Analysis for Automated Brain Stroke Classification Using Weighted Gaussian Naive Bayes Classifier
    Jayachitra, S.
    Prasanth, A.
    [J]. JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2021, 30 (10)
  • [9] Machine learning techniques for speech emotion recognition using paralinguistic acoustic features
    Jha T.
    Kavya R.
    Christopher J.
    Arunachalam V.
    [J]. International Journal of Speech Technology, 2022, 25 (03): : 707 - 725
  • [10] An effective motion object detection using adaptive background modeling mechanism in video surveillance system
    Kalli, SivaNagiReddy
    Suresh, T.
    Prasanth, A.
    Muthumanickam, T.
    Mohanram, K.
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2021, 41 (01) : 1777 - 1789