An Effective Speech Emotion Recognition Model for Multi-Regional Languages Using Threshold-based Feature Selection Algorithm

Cited by: 7

Authors:

Subramanian, Radhika [1]

Aruchamy, Prasanth [1]

Affiliations:

[1] Sri Venkateswara Coll Engn, Dept Elect & Commun Engn, Sriperumpudur, India

Source:

CIRCUITS SYSTEMS AND SIGNAL PROCESSING | 2024 / Vol. 43 / No. 04

Keywords:

Speech emotion recognition; Feature selection; Machine learning; Indian regional languages; Feature extraction

DOI:

10.1007/s00034-023-02571-4

Chinese Library Classification (CLC) number:

TM [Electrical engineering]; TN [Electronics and communication technology]

Discipline classification code:

0808; 0809

Abstract:

At present, several communication tools are used to express human emotions. Among these modes of communication, speech is the predominant one for communicating with people effectively and efficiently. Speech emotion recognition (SER) plays a significant role in several signal processing applications. However, determining appropriate features, with respect to both feature selection (FS) methods and reliable classifiers, has emerged as a challenge in identifying the emotions expressed in Indian regional languages. In this work, a novel SER framework is proposed to classify different speech emotions. First, the proposed framework applies a preprocessing phase to reduce the background noise and artifacts present in the input speech signal. Next, two new speech attributes related to energy and phase are integrated with state-of-the-art attributes to examine speech emotion characteristics. A threshold-based feature selection (TFS) algorithm is introduced to determine the optimal features by applying a statistical approach. A Tamil Emotional dataset, covering an Indian regional language, has been created to evaluate the proposed framework with standard machine learning and deep learning classifiers. The proposed TFS technique is found to be more suitable for Indian regional languages, since it exhibits superior performance, with 97.96% accuracy, compared to the Indian English and Malayalam datasets.


Pages: 2477-2506

Page count: 30

