A Reliable speech emotion recognition framework for multi-regional languages using optimized light gradient boosting machine classifier

被引:2
作者
Radhika, Subramanian [1 ]
Prasanth, Aruchamy [2 ]
Sowndarya, K. K. Devi [3 ]
机构
[1] Sri Venkateswara Coll Engn, Dept Elect & Commun Engn, Sriperumbudur, Tamil Nadu, India
[2] Vel Tech Rangarajan Dr Sagunthala R&D Inst Sci & T, Dept Comp Sci & Engn, Chennai, India
[3] DMI Coll Engn, Dept Elect & Commun Engn, Chennai, Tamil Nadu, India
关键词
Signal Processing; Speech Emotion Recognition; Feature Selection; Machine Learning; Indian regional language;
D O I
10.1016/j.bspc.2025.107636
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
In today's world, the interpretation of emotions from human speech has prompted a lot of research attention in signal processing applications. Many speech recognition approaches have been introduced to detect the psychological states of individuals for public datasets. However, identifying the different emotions in the Indian regional language is one of the most demanding promises. As a consequence of inappropriate feature selection techniques, existing machine learning classifiers produce more misclassification errors for Indian regional datasets. Therefore, this work proposes a novel machine learning TEL (TF-EPOA + LGBM) classifier for accurately predicting human emotions. In the first stage, the pre-processing of the speech samples can be carried out to eradicate the artifacts. To express personal emotions in speech, the threshold-based feature selection technique is utilized to select the appropriate speech features. The optimal speech features are fed into the Light Gradient Boosting Machine (LGBM) classifier to significantly determine the emotions. The hyperparameters of the LGBM classifier are properly tuned with the aid of a Hybrid optimization where the Tangent Flight Operator (TF) is incorporated with the Exponential Pelican Optimization Algorithm (EPOA). This effective tuning facilitates the proposed classifier to achieve a superior classification accuracy of 99.27 %, 98.4 %, and 96.7 % for the Tamil, Malayalam, and English language datasets, respectively.
引用
收藏
页数:16
相关论文
共 39 条
[1]   Machine learning techniques for emotion detection and sentiment analysis: current state, challenges, and future directions [J].
Alslaity, Alaa ;
Orji, Rita .
BEHAVIOUR & INFORMATION TECHNOLOGY, 2024, 43 (01) :139-164
[2]   Speech Emotion Recognition Systems: A Comprehensive Review on Different Methodologies [J].
Anthony, Audre Arlene ;
Patil, Chandreshekar Mohan .
WIRELESS PERSONAL COMMUNICATIONS, 2023, 130 (01) :515-525
[3]  
Aouani H, 2021, INTELLIGENT SYSTEMS, V19, P406
[4]   A robust feature selection method based on meta-heuristic optimization for speech emotion recognition [J].
Bagadi, Kesava Rao ;
Sivappagari, Chandra Mohan Reddy .
EVOLUTIONARY INTELLIGENCE, 2024, 17 (02) :993-1004
[5]  
Balasubramaniam S, 2022, Comput Intell Neurosci, V2022, P2819378, DOI 10.1155/2022/2819378
[6]   An energy-aware software fault detection system based on hierarchical rule approach for enhancing quality of service in internet of things-enabled wireless sensor network [J].
Balraj, Lavina ;
Prasanth, Aruchamy .
TRANSACTIONS ON EMERGING TELECOMMUNICATIONS TECHNOLOGIES, 2024, 35 (04)
[7]   Bagged support vector machines for emotion recognition from speech [J].
Bhavan, Anjali ;
Chauhan, Pankaj ;
Hitkul ;
Shah, Rajiv Ratn .
KNOWLEDGE-BASED SYSTEMS, 2019, 184
[8]  
Chang J, 2024, IEEE J BIOMED HEALTH, V28, P2025, DOI 10.1109/JBHI.2024.3360151
[9]   K-means Pelican Optimization Algorithm based Search Space Reduction for Remote Sensing Image Retrieval [J].
Chembian, W. T. ;
Senthilkumar, G. ;
Prasanth, A. ;
Subash, R. .
JOURNAL OF THE INDIAN SOCIETY OF REMOTE SENSING, 2025, 53 (01) :101-115
[10]   Two-layer fuzzy multiple random forest for speech emotion recognition in human-robot interaction [J].
Chen, Luefeng ;
Su, Wanjuan ;
Feng, Yu ;
Wu, Min ;
She, Jinhua ;
Hirota, Kaoru .
INFORMATION SCIENCES, 2020, 509 :150-163