A Machine Learning Approach for Emotion Classification in Bengali Speech

被引:0
|
作者
Islam, Md. Rakibul [1 ]
Akhi, Amatul Bushra [1 ]
Akter, Farzana [2 ]
Rashid, Md Wasiul [1 ]
Rumu, Ambia Islam [3 ]
Lata, Munira Akter [4 ]
Ashrafuzzaman, Md. [4 ]
机构
[1] Daffodil Int Univ, Dept Comp Sci & Engn, Dhaka, Bangladesh
[2] Bangabandhu Sheikh Mujibur Rahman Digital Univ, Dept IoT & Robot Engn, Kaliakair, Bangladesh
[3] Daffodil Int Univ, Dept English, Dhaka, Bangladesh
[4] Bangabandhu Sheikh Mujibur Rahman Digital Univ, Dept Educ Technol, Kaliakair, Bangladesh
关键词
XgBoost; gradient boosting; CatBoost; random forest; MFCC;
D O I
10.14569/IJACSA.2023.0141093
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In this research work, we have presented a machine learning strategy for Bengali speech emotion categorization with a focus on Mel-frequency cepstral coefficients (MFCC) as features. The commonly utilized method of MFCC in speech processing has proved effective in obtaining crucial phoneme-specific data. This paper analyzes the efficacy of four machine learning algorithms: Random Forest, XGBoost, CatBoost, and Gradient Boosting, and tackles the paucity of research on emotion categorization in non-English languages, particularly Bengali. With CatBoost obtaining the greatest accuracy of 82.85%, Gradient Boosting coming in second with 81.19%, XGBoost coming in third with 80.03%, and Random Forest coming in fourth with 80.01%, experimental evaluation shows encouraging outcomes. MFCC features improve classification precision and offer insightful information on the distinctive qualities of emotions expressed in Bengali speech. By demonstrating how well MFCC characteristics can identify emotions in Bengali speech, this study advances the field of emotion classification. Future research can investigate more sophisticated feature extraction methods, look into how temporal dynamics are incorporated into emotion classification models, and investigate practical uses for emotion detection systems in Bengali speech. This study advances our knowledge of emotion classification and paves the way for more effective emotion identification systems in Bengali speech by utilizing MFCC and machine learning techniques. Our work addresses the need for thorough and efficient techniques to recognize and classify emotions in speech signals in the context of essential for many applications, as they are a basic component of human communication. By investigating the precision and effectiveness of emotion recognition, this study advances the field of emotion classification.
引用
收藏
页码:885 / 892
页数:8
相关论文
共 50 条
  • [21] Applying Machine Learning Techniques for Speech Emotion Recognition
    Tarunika, K.
    Pradeeba, R. B.
    Aruna, P.
    2018 9TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND NETWORKING TECHNOLOGIES (ICCCNT), 2018,
  • [22] Connecting Subspace Learning and Extreme Learning Machine in Speech Emotion Recognition
    Xu, Xinzhou
    Deng, Jun
    Coutinho, Eduardo
    Wu, Chen
    Zhao, Li
    Schuller, Bjoern W.
    IEEE TRANSACTIONS ON MULTIMEDIA, 2019, 21 (03) : 795 - 808
  • [23] Speech emotion recognition of Hindi speech using statistical and machine learning techniques
    Agrawal, Akshat
    Jain, Anurag
    JOURNAL OF INTERDISCIPLINARY MATHEMATICS, 2020, 23 (01) : 311 - 319
  • [24] Machine learning methods for speech emotion recognition on telecommunication systems
    Osipov, Alexey
    Pleshakova, Ekaterina
    Liu, Yang
    Gataullin, Sergey
    JOURNAL OF COMPUTER VIROLOGY AND HACKING TECHNIQUES, 2024, 20 (03) : 415 - 428
  • [25] Speech Emotion Recognition Using Machine Learning: A Comparative Analysis
    Nath S.
    Shahi A.K.
    Martin T.
    Choudhury N.
    Mandal R.
    SN Computer Science, 5 (4)
  • [26] Speech emotion recognition using machine learning - A systematic review
    Madanian, Samaneh
    Chen, Talen
    Adeleye, Olayinka
    Templeton, John Michael
    Poellabauer, Christian
    Parry, Dave
    Schneidere, Sandra L.
    INTELLIGENT SYSTEMS WITH APPLICATIONS, 2023, 20
  • [27] Feature extraction and classification efficiency analysis using machine learning approach for speech signal
    Singh, Mahesh K.
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (16) : 47069 - 47084
  • [28] Feature extraction and classification efficiency analysis using machine learning approach for speech signal
    Mahesh K. Singh
    Multimedia Tools and Applications, 2024, 83 : 47069 - 47084
  • [29] SPEECH EMOTION RECOGNITION-A DEEP LEARNING APPROACH
    Asiya, U. A.
    Kiran, V. K.
    PROCEEDINGS OF THE 2021 FIFTH INTERNATIONAL CONFERENCE ON I-SMAC (IOT IN SOCIAL, MOBILE, ANALYTICS AND CLOUD) (I-SMAC 2021), 2021, : 867 - 871
  • [30] Emotion Recognition from Speech: An Unsupervised Learning Approach
    Rovetta, Stefano
    Mnasri, Zied
    Masulli, Francesco
    Cabri, Alberto
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2021, 14 (01) : 23 - 35