A Machine Learning Approach for Emotion Classification in Bengali Speech

被引:0
|
作者
Islam, Md. Rakibul [1 ]
Akhi, Amatul Bushra [1 ]
Akter, Farzana [2 ]
Rashid, Md Wasiul [1 ]
Rumu, Ambia Islam [3 ]
Lata, Munira Akter [4 ]
Ashrafuzzaman, Md. [4 ]
机构
[1] Daffodil Int Univ, Dept Comp Sci & Engn, Dhaka, Bangladesh
[2] Bangabandhu Sheikh Mujibur Rahman Digital Univ, Dept IoT & Robot Engn, Kaliakair, Bangladesh
[3] Daffodil Int Univ, Dept English, Dhaka, Bangladesh
[4] Bangabandhu Sheikh Mujibur Rahman Digital Univ, Dept Educ Technol, Kaliakair, Bangladesh
关键词
XgBoost; gradient boosting; CatBoost; random forest; MFCC;
D O I
10.14569/IJACSA.2023.0141093
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In this research work, we have presented a machine learning strategy for Bengali speech emotion categorization with a focus on Mel-frequency cepstral coefficients (MFCC) as features. The commonly utilized method of MFCC in speech processing has proved effective in obtaining crucial phoneme-specific data. This paper analyzes the efficacy of four machine learning algorithms: Random Forest, XGBoost, CatBoost, and Gradient Boosting, and tackles the paucity of research on emotion categorization in non-English languages, particularly Bengali. With CatBoost obtaining the greatest accuracy of 82.85%, Gradient Boosting coming in second with 81.19%, XGBoost coming in third with 80.03%, and Random Forest coming in fourth with 80.01%, experimental evaluation shows encouraging outcomes. MFCC features improve classification precision and offer insightful information on the distinctive qualities of emotions expressed in Bengali speech. By demonstrating how well MFCC characteristics can identify emotions in Bengali speech, this study advances the field of emotion classification. Future research can investigate more sophisticated feature extraction methods, look into how temporal dynamics are incorporated into emotion classification models, and investigate practical uses for emotion detection systems in Bengali speech. This study advances our knowledge of emotion classification and paves the way for more effective emotion identification systems in Bengali speech by utilizing MFCC and machine learning techniques. Our work addresses the need for thorough and efficient techniques to recognize and classify emotions in speech signals in the context of essential for many applications, as they are a basic component of human communication. By investigating the precision and effectiveness of emotion recognition, this study advances the field of emotion classification.
引用
收藏
页码:885 / 892
页数:8
相关论文
共 50 条
  • [11] Paralinguistic and spectral feature extraction for speech emotion classification using machine learning techniques
    Tong Liu
    Xiaochen Yuan
    EURASIP Journal on Audio, Speech, and Music Processing, 2023
  • [12] Speech emotion recognition using multimodal feature fusion with machine learning approach
    Panda, Sandeep Kumar
    Jena, Ajay Kumar
    Panda, Mohit Ranjan
    Panda, Susmita
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (27) : 42763 - 42781
  • [13] Speech emotion recognition using multimodal feature fusion with machine learning approach
    Sandeep Kumar Panda
    Ajay Kumar Jena
    Mohit Ranjan Panda
    Susmita Panda
    Multimedia Tools and Applications, 2023, 82 : 42763 - 42781
  • [14] Speech Emotion Classification Using Deep Learning
    Mishra, Siba Prasad
    Warule, Pankaj
    Deb, Suman
    PROCEEDINGS OF 27TH INTERNATIONAL SYMPOSIUM ON FRONTIERS OF RESEARCH IN SPEECH AND MUSIC, FRSM 2023, 2024, 1455 : 19 - 31
  • [15] A Machine Learning Approach for Authorship Attribution for Bengali Blogs
    Phani, Shanta
    Lahiri, Shibamouli
    Biswas, Arindam
    PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP), 2016, : 271 - 274
  • [16] A Machine Learning Approach to Speech Act Classification Using Function Words
    O'Shea, James
    Bandar, Zuhair
    Crockett, Keeley
    AGENT AND MULTI-AGENT SYSTEMS: TECHNOLOGIES AND APPLICATIONS, PT II, PROCEEDINGS, 2010, 6071 : 82 - 91
  • [17] Machine Learning in Human Emotion Detection from the Speech
    Qiu, Xiaoli
    Li, Wei
    Li, Yang
    Gu, Hongmei
    Song, Fei
    Sabitha, R.
    JOURNAL OF INTERCONNECTION NETWORKS, 2022, 22 (SUPP04)
  • [18] Emotion Recognition On Speech Signals Using Machine Learning
    Ghai, Mohan
    Lal, Shamit
    Duggal, Shivam
    Manik, Shrey
    PROCEEDINGS OF THE 2017 INTERNATIONAL CONFERENCE ON BIG DATA ANALYTICS AND COMPUTATIONAL INTELLIGENCE (ICBDAC), 2017, : 34 - 39
  • [19] Transfer Learning for Improving Speech Emotion Classification Accuracy
    Latif, Siddique
    Rana, Rajib
    Younis, Shahzad
    Qadir, Junaid
    Epps, Julien
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 257 - 261
  • [20] Speech based Emotion Recognition using Machine Learning
    Deshmukh, Girija
    Gaonkar, Apurva
    Golwalkar, Gauri
    Kulkarni, Sukanya
    PROCEEDINGS OF THE 2019 3RD INTERNATIONAL CONFERENCE ON COMPUTING METHODOLOGIES AND COMMUNICATION (ICCMC 2019), 2019, : 812 - 817