A Machine Learning Approach for Emotion Classification in Bengali Speech

被引:0
|
作者
Islam, Md. Rakibul [1 ]
Akhi, Amatul Bushra [1 ]
Akter, Farzana [2 ]
Rashid, Md Wasiul [1 ]
Rumu, Ambia Islam [3 ]
Lata, Munira Akter [4 ]
Ashrafuzzaman, Md. [4 ]
机构
[1] Daffodil Int Univ, Dept Comp Sci & Engn, Dhaka, Bangladesh
[2] Bangabandhu Sheikh Mujibur Rahman Digital Univ, Dept IoT & Robot Engn, Kaliakair, Bangladesh
[3] Daffodil Int Univ, Dept English, Dhaka, Bangladesh
[4] Bangabandhu Sheikh Mujibur Rahman Digital Univ, Dept Educ Technol, Kaliakair, Bangladesh
关键词
XgBoost; gradient boosting; CatBoost; random forest; MFCC;
D O I
10.14569/IJACSA.2023.0141093
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In this research work, we have presented a machine learning strategy for Bengali speech emotion categorization with a focus on Mel-frequency cepstral coefficients (MFCC) as features. The commonly utilized method of MFCC in speech processing has proved effective in obtaining crucial phoneme-specific data. This paper analyzes the efficacy of four machine learning algorithms: Random Forest, XGBoost, CatBoost, and Gradient Boosting, and tackles the paucity of research on emotion categorization in non-English languages, particularly Bengali. With CatBoost obtaining the greatest accuracy of 82.85%, Gradient Boosting coming in second with 81.19%, XGBoost coming in third with 80.03%, and Random Forest coming in fourth with 80.01%, experimental evaluation shows encouraging outcomes. MFCC features improve classification precision and offer insightful information on the distinctive qualities of emotions expressed in Bengali speech. By demonstrating how well MFCC characteristics can identify emotions in Bengali speech, this study advances the field of emotion classification. Future research can investigate more sophisticated feature extraction methods, look into how temporal dynamics are incorporated into emotion classification models, and investigate practical uses for emotion detection systems in Bengali speech. This study advances our knowledge of emotion classification and paves the way for more effective emotion identification systems in Bengali speech by utilizing MFCC and machine learning techniques. Our work addresses the need for thorough and efficient techniques to recognize and classify emotions in speech signals in the context of essential for many applications, as they are a basic component of human communication. By investigating the precision and effectiveness of emotion recognition, this study advances the field of emotion classification.
引用
收藏
页码:885 / 892
页数:8
相关论文
共 50 条
  • [31] Deep Learning, Ensemble and Supervised Machine Learning for Arabic Speech Emotion Recognition
    Ismaiel, Wahiba
    Alhalangy, Abdalilah
    Mohamed, Adil. O. Y.
    Musa, Abdalla Ibrahim Abdalla
    ENGINEERING TECHNOLOGY & APPLIED SCIENCE RESEARCH, 2024, 14 (02) : 13757 - 13764
  • [32] Machine-learning based classification of speech and music
    M. Kashif Saeed Khan
    Wasfi G. Al-Khatib
    Multimedia Systems, 2006, 12 : 55 - 67
  • [33] On the use of Machine Learning Methods for Speech and Voicing Classification
    Harding, Philip
    Milner, Ben
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1979 - 1982
  • [34] Machine-learning based classification of speech and music
    Khan, M. Kashif Saeed
    Al-Khatib, Wasfi G.
    MULTIMEDIA SYSTEMS, 2006, 12 (01) : 55 - 67
  • [35] Analysis of Machine Learning Methods for Speech Disfluencies' Classification
    Sharma, Nitin Mohan
    Mahapatra, Prasant Kumar
    Gandhi, Vaibhav
    PROCEEDINGS OF NINTH INTERNATIONAL CONGRESS ON INFORMATION AND COMMUNICATION TECHNOLOGY, VOL 2, ICICT 2024, 2024, 1012 : 13 - 22
  • [36] Emotion Profile Refinery for Speech Emotion Classification
    Mao, Shuiyang
    Ching, P. C.
    Lee, Tan
    INTERSPEECH 2020, 2020, : 531 - 535
  • [37] A Machine Learning Approach to Classification of Okra
    Diop, Papa Moussa
    Takamoto, Jin
    Nakamura, Yuji
    Nakamura, Morikazu
    35TH INTERNATIONAL TECHNICAL CONFERENCE ON CIRCUITS/SYSTEMS, COMPUTERS AND COMMUNICATIONS (ITC-CSCC 2020), 2020, : 254 - 257
  • [38] Automatic Song Genre Classification in Bengali Music: A Comparative Study of Machine Learning and Deep Learning Approaches
    Humayra, Atika
    Sohag, Md Maruf Kamran
    Anwer, Mohammed
    Hasan, Mahady
    2024 4TH INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND ARTIFICIAL INTELLIGENCE, CCAI 2024, 2024, : 273 - 277
  • [39] Voted approach for part of speech tagging in Bengali
    Department of Computational Linguistics, University of Heidelberg, 1m Neuenheimer Feld 325, 69120 Heidelberg, Germany
    不详
    不详
    PACLIC 23 - Proc. 23rd Pacific Asia Conf. Lang. Inf. Comput., 2009, (120-129):
  • [40] Speech based emotion classification
    Nwe, TL
    Wei, FS
    De Silva, LC
    IEEE REGION 10 INTERNATIONAL CONFERENCE ON ELECTRICAL AND ELECTRONIC TECHNOLOGY, VOLS 1 AND 2, 2001, : 297 - 301