Sentiment Analysis of Bengali Music based on various Audio Features: An analysis of Machine Learning and Deep Learning Methods

被引：0

作者：

Humayra, Atika ^{[1
]}

Sohag, Md Maruf Kamran ^{[1
]}

Anwer, Mohammed ^{[1
]}

Hasan, Mahady ^{[1
]}

机构：

[1] Independent Univ, Dhaka, Bangladesh

来源：

2024 5TH INTERNATIONAL CONFERENCE ON COMPUTING, NETWORKS AND INTERNET OF THINGS, CNIOT 2024 | 2024年

关键词：

Bengali Music; Sentiment Analysis; Audio Features; Music Information Retrieval; Multi-Class Classification; Machine Learning; Neural Networks;

D O I：

10.1145/3670105.3670155

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

Sentiment analysis is a method used to determine the emotional tone or mood conveyed in a text or work of literature. Music functions as a constructive medium for emotional expression, providing a powerful means to communicate and convey feelings. Recently, music sentiment analysis has emerged as a popular method for curating and recommending music to listeners based on their emotional state. Despite the abundant literary legacy of the Bengali language, there are only a limited number of notable works that effectively accomplish the desired objective, and the number of sentiment categories is quite low. Furthermore, these efforts rely exclusively on music lyrics, which may not always be an optimal approach. This is because many lines in a song may lack a literal meaning, making it challenging for classifiers to accurately assign them to the appropriate sentiment category. Furthermore, each song possesses inherent audio characteristics. Therefore, in this research, we propose a novel approach aimed to categorize music sentiments into five distinct classes by utilizing these fundamental audio characteristics. Furthermore, we utilized our own dataset to accomplish the desired outcome. We have employed machine learning and deep learning classifiers to accurately categorize the sentiments of Bengali music into appropriate groups. We used suitable metrics to assess the efficiency of our models. In addition, we have conducted an analysis to determine which intrinsic audio characteristics are most significant in relation to the sentiment categories. Furthermore, our models have demonstrated exceptional performance, with a peak accuracy of 76.79%.

引用

页码：298 / 303

页数：6

共 19 条

[1]

Ahmed T., 2022, 2022 INT C ADV EL EL, DOI [10.1109/icaeee54957.2022.9836434, DOI 10.1109/ICAEEE54957.2022.9836434]

[2]

Al Mamun Afif, 2020, Bangla Music Dataset

[3]

Al Mamunl MA, 2019, PROCEEDINGS OF THE 2019 8TH INTERNATIONAL CONFERENCE ON SYSTEM MODELING & ADVANCEMENT IN RESEARCH TRENDS (SMART-2019), P397, DOI [10.1109/smart46866.2019.9117400, 10.1109/SMART46866.2019.9117400]

[4]

[Anonymous], 2024, Label encoding in python

[5]

anyscale, What is hyperparameter tuning?

[6]

baeldung, MultiLayer Perceptron vs. Deep Neural Network

[7]

deci, Deep Neural Network (DNN)

[8]

enjoyalgorithms, Introduction to feature scaling: Normalizing and standardization

[9]

flair, Kernel FunctionsIntroduction to SVM Kernel and Examples

[10]

gradient, What is Stochastic Gradient Descent?

← 1 2 →