Optimizing the configuration of deep learning models for music genre classification

Cited by: 5

Authors:

Li, Teng [1]

Affiliations:

[1] Pingdingshan Polytenchn Coll, Acad Arts, Pingdingshan 467000, Henan, Peoples R China

Source:

HELIYON | 2024 / Volume 10 / Issue 02

Keywords:

Deep reinforcement learning; Convolutional neural network; Signal processing; Music genre classification;

DOI:

10.1016/j.heliyon.2024.e24892

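Chinese Library Classification (CLC) number:

O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences];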

Discipline classification code:

07 ; 0710 ; 09 ;

Abstract:

Music genre categorization is a fundamental use of sound processing methods in the realm of music retrieval. Typically, people are responsible for categorizing music genres. Machine learning approaches can automate this procedure. Therefore, in recent years, several approaches have been suggested to achieve this objective. Nevertheless, the given findings indicate that there is still a discrepancy between the observed results and an optimal categorization method. Hence, this paper introduces a novel approach for accurately forecasting music genres by using deep learning methodologies. The proposed approach involves preprocessing the input signals and then representing the characteristics of each signal using a combination of Mel Frequency Cepstral Coefficients (MFCC) and Short-Time Fourier Transform (STFT) features. Subsequently, a convolutional neural network (CNN) is applied to process each group of these characteristics. The proposed technique utilizes two CNN models to analyze MFCC and STFT data. Although the structure of these models is identical, the hyperparameters of each model are individually adjusted using the black hole optimization (BHO) algorithm. Here, the optimization method fine-tunes the hyperparameters of each CNN model to minimize their training error. Ultimately, the outputs of the two CNN models are combined to determine the music genre using a classifier based on SoftMax. The efficacy of the suggested methodology in categorizing music genres has been assessed using the GTZAN and Extended-Ballroom datasets. The experimental findings demonstrated that the suggested approach achieved classification accuracies of 95.2% and 95.7% on the two datasets, respectively, indicating its superiority over earlier efforts.
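
The abstract states that black hole optimization (BHO) tunes each CNN's hyperparameters to minimize training error, but does not give the procedure. The following is a minimal sketch of one common formulation of the black hole algorithm, not the paper's implementation. The function names, the box bounds, and the quadratic `toy_error` (a stand-in for real CNN training error) are all hypothetical, and the event-horizon formula assumes a non-negative objective.

```python
import numpy as np

def black_hole_optimize(objective, lower, upper, n_stars=20, n_iters=100, seed=0):
    """Minimize `objective` over a box with a basic black hole optimization loop."""
    rng = np.random.default_rng(seed)
    lower = np.asarray(lower, dtype=float)
    upper = np.asarray(upper, dtype=float)
    dim = lower.size

    # Random initial population of candidate solutions ("stars").
    stars = rng.uniform(lower, upper, size=(n_stars, dim))
    fitness = np.array([objective(s) for s in stars])

    # The best star found so far acts as the black hole.
    best = int(np.argmin(fitness))
    bh_pos, bh_fit = stars[best].copy(), float(fitness[best])

    for _ in range(n_iters):
        # Pull every star toward the black hole by a random fraction of the gap.
        step = rng.uniform(0.0, 1.0, size=(n_stars, 1))
        stars = np.clip(stars + step * (bh_pos - stars), lower, upper)
        fitness = np.array([objective(s) for s in stars])

        # A star that beats the black hole takes its place.
        best = int(np.argmin(fitness))
        if fitness[best] < bh_fit:
            bh_pos, bh_fit = stars[best].copy(), float(fitness[best])

        # Event horizon radius: black hole fitness over total star fitness.
        radius = bh_fit / (fitness.sum() + 1e-12)
        dist = np.linalg.norm(stars - bh_pos, axis=1)
        swallowed = dist < radius
        n_new = int(swallowed.sum())
        if n_new:
            # Stars inside the horizon are replaced by fresh random stars.
            stars[swallowed] = rng.uniform(lower, upper, size=(n_new, dim))

    return bh_pos, bh_fit

def toy_error(h):
    # Hypothetical stand-in for CNN training error; minimum at h = (-3, 64).
    return (h[0] + 3.0) ** 2 + ((h[1] - 64.0) / 32.0) ** 2

# h[0]: log10 learning rate, h[1]: number of filters (both illustrative).
best_h, best_err = black_hole_optimize(toy_error, lower=[-5.0, 16.0], upper=[-1.0, 128.0])
```

In an actual pipeline the objective would train a CNN with the candidate hyperparameters and return its training error, and the search would run once for the MFCC model and once for the STFT model.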

Pages: 13

50 records in total

[1] Robustness of musical features on deep learning models for music genre classification
Singh, Yeshwant
Biswas, Anupam
EXPERT SYSTEMS WITH APPLICATIONS, 2022, 199
[2] Music Genre Classification Based on Deep Learning
Zhang, Wenlong
MOBILE INFORMATION SYSTEMS, 2022, 2022
[3] Music genre classification and music recommendation by using deep learning
Elbir, A.
Aydin, N.
ELECTRONICS LETTERS, 2020, 56 (12) : 627 - 629
[4] A Music Genre Classification Method Based on Deep Learning
He, Qi
MATHEMATICAL PROBLEMS IN ENGINEERING, 2022, 2022
[5] Music Genre Classification using Deep learning - A review
Prince, Shajin
Thomas, Justin Jojy
Sharon Jostana, J.
Priya, Kakarla Preethi
Daniel, J Joshua
6th IEEE International Conference on Computational System and Information Technology for Sustainable Solutions, CSITSS 2022, 2022,
[6] Music Genre Classification Based on Chroma Features and Deep Learning
Shi, Leisi
Li, Chen
Tian, Lihua
2019 TENTH INTERNATIONAL CONFERENCE ON INTELLIGENT CONTROL AND INFORMATION PROCESSING (ICICIP), 2019, : 81 - 86
[7] Ensemble of deep learning, visual and acoustic features for music genre classification
Nanni, Loris
Costa, Yandre M. G.
Aguiar, Rafael L.
Silla, Carlos N., Jr.
Brahnam, Sheryl
JOURNAL OF NEW MUSIC RESEARCH, 2018, 47 (04) : 383 - 397
[8] Jazz Music Sub-Genre Classification Using Deep Learning
Quinto, Rene Josiah M.
Atienza, Rowel O.
Tiglao, Nestor Michael C.
TENCON 2017 - 2017 IEEE REGION 10 CONFERENCE, 2017, : 3111 - 3116
[9] Jazz music sub-genre classification using deep learning
Quinto, Rene Josiah M.
Atienza, Rowel O.
Tiglao, Nestor Michael C.
IEEE Region 10 Annual International Conference, Proceedings/TENCON, 2017, 2017-December : 3111 - 3116
[10] Implementation of Deep Learning Models on an SoC-FPGA Device for Real-Time Music Genre Classification
Faizan, Muhammad
Intzes, Ioannis
Cretu, Ioana
Meng, Hongying
TECHNOLOGIES, 2023, 11 (04)
