Optimizing the configuration of deep learning models for music genre classification

被引:8
作者
Li, Teng [1 ]
机构
[1] Pingdingshan Polytenchn Coll, Acad Arts, Pingdingshan 467000, Henan, Peoples R China
关键词
Deep reinforcement learning; Convolutional neural network; Signal processing; Music genre classification;
D O I
10.1016/j.heliyon.2024.e24892
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Music genre categorization is a fundamental use of sound processing methods in the realm of music retrieval. Typically, people are responsible for categorizing music genres. Machine learning approaches can automate this procedure. Therefore, in recent years, several approaches have been suggested to achieve this objective. Nevertheless, the given findings indicate that there is still a discrepancy between the observed results and an optimal categorization method. Hence, this paper introduces a novel approach for accurately forecasting music genres by using deep learning methodologies. The proposed approach involves preprocessing the input signals and then representing the characteristics of each signal using a combination of Mel Frequency Cepstral Coefficients (MFCC) and Short-Time Fourier Transform (STFT) features. Subsequently, a convolutional neural network (CNN) is applied to process each group of these characteristics. The proposed technique utilizes two CNN models to analyze MFCC and STFT data. Although the structure of these models is identical, the hyper-parameters of each model are individually adjusted using the black hole optimization (BHO) algorithm. Here, the optimization method finetunes the hyperparameters of each CNN model to minimize their training error. Ultimately, the results of two Convolutional Neural Network (CNN) models are combined to determine the music genre using a classifier based on SoftMax. The efficacy of the suggested methodology in categorizing music genres has been assessed using the GTZAN and Extended-Ballroom datasets. The experimental findings demonstrated that the suggested approach achieved classification accuracies of 95.2 % and 95.7 % in the two datasets, respectively, indicating its superiority over earlier efforts.
引用
收藏
页数:13
相关论文
共 50 条
[21]   Deep Neural Networks with Depthwise Separable Convolution for Music Genre Classification [J].
Liang, Yunming ;
Zhou, Yi ;
Wan, Tongtang ;
Shu, Xiaofeng .
2019 2ND IEEE INTERNATIONAL CONFERENCE ON INFORMATION COMMUNICATION AND SIGNAL PROCESSING (ICICSP), 2019, :267-270
[22]   Texture selection for automatic music genre classification [J].
Foleis, Juliano Henrique ;
Tavares, Tiago Fernandes .
APPLIED SOFT COMPUTING, 2020, 89
[23]   Machine Learning for Music Genre Classification Using Visual Mel Spectrum [J].
Cheng, Yu-Huei ;
Kuo, Che-Nan .
MATHEMATICS, 2022, 10 (23)
[24]   Representation Learning vs. Handcrafted Features for Music Genre Classification [J].
Pereira, Rodolfo M. ;
Costa, Yandre M. G. ;
Aguiar, Rafael L. ;
Britto, Alceu S., Jr. ;
Oliveira, Luiz E. S. ;
Silla, Carlos N., Jr. .
2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
[25]   Improved Music Genre Classification with Convolutional Neural Networks [J].
Zhang, Weibin ;
Lei, Wenkang ;
Xu, Xiangmin ;
Xing, Xiaofeng .
17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, :3304-3308
[26]   Music genre classification via likelihood fusion from multiple feature models [J].
Shiu, Y ;
Kuo, CCJ .
STORAGE AND RETRIEVAL METHODS AND APPLICATIONS FOR MULTIMEDIA 2005, 2005, 5682 :258-268
[27]   Sketch Classification with Deep Learning Models [J].
Eyiokur, Fevziye Irem ;
Yaman, Dogucan ;
Ekenel, Hazim Kemal .
2018 26TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2018,
[28]   Music Genre Classification Based on Paraconsistency [J].
Silva Paulo, Katia Cristina ;
Solgon Bassi, Regiane Denise ;
Delorme, Andre Luis ;
Guido, Rodrigo Capobianco ;
da Silva, Ivan Nunes .
2ND INTERNATIONAL CONFERENCE ON ADVANCED EDUCATION TECHNOLOGY AND MANAGEMENT SCIENCE (AETMS 2014), 2015, :427-431
[29]   Genre classification of symbolic pieces of music [J].
Armentano, Marcelo G. ;
De Noni, Walter A. ;
Cardoso, Hernan F. .
JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2017, 48 (03) :579-599
[30]   EXPLOITING GENRE FOR MUSIC EMOTION CLASSIFICATION [J].
Lin, Yu-Ching ;
Yang, Yi-Hsuan ;
Chen, Homer H. ;
Liao, I-Bin ;
Ho, Yeh-Chin .
ICME: 2009 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-3, 2009, :618-+