Optimizing the configuration of deep learning models for music genre classification

被引:5
|
作者
Li, Teng [1 ]
机构
[1] Pingdingshan Polytenchn Coll, Acad Arts, Pingdingshan 467000, Henan, Peoples R China
关键词
Deep reinforcement learning; Convolutional neural network; Signal processing; Music genre classification;
D O I
10.1016/j.heliyon.2024.e24892
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Music genre categorization is a fundamental use of sound processing methods in the realm of music retrieval. Typically, people are responsible for categorizing music genres. Machine learning approaches can automate this procedure. Therefore, in recent years, several approaches have been suggested to achieve this objective. Nevertheless, the given findings indicate that there is still a discrepancy between the observed results and an optimal categorization method. Hence, this paper introduces a novel approach for accurately forecasting music genres by using deep learning methodologies. The proposed approach involves preprocessing the input signals and then representing the characteristics of each signal using a combination of Mel Frequency Cepstral Coefficients (MFCC) and Short-Time Fourier Transform (STFT) features. Subsequently, a convolutional neural network (CNN) is applied to process each group of these characteristics. The proposed technique utilizes two CNN models to analyze MFCC and STFT data. Although the structure of these models is identical, the hyper-parameters of each model are individually adjusted using the black hole optimization (BHO) algorithm. Here, the optimization method finetunes the hyperparameters of each CNN model to minimize their training error. Ultimately, the results of two Convolutional Neural Network (CNN) models are combined to determine the music genre using a classifier based on SoftMax. The efficacy of the suggested methodology in categorizing music genres has been assessed using the GTZAN and Extended-Ballroom datasets. The experimental findings demonstrated that the suggested approach achieved classification accuracies of 95.2 % and 95.7 % in the two datasets, respectively, indicating its superiority over earlier efforts.
引用
收藏
页数:13
相关论文
共 50 条
  • [31] EVALUATION OF PARALLEL AND SEQUENTIAL DEEP LEARNING MODELS FOR MUSIC SUBGENRE CLASSIFICATION
    Feng, Miria
    Feng, Wenying
    MATHEMATICAL FOUNDATIONS OF COMPUTING, 2021, 4 (02): : 131 - 143
  • [32] Deep Neural Networks: A Case Study for Music Genre Classification
    Rajanna, Arjun Raj
    Aryafar, Kamelia
    Shokoufandeh, Ali
    Ptucha, Raymond
    2015 IEEE 14TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2015, : 655 - 660
  • [33] Dissecting the genre of Nigerian music with machine learning models
    Folorunso, Sakinat O.
    Afolabi, Sulaimon A.
    Owodeyi, Adeoye B.
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2022, 34 (08) : 6266 - 6279
  • [34] Web Application for Machine Learning based Music Genre Classification
    Chauhan, Jugal
    Shah, Jash
    Mundhe, Eeshan
    Jain, Ishan
    2021 7th IEEE International Conference on Advances in Computing, Communication and Control, ICAC3 2021, 2021,
  • [35] A Middle-Level Learning Feature Interaction Method with Deep Learning for Multi-Feature Music Genre Classification
    Liu, Jinliang
    Wang, Changhui
    Zha, Lijuan
    ELECTRONICS, 2021, 10 (18)
  • [36] Music Genre Classification using On-line Dictionary Learning
    Srinivas, M.
    Roy, Debaditya
    Mohan, C. Krishna
    PROCEEDINGS OF THE 2014 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2014, : 1937 - 1941
  • [37] Machine Learning Evaluation for Music Genre Classification of Audio Signals
    Dabas, Chetna
    Agarwal, Aditya
    Gupta, Naman
    Jain, Vaibhav
    Pathak, Siddhant
    INTERNATIONAL JOURNAL OF GRID AND HIGH PERFORMANCE COMPUTING, 2020, 12 (03) : 57 - 67
  • [38] Applying supervised learning techniques to Brazilian music genre classification
    2020 XLVI LATIN AMERICAN COMPUTING CONFERENCE (CLEI 2020), 2021, : 102 - 107
  • [39] Statistical and Deep Learning Approaches for Literary Genre Classification
    Goyal, Anshaj
    Prakash, V. Prem
    ADVANCES IN DATA AND INFORMATION SCIENCES, 2022, 318 : 297 - 305
  • [40] Genre Classification using Word Embeddings and Deep Learning
    Kumar, Akshi
    Rajpal, Arjun
    Rathore, Dushyant
    2018 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2018, : 2142 - 2146