Optimizing the configuration of deep learning models for music genre classification

Cited by: 5
Authors
Li, Teng [1 ]
Affiliations
[1] Pingdingshan Polytenchn Coll, Acad Arts, Pingdingshan 467000, Henan, Peoples R China
Keywords
Deep reinforcement learning; Convolutional neural network; Signal processing; Music genre classification;
DOI
10.1016/j.heliyon.2024.e24892
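Chinese Library Classification (CLC) number
O [Mathematical Sciences and Chemistry]; P [Astronomy and Earth Sciences]; Q [Biological Sciences]; N [General Natural Sciences];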
Subject classification code
07 ; 0710 ; 09 ;
Abstract
Music genre classification is a core application of audio signal processing in music retrieval. Genres are typically labeled by people, and machine learning approaches can automate this process. Several approaches have been proposed in recent years, but the reported results still fall short of an ideal classification method. This paper therefore introduces a deep learning approach for predicting music genres accurately. The approach preprocesses the input signals and then represents each signal with a combination of Mel Frequency Cepstral Coefficient (MFCC) and Short-Time Fourier Transform (STFT) features. Each of the two feature sets is then processed by its own convolutional neural network (CNN): one model for the MFCC data and one for the STFT data. Although the two models share the same structure, the hyperparameters of each are tuned individually with the black hole optimization (BHO) algorithm, which adjusts them to minimize that model's training error. Finally, the outputs of the two CNN models are combined by a SoftMax-based classifier to determine the music genre. The method was evaluated on the GTZAN and Extended-Ballroom datasets, where it achieved classification accuracies of 95.2% and 95.7%, respectively, outperforming earlier work.
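The abstract does not say which BHO variant or which hyperparameters were tuned, so the following is only a minimal sketch of a basic black hole search. It uses the common formulation in which candidate "stars" drift toward the best solution found so far and are re-sampled once they cross an event horizon. The function names, the two-dimensional search space (log10 learning rate and dropout rate), and the `toy_error` objective are all assumptions for illustration. In practice the objective would train a CNN and return its training error.

```python
import math
import random


def black_hole_optimize(objective, bounds, n_stars=20, n_iter=100, seed=0):
    """Minimize `objective` inside box `bounds` with a basic black hole search."""
    rng = random.Random(seed)

    def random_star():
        return [rng.uniform(lo, hi) for lo, hi in bounds]

    def clip(star):
        return [min(max(x, lo), hi) for x, (lo, hi) in zip(star, bounds)]

    stars = [random_star() for _ in range(n_stars)]
    costs = [objective(s) for s in stars]
    best = min(range(n_stars), key=costs.__getitem__)
    black_hole, bh_cost = stars[best][:], costs[best]

    for _ in range(n_iter):
        for i in range(n_stars):
            # Pull each coordinate toward the black hole by a random fraction.
            stars[i] = clip([x + rng.random() * (b - x)
                             for x, b in zip(stars[i], black_hole)])
            costs[i] = objective(stars[i])
            # A star that beats the black hole becomes the new black hole.
            if costs[i] < bh_cost:
                black_hole, bh_cost = stars[i][:], costs[i]

        # Event horizon radius: black hole cost divided by total star cost.
        total = sum(costs)
        radius = bh_cost / total if total > 0 else 0.0
        for i in range(n_stars):
            if math.dist(stars[i], black_hole) < radius:
                # A swallowed star is replaced by a fresh random candidate.
                stars[i] = random_star()
                costs[i] = objective(stars[i])
                if costs[i] < bh_cost:
                    black_hole, bh_cost = stars[i][:], costs[i]

    return black_hole, bh_cost


def toy_error(h):
    """Hypothetical stand-in for CNN training error; minimum 0.05 at (-3, 0.3)."""
    log_lr, dropout = h
    return (log_lr + 3.0) ** 2 + (dropout - 0.3) ** 2 + 0.05


# Assumed search space: log10(learning rate) in [-5, -1], dropout in [0, 0.8].
BOUNDS = [(-5.0, -1.0), (0.0, 0.8)]
best, cost = black_hole_optimize(toy_error, BOUNDS)
print("best hyperparameters:", best, "error:", cost)
```

Following the abstract, this search would be run once per model, so the MFCC network and the STFT network each end up with their own hyperparameters.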
Pages: 13
Related papers
50 in total
  • [21] Deep Belief Networks for Automatic Music Genre Classification
    Yang, Xiaohong
    Chen, Qingcai
    Zhou, Shusen
    Wang, Xiaolong
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2444 - 2447
  • [22] "Multilingual" Deep Neural Network For Music Genre Classification
    Dai, Jia
    Liu, Wenju
    Ni, Chongjia
    Dong, Like
    Yang, Hong
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2907 - 2911
  • [23] Automatic Song Genre Classification in Bengali Music: A Comparative Study of Machine Learning and Deep Learning Approaches
    Humayra, Atika
    Sohag, Md Maruf Kamran
    Anwer, Mohammed
    Hasan, Mahady
    2024 4TH INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND ARTIFICIAL INTELLIGENCE, CCAI 2024, 2024, : 273 - 277
  • [24] Co-occurrence models in music genre classification
    Ahrendt, P
    Larsen, J
    Goutte, C
    2005 IEEE WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2005, : 247 - 252
  • [25] MUSIC GENRE CLASSIFICATION USING GAUSSIAN PROCESS MODELS
    Markov, Konstantin
    Matsui, Tomoko
    2013 IEEE INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2013,
  • [26] Music Genre Classification Using Polyphonic Timbre Models
    de Leon, Franz A.
    Martinez, Kirk
    2014 19TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING (DSP), 2014, : 415 - 420
  • [27] Deep learning for video game genre classification
    Jiang, Yuhang
    Zheng, Lukun
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (14) : 21085 - 21099
  • [28] Deep learning for video game genre classification
    Yuhang Jiang
    Lukun Zheng
    Multimedia Tools and Applications, 2023, 82 : 21085 - 21099
  • [29] A machine learning approach to automatic music genre classification
    Silla, Carlos N.
    Koerich, Alessandro L.
    Kaestner, Celso A. A.
    Journal of the Brazilian Computer Society, 2008, 14 (03) : 7 - 18
  • [30] An intelligent music genre analysis using feature extraction and classification using deep learning techniques
    Wang Hongdan
    SalmiJamali, Siti
    Chen Zhengping
    Shan Qiaojuan
    Ren Le
    COMPUTERS & ELECTRICAL ENGINEERING, 2022, 100