Generative Adversarial Networks Based Framework for Music Genre Classification

被引:0
|
作者
Pulkit Dwivedi [1 ]
Benazir Islam [2 ]
机构
[1] School of Computer Science and Engineering, IILM University, Greater Noida
[2] New Jersey Institute of Technology, Newark
关键词
Classification models; Deep learning; Feature extraction; Generative adversarial networks (GANs); Music genre classification;
D O I
10.1007/s42979-024-03531-8
中图分类号
学科分类号
摘要
Music genre classification plays a crucial role in organizing and exploring large music collections, enabling personalized music recommendations, and enhancing music-related services. This paper presents a novel approach to music genre classification using Generative Adversarial Networks (GANs), Fourier Transform, and Wavelet Transform. The main objective is to leverage the power of GANs to extract discriminative features from audio data and accurately classify music into different genres. The proposed methodology involves two key components: the generator and the discriminator. The generator generates synthetic audio samples that resemble real music, while the discriminator learns to distinguish between real and synthetic audio samples. By training the GAN on a diverse dataset of music samples from various genres, the discriminator becomes proficient in recognizing genre-specific features. To enhance classification accuracy, Fourier Transform and Wavelet Transform are applied to extract both frequency and time-domain features from the audio data. Additionally, classifiers such as support vector machines and neural networks are employed to effectively distinguish between different music genres. The experimental results demonstrate the effectiveness of the proposed approach across multiple datasets. The method achieves 98.97% accuracy on the GTZAN dataset, 92.47% accuracy on the FMA-Small dataset, and 92.98% accuracy on the ISMIR Genre dataset, significantly outperforming traditional classification methods These results highlight the power of GANs, Fourier Transform, and Wavelet Transform in enhancing the accuracy and robustness of music genre classification. © The Author(s), under exclusive licence to Springer Nature Singapore Pte Ltd. 2024.
引用
收藏
相关论文
共 50 条
  • [1] Distance Constraint-Based Generative Adversarial Networks for Hyperspectral Image Classification
    Qin, Anyong
    Tan, Zhuolin
    Wang, Ran
    Sun, Yongqing
    Yang, Feng
    Zhao, Yue
    Gao, Chenqiang
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [2] Generative Adversarial Networks for Classification
    Israel, Steven A.
    Goldstein, J. H.
    Klein, Jeffrey S.
    Talamonti, James
    Tanner, Franklin
    Zabel, Shane
    Sallee, Philip A.
    McCoy, Lisa
    2017 IEEE APPLIED IMAGERY PATTERN RECOGNITION WORKSHOP (AIPR), 2017,
  • [3] Recurrent Neural Networks for Music Genre Classification
    Kakarla, Chaitanya
    Eshwarappa, Vidyashree
    Saheer, Lakshmi Babu
    Oghaz, Mahdi Maktabdar
    ARTIFICIAL INTELLIGENCE XXXIX, AI 2022, 2022, 13652 : 267 - 279
  • [4] Semisupervised Hyperspectral Image Classification Based on Generative Adversarial Networks
    Zhan, Ying
    Hu, Dan
    Wang, Yuntao
    Yu, Xianchuan
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2018, 15 (02) : 212 - 216
  • [5] Cancer classification with data augmentation based on generative adversarial networks
    Wei, Kaimin
    Li, Tianqi
    Huang, Feiran
    Chen, Jinpeng
    He, Zefan
    FRONTIERS OF COMPUTER SCIENCE, 2022, 16 (02)
  • [6] Cancer classification with data augmentation based on generative adversarial networks
    Kaimin Wei
    Tianqi Li
    Feiran Huang
    Jinpeng Chen
    Zefan He
    Frontiers of Computer Science, 2022, 16
  • [7] Cancer classification with data augmentation based on generative adversarial networks
    WEI Kaimin
    LI Tianqi
    HUANG Feiran
    CHEN Jinpeng
    HE Zefan
    Frontiers of Computer Science, 2022, 16 (02)
  • [8] Generative Adversarial Networks in Retinal Image Classification
    Mercaldo, Francesco
    Brunese, Luca
    Martinelli, Fabio
    Santone, Antonella
    Cesarelli, Mario
    APPLIED SCIENCES-BASEL, 2023, 13 (18):
  • [9] Generative Adversarial Networks for Hyperspectral Image Classification
    Zhu, Lin
    Chen, Yushi
    Ghamisi, Pedram
    Benediktsson, Jon Atli
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2018, 56 (09): : 5046 - 5063
  • [10] Classification of Hyperspectral Images via Multitask Generative Adversarial Networks
    Hang, Renlong
    Zhou, Feng
    Liu, Qingshan
    Ghamisi, Pedram
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2021, 59 (02): : 1424 - 1436