Exploring Data Augmentation to Improve Music Genre Classification with ConvNets

被引:0
作者
Aguiar, Rafael L. [1 ]
Costa, Yandre M. G. [2 ]
Silla Jr, Carlos N. [1 ]
机构
[1] Pontifical Catholic Univ Parana PUCPR, Postgrad Program Informat PPGIa, Curitiba, Parana, Brazil
[2] State Univ Maringa UEM, Grad Program Comp Sci PCC, Maringa, Parana, Brazil
来源
2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN) | 2018年
关键词
Data augmentation; Music information retrieval; Automatic music genre classification; Spectrograms; Deep learning; Convolutional Neural Networs; CONVOLUTIONAL NEURAL-NETWORKS; FEATURES;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this work we address the automatic music genre classification as a pattern recognition task. The content of the music pieces were handled in the visual domain, using spectrograms created from the audio signal. This kind of image has been successfully used in this task since 2011 by extracting handcrafted features based on texture, since it is the main visual attribute found in spectrograms. In this work, the patterns were described by representation learning obtained with the use of convolutional neural network (CNN). CNN is a deep learning architecture and it has been widely used in the pattern recognition literature. Overfitting is a recurrent problem when a classification task is addressed by using CNN, it may occur due to the lack of training samples and/or due to the high dimensionality of the space. To increase the generalization capability we propose to explore data augmentation techniques. In this work, we have carefully selected strategies of data augmentation that are suitable for this kind of application, which are: adding noise, pitch shifting, loudness variation and time stretching. Experiments were conducted on the Latin Music Database (LMD), and the best obtained accuracy overcame the state of the art considering approaches based only in CNN.
引用
收藏
页数:8
相关论文
共 50 条
  • [31] Music Genre Classification Using Contrastive Dissimilarity
    Costanzi, Gabriel Henrique
    Teixeira, Lucas O.
    Felipe, Gustavo Z.
    Cavalcanti, George D. C.
    Costa, Yandre M. G.
    2024 31ST INTERNATIONAL CONFERENCE ON SYSTEMS, SIGNALS AND IMAGE PROCESSING, IWSSIP 2024, 2024,
  • [32] Brain tumors classification with deep learning using data augmentation
    Gurkahraman, Kali
    Karakis, Rukiye
    JOURNAL OF THE FACULTY OF ENGINEERING AND ARCHITECTURE OF GAZI UNIVERSITY, 2021, 36 (02): : 997 - 1011
  • [33] Robustness of musical features on deep learning models for music genre classification
    Singh, Yeshwant
    Biswas, Anupam
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 199
  • [34] Augmentation Methods on Monophonic Audio for Instrument Classification in Polyphonic Music
    Kratimenos, Agelos
    Avramidis, Kleanthis
    Garoufis, Christos
    Zlatintsi, Athanasia
    Maragos, Petros
    28TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2020), 2021, : 156 - 160
  • [35] Data Augmentation: Using Channel-Level Recombination to Improve Classification Performance for Motor Imagery EEG
    Pei, Yu
    Luo, Zhiguo
    Yan, Ye
    Yan, Huijiong
    Jiang, Jing
    Li, Weiguo
    Xie, Liang
    Yin, Erwei
    FRONTIERS IN HUMAN NEUROSCIENCE, 2021, 15
  • [36] Music Feature Maps with Convolutional Neural Networks for Music Genre Classification
    Senac, Christine
    Pellegrini, Thomas
    Mouret, Florian
    Pinquier, Julien
    PROCEEDINGS OF THE 15TH INTERNATIONAL WORKSHOP ON CONTENT-BASED MULTIMEDIA INDEXING (CBMI), 2017,
  • [37] Data Augmentation in Histopathological Classification: An Analysis Exploring GANs with XAI and Vision Transformers
    Rozendo, Guilherme Botazzo
    Garcia, Bianca Lanconi de Oliveira
    Borgue, Vinicius Augusto Toreli
    Lumini, Alessandra
    Tosta, Thaina Aparecida Azevedo
    do Nascimento, Marcelo Zanchetta
    Neves, Leandro Alves
    APPLIED SCIENCES-BASEL, 2024, 14 (18):
  • [38] Data augmentation strategies to improve text classification: a use case in smart cities
    Bencke, Luciana
    Moreira, Viviane Pereira
    LANGUAGE RESOURCES AND EVALUATION, 2024, 58 (02) : 659 - 694
  • [39] Data augmentation strategies to improve text classification: a use case in smart cities
    Bencke, Luciana
    Moreira, Viviane Pereira
    LANGUAGE RESOURCES AND EVALUATION, 2023,
  • [40] PreAugNet: improve data augmentation for industrial defect classification with small-scale training data
    Farady, Isack
    Lin, Chih-Yang
    Chang, Ming-Ching
    JOURNAL OF INTELLIGENT MANUFACTURING, 2024, 35 (03) : 1233 - 1246