Music Feature Maps with Convolutional Neural Networks for Music Genre Classification

被引:30
作者
Senac, Christine [1 ]
Pellegrini, Thomas [1 ]
Mouret, Florian [1 ]
Pinquier, Julien [1 ]
机构
[1] Univ Toulouse, IRIT, 118 Route Narbonne, F-31062 Toulouse, France
来源
PROCEEDINGS OF THE 15TH INTERNATIONAL WORKSHOP ON CONTENT-BASED MULTIMEDIA INDEXING (CBMI) | 2017年
关键词
convolutional neural networks; music features; music classification;
D O I
10.1145/3095713.3095733
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Nowadays, deep learning is more and more used for Music Genre Classification: particularly Convolutional Neural Networks (CNN) taking as entry a spectrogram considered as an image on which are sought different types of structure. But, facing the criticism relating to the difficulty in understanding the underlying relationships that neural networks learn in presence of a spectrogram, we propose to use, as entries of a CNN, a small set of eight music features chosen along three main music dimensions: dynamics, timbre and tonality. With CNNs trained in such a way that filter dimensions are interpretable in time and frequency, results show that only eight music features are more efficient than 513 frequency bins of a spectrogram and that late score fusion between systems based on both feature types reaches 91% accuracy on the GTZAN database.
引用
收藏
页数:5
相关论文
共 23 条
  • [1] [Anonymous], 2010, P 13 INT C ART INT S
  • [2] Bergstra James, 2006, MACHINE LEARNING J M
  • [3] Music genre classification using LBP textural features
    Costa, Y. M. G.
    Oliveira, L. S.
    Koerich, A. L.
    Gouyon, F.
    Martins, J. G.
    [J]. SIGNAL PROCESSING, 2012, 92 (11) : 2723 - 2737
  • [4] An evaluation of Convolutional Neural Networks for music classification using spectrograms
    Costa, Yandre M. G.
    Oliveira, Luiz S.
    Silla, Carlos N., Jr.
    [J]. APPLIED SOFT COMPUTING, 2017, 52 : 28 - 38
  • [5] 1977 RIETZ LECTURE - BOOTSTRAP METHODS - ANOTHER LOOK AT THE JACKKNIFE
    EFRON, B
    [J]. ANNALS OF STATISTICS, 1979, 7 (01) : 1 - 26
  • [6] Glorot X., 2011, 14 INT C ART INT STA, P315, DOI DOI 10.1177/1753193410395357
  • [7] SPECTRAL-FLATNESS MEASURE FOR STUDYING AUTOCORRELATION METHOD OF LINEAR PREDICTION OF SPEECH ANALYSIS
    GRAY, AH
    MARKEL, JD
    [J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1974, AS22 (03): : 207 - 217
  • [8] Harte Christopher, 2007, P 1 ACM WORKSH AUD M, P21
  • [9] He K., 2016, INDIAN J CHEM B
  • [10] A Matlab toolbox for Music Information Retrieval
    Lartillot, Olivier
    Toiviainen, Petri
    Eerola, Tuomas
    [J]. DATA ANALYSIS, MACHINE LEARNING AND APPLICATIONS, 2008, : 261 - 268