1D CNN Architectures for Music Genre Classification

Cited by: 13
Authors
Allamy, Safaa [1]
Koerich, Alessandro Lameiras [1]
Affiliations
[1] Univ Quebec, Ecole Technol Super, Montreal, PQ, Canada
Source
2021 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI 2021) | 2021
Keywords
Convolutional neural networks; deep learning; audio processing
DOI
10.1109/SSCI50451.2021.9659979
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper proposes a 1D residual convolutional neural network (CNN) architecture for music genre classification and compares it with other recent 1D CNN architectures. The 1D CNNs learn both a representation and a discriminant directly from the raw audio signal. Several convolutional layers capture the time-frequency characteristics of the audio signal and learn various filters relevant to the music genre recognition task. Because the 1D CNNs require a fixed-length input, the proposed approach uses a sliding window to split the audio signal into overlapping segments. As a result, music genre classification can be carried out on a single audio segment or by aggregating the predictions over several audio segments, which improves the final accuracy. The performance of the proposed 1D residual CNN is assessed on a public dataset of 1,000 audio clips. The experimental results show that it achieves a mean accuracy of 80.93% in classifying music genres and outperforms other 1D CNN architectures.
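The segmentation and aggregation steps described in the abstract can be illustrated with a minimal sketch. The abstract does not state the window length, the overlap, or the aggregation rule, so everything here is an assumption made for illustration: the function names, the `window` and `hop` parameters, and the two aggregation rules (averaging per-segment class probabilities, and majority voting) are hypothetical and are not taken from the paper. The `model` argument stands in for any classifier that maps one fixed-length segment to a list of class probabilities.

```python
from collections import Counter
from typing import Callable, List, Sequence


def split_into_segments(signal: Sequence[float], window: int, hop: int) -> List[List[float]]:
    """Slide a fixed-length window over the signal.

    A hop smaller than the window yields overlapping segments.
    Trailing samples that do not fill a whole window are dropped.
    """
    if window <= 0 or hop <= 0:
        raise ValueError("window and hop must be positive")
    return [
        list(signal[start:start + window])
        for start in range(0, len(signal) - window + 1, hop)
    ]


def aggregate_mean(probabilities: Sequence[Sequence[float]]) -> int:
    """Average the per-segment class probabilities; return the top class index."""
    n_classes = len(probabilities[0])
    mean = [
        sum(p[c] for p in probabilities) / len(probabilities)
        for c in range(n_classes)
    ]
    return max(range(n_classes), key=mean.__getitem__)


def aggregate_vote(probabilities: Sequence[Sequence[float]]) -> int:
    """Let each segment vote for its top class; return the most frequent class index."""
    votes = Counter(max(range(len(p)), key=p.__getitem__) for p in probabilities)
    return votes.most_common(1)[0][0]


def classify_clip(
    signal: Sequence[float],
    model: Callable[[List[float]], Sequence[float]],
    window: int,
    hop: int,
) -> int:
    """Classify a whole clip by aggregating the predictions of its segments."""
    segments = split_into_segments(signal, window, hop)
    if not segments:
        raise ValueError("signal is shorter than one window")
    return aggregate_mean([model(segment) for segment in segments])
```

The two rules can disagree: one very confident segment can outweigh several weakly confident ones under averaging, while voting counts each segment equally. Which rule the authors used, if either, is not stated in the abstract.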
Pages: 7
Related papers
50 in total
  • [1] LAND COVER CLASSIFICATION FOR SATELLITE IMAGES THROUGH 1D CNN
    Song, Yang
    Zhang, Zhifei
    Baghbaderani, Razieh Kaviani
    Wang, Fanqi
    Qu, Ying
    Stutts, Craig
    Qi, Hairong
    2019 10TH WORKSHOP ON HYPERSPECTRAL IMAGING AND SIGNAL PROCESSING - EVOLUTION IN REMOTE SENSING (WHISPERS), 2019,
  • [2] A Storyteller's tale: Literature audiobooks Genre classification using CNN and RNN architectures
    Carmi, Nehory
    Cohen, Azaria
    Avigal, Mireille
    Lerner, Anat
    INTERSPEECH 2019, 2019, : 3387 - 3390
  • [3] A Hybrid Parallel Computing Architecture Based on CNN and Transformer for Music Genre Classification
    Chen, Jiyang
    Ma, Xiaohong
    Li, Shikuan
    Ma, Sile
    Zhang, Zhizheng
    Ma, Xiaojing
    ELECTRONICS, 2024, 13 (16)
  • [4] 1D CNN with BLSTM for automated classification of fixations, saccades, and smooth pursuits
    Startsev, Mikhail
    Agtzidis, Ioannis
    Dorr, Michael
    BEHAVIOR RESEARCH METHODS, 2019, 51 (02) : 556 - 572
  • [5] 1D CNN with BLSTM for automated classification of fixations, saccades, and smooth pursuits
    Mikhail Startsev
    Ioannis Agtzidis
    Michael Dorr
    Behavior Research Methods, 2019, 51 : 556 - 572
  • [6] MS-SincResNet: Joint learning of 1D and 2D kernels using multi-scale SincNet and ResNet for music genre classification
    Chang, Pei-Chun
    Chen, Yong-Sheng
    Lee, Chang-Hsing
    PROCEEDINGS OF THE 2021 INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL (ICMR '21), 2021, : 29 - 36
  • [7] Evolving and Ensembling Deep CNN Architectures for Image Classification
    Fielding, Ben
    Lawrence, Tom
    Zhang, Li
    2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [8] Neural Network Music Genre Classification
    Pelchat, Nikki
    Gelowitz, Craig M.
    CANADIAN JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING-REVUE CANADIENNE DE GENIE ELECTRIQUE ET INFORMATIQUE, 2020, 43 (03) : 170 - 173
  • [9] Kurdish Dialect Recognition using 1D CNN
    Ghafoor, Karzan J.
    Rawf, Karwan M. Hama
    Abdulrahman, Ayub O.
    Taher, Sarkhel H.
    ARO-THE SCIENTIFIC JOURNAL OF KOYA UNIVERSITY, 2021, 9 (02):
  • [10] A Short Survey and Comparison of CNN-Based Music Genre Classification Using Multiple Spectral Features
    Seo, Wangduk
    Cho, Sung-Hyun
    Teisseyre, Paweł
    Lee, Jaesung
    IEEE ACCESS, 2024, 12 : 245 - 257