1D CNN Architectures for Music Genre Classification

被引:17
作者
Allamy, Safaa [1 ]
Koerich, Alessandro Lameiras [1 ]
机构
[1] Univ Quebec, Ecole Technol Super, Montreal, PQ, Canada
来源
2021 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI 2021) | 2021年
关键词
Convolutional neural networks; deep learning; audio processing;
D O I
10.1109/SSCI50451.2021.9659979
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes a 1D residual convolutional neural network (CNN) architecture for music genre classification and compares it with other recent 1D CNN architectures. The 1D CNNs learn a representation and a discriminant directly from the raw audio signal. Several convolutional layers capture the time-frequency characteristics of the audio signal and learn various filters relevant to the music genre recognition task. The proposed approach splits the audio signal into overlapped segments using a sliding window to comply with the fixed-length input constraint of the 1D CNNs. As a result, music genre classification can be carried out on a single audio segment or on aggregating the predictions on several audio segments, which improves the final accuracy. The performance of the proposed 1D residual CNN is assessed on a public dataset of 1,000 audio clips. The experimental results have shown that it achieves 80.93% of mean accuracy in classifying music genres and outperforms other 1D CNN architectures.
引用
收藏
页数:7
相关论文
共 50 条
[41]   Automatic Music Genre Classification using Convolution Neural Network [J].
Vishnupriya, S. ;
Meenakshi, K. .
2018 INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND INFORMATICS (ICCCI), 2018,
[42]   Music Genre Classification Using Independent Recurrent Neural Network [J].
Wu, Wenli ;
Song, Guangxiao ;
Wang, Zhijie ;
Han, Fang .
2018 CHINESE AUTOMATION CONGRESS (CAC), 2018, :192-195
[43]   Music Genre Classification Based on VMD-IWOA-XGBOOST [J].
Gan, Rumeijiang ;
Huang, Tichen ;
Shao, Jin ;
Wang, Fuyu .
MATHEMATICS, 2024, 12 (10)
[44]   Music Genre Classification Based on Chroma Features and Deep Learning [J].
Shi, Leisi ;
Li, Chen ;
Tian, Lihua .
2019 TENTH INTERNATIONAL CONFERENCE ON INTELLIGENT CONTROL AND INFORMATION PROCESSING (ICICIP), 2019, :81-86
[45]   Music Genre Classification and Recommendation by Using Machine Learning Techniques [J].
Elbir, Ahmet ;
Cam, Hilmi Bilal ;
Iyican, Mehmet Emre ;
Ozturk, Berkay ;
Aydin, Nizamettin .
2018 INNOVATIONS IN INTELLIGENT SYSTEMS AND APPLICATIONS CONFERENCE (ASYU), 2018, :135-139
[46]   A Comparative Study of DenseNets for Vietnamese Traditional Music Genre Classification [J].
Huy Nhat Nguyen ;
Hung Thanh Le ;
Quan Anh Mai ;
Dung Anh Huvnh ;
Thanh Nhat Tieu ;
Hung Tung Bui ;
Huy Quang .
2024 21ST INTERNATIONAL JOINT CONFERENCE ON COMPUTER SCIENCE AND SOFTWARE ENGINEERING, JCSSE 2024, 2024, :16-21
[47]   Low-complexity CNN with 1D and 2D filters for super-resolution [J].
Jangsoo Park ;
Jongseok Lee ;
Donggyu Sim .
Journal of Real-Time Image Processing, 2020, 17 :2065-2076
[48]   Low-complexity CNN with 1D and 2D filters for super-resolution [J].
Park, Jangsoo ;
Lee, Jongseok ;
Sim, Donggyu .
JOURNAL OF REAL-TIME IMAGE PROCESSING, 2020, 17 (06) :2065-2076
[49]   An intelligent weather prediction model using optimized 1D CNN with attention GRU [J].
Hemamalini, S. ;
Rani, K. Geetha ;
Rajasekar, B. ;
Sendil, Sadish M. .
GLOBAL NEST JOURNAL, 2024, 26 (02)
[50]   Gesture-Based Drone Control Using Wearable Data and 1D CNN [J].
Leonardo, Diogo ;
Custodio, Joao ;
Ribeiro, Roberto ;
Rodrigues, Nuno ;
Ramos, Joao ;
Pereira, Antonio .
2024 INTERNATIONAL CONFERENCE ON GRAPHICS AND INTERACTION, ICGI, 2024, :118-124