Music genre classification using deep learning: a comparative analysis of CNNs and RNNs

被引:0
作者
Xu, Wenyi [1 ]
机构
[1] Early Childhood Education College, Huaihua Normal College, Hunan, Huaihua
关键词
CNN; Deep learning; Feature extraction; Music genre classification; RNN;
D O I
10.2478/amns-2024-3309
中图分类号
学科分类号
摘要
Music categorization as a key component of music information retrieval is an important means of dealing with massive amounts of music information. Based on this, this paper proposes the use of deep learning algorithms to automatically identify music genre factions in a classification method. After denoising the audio using wavelet thresholding, the method of combining audio characteristics and spectral image features is used to enhance the audio data, short-time Fourier transforms and Meier transforms are used to realize audio feature extraction, and then convolutional neural network and recurrent neural network are used as the classifiers respectively to construct the music genre classification model. The GTZAN dataset is used as an experimental object to explore the effects of data enhancement and feature extraction on the music classification effect and to compare and analyze the performance of the CNN model and the RNN model. When the data enhancement methods based on audio features and image features are used simultaneously, the music classification accuracy of the model is above 90%, and the model using a short-time Fourier transform to extract audio features has a better classification effect. The CNN model is 1.3% more effective than the RNN model in classifying music genres, and this result is still consistent even if the audio is processed with variable speed pitch change and structural change, so the CNN model has better music genre classification. © 2024 Wenyi Xu., published by Sciendo.
引用
收藏
相关论文
共 23 条
[1]  
Pelchat N., Gelowitz C.M., Neural network music genre classification, Canadian Journal of Electrical and Computer Engineering, 43, 3, pp. 170-173, (2020)
[2]  
Oramas S., Barbieri F., Nieto Caballero O., Serra X., Multimodal deep learning for music genre classification, Transactions of the International Society for Music Information Retrieval, 1, 1, pp. 4-21, (2018)
[3]  
Costa Y.M., Oliveira L.S., Koerich A.L., Gouyon F., Martins J.G., Music genre classification using LBP textural features, Signal Processing, 92, 11, pp. 2723-2737, (2012)
[4]  
Nanni L., Costa Y.M., Lumini A., Kim M.Y., Baek S.R., Combining visual and acoustic features for music genre classification, Expert Systems with Applications, 45, pp. 108-117, (2016)
[5]  
Ghildiyal A., Singh K., Sharma S., Music genre classification using machine learning, 2020 4th international conference on electronics, communication and aerospace technology (ICECA), pp. 1368-1372, (2020)
[6]  
Ndou N., Ajoodha R., Jadhav A., Music genre classification: A review of deep-learning and traditional machine-learning approaches, 2021 IEEE International IOT, Electronics and Mechatronics Conference (IEMTRONICS), pp. 1-6, (2021)
[7]  
Elbir A., Aydin N., Music genre classification and music recommendation by using deep learning, Electronics Letters, 56, 12, pp. 627-629, (2020)
[8]  
Brunner G., Wang Y., Wattenhofer R., Zhao S., Symbolic music genre transfer with cyclegan, 2018 ieee 30th international conference on tools with artificial intelligence (ictai), pp. 786-793, (2018)
[9]  
Goulart A.J.H., Guido R.C., Maciel C.D., Exploring different approaches for music genre classification, Egyptian Informatics Journal, 13, 2, pp. 59-63, (2012)
[10]  
Yu Y., Luo S., Liu S., Qiao H., Liu Y., Feng L., Deep attention based music genre classification, Neurocomputing, 372, pp. 84-91, (2020)