Music-CRN: an Efficient Content-Based Music Classification and Recommendation Network

被引:9
作者
Mao, Yuxu [1 ]
Zhong, Guoqiang [1 ]
Wang, Haizhen [1 ]
Huang, Kaizhu [2 ,3 ]
机构
[1] Ocean Univ China, Coll Comp Sci & Technol, Qingdao, Peoples R China
[2] Duke Kunshan Univ, Data Sci Res Ctr, Suzhou, Peoples R China
[3] Duke Kunshan Univ, Div Nat & Appl Sci, Suzhou, Peoples R China
关键词
Music classification; Music recommendation; Content-based recommendation; Convolutional neural networks; Music spectrogram dataset;
D O I
10.1007/s12559-022-10039-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
For human beings, music is generally perceived, categorized, and enjoyed based on its attributes, such as rhythm, pitch, timbre, and harmony. In recent years, due to their high performances, content-based music classification and recommendation systems have attracted much attention from both the music industry and research community. However, on the one hand, deep music classification models are still very rare, and on the other hand, the collaborative filtering approach, which has the cold start problem, still dominates the music recommendation applications. In this paper, we propose Music-CRN (short for music classification and recommendation network), a simple yet effective model that facilitates music classification and recommendation with learning the audio content features of music. Specifically, to extract the content features of music, the audio is converted into spectrogram "images" by Fourier transformation. Music-CRN can be applied on the spectrograms as similar as natural images to effectively extract music content features. Additionally, we collect a new dataset containing nearly 200,000 music spectrogram slices. To the best of our knowledge, this is the first publicly available music spectrogram dataset, which is at . We compare Music-CRN to previous content-based music classification and recommendation models on the collected dataset. Experimental results show that Music-CRN achieves state-of-the-art performance on music classification and recommendation tasks, demonstrating its superiority over previous methods.
引用
收藏
页码:2306 / 2316
页数:11
相关论文
共 32 条
[1]  
[Anonymous], 2015, P 16 INT SOC MUS INF
[2]   Aggregate features and ADABOOST for music classification [J].
Bergstra, James ;
Casagrande, Norman ;
Erhan, Dumitru ;
Eck, Douglas ;
Kegl, Balazs .
MACHINE LEARNING, 2006, 65 (2-3) :473-484
[3]   Audio-Based Music Classification with DenseNet and Data Augmentation [J].
Bian, Wenhao ;
Wang, Jie ;
Zhuang, Bojin ;
Yang, Jiankui ;
Wang, Shaojun ;
Xiao, Jing .
PRICAI 2019: TRENDS IN ARTIFICIAL INTELLIGENCE, PT III, 2019, 11672 :56-65
[4]  
Cano P., 2005, 13th Annual ACM International Conference on Multimedia, P211, DOI 10.1145/1101149.1101181
[5]   Attentive Collaborative Filtering: Multimedia Recommendation with Item- and Component-Level Attention [J].
Chen, Jingyuan ;
Zhang, Hanwang ;
He, Xiangnan ;
Nie, Liqiang ;
Liu, Wei ;
Chua, Tat-Seng .
SIGIR'17: PROCEEDINGS OF THE 40TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2017, :335-344
[6]  
Choi K, 2017, ARXIV170309179
[7]   Music Personalized Recommendation System Based on Hybrid Filtration [J].
Dan, Wu .
2019 INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION, BIG DATA & SMART CITY (ICITBS), 2019, :430-433
[8]  
Dieleman Sander, 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), P6964, DOI 10.1109/ICASSP.2014.6854950
[9]  
Dieleman S., 2011, P 12 INT SOC MUS INF, P669, DOI [10.5281/zenodo.1415188, DOI 10.5281/ZENODO.1415188]
[10]  
Garcia-Garcia D., 2010, Proceedings of the 2010 IEEE International Workshop on Machine Learning for Signal Processing (MLSP), P266, DOI 10.1109/MLSP.2010.5589240