Automatic Musical Pattern Feature Extraction Using Convolutional Neural Network

被引:0
作者
Li, Tom L. H. [1 ]
Chan, Antoni B. [1 ]
Chun, Andy H. W. [1 ]
机构
[1] City Univ Hong Kong, Dept Comp Sci, Kowloon, Hong Kong, Peoples R China
来源
INTERNATIONAL MULTICONFERENCE OF ENGINEERS AND COMPUTER SCIENTISTS (IMECS 2010), VOLS I-III | 2010年
关键词
music feature extractor; music information retrieval; convolutional neural network; multimedia data mining;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Music genre classification has been a challenging yet promising task in the field of music information retrieval (MIR). Due to the highly elusive characteristics of audio musical data, retrieving informative and reliable features from audio signals is crucial to the performance of any music genre classification system. Previous work on audio music genre classification systems mainly concentrated on using timbral features, which limits the performance. To address this problem, we propose a novel approach to extract musical pattern features in audio music using convolutional neural network (CNN), a model widely adopted in image information retrieval tasks. Our experiments show that CNN has strong capacity to capture informative features from the variations of musical patterns with minimal prior knowledge provided.
引用
收藏
页码:546 / 550
页数:5
相关论文
共 18 条
[1]  
[Anonymous], 2007, LARGE SCALE KERNEL M
[2]  
Basili R., 2004, P ISMIR
[3]   Aggregate features and ADABOOST for music classification [J].
Bergstra, James ;
Casagrande, Norman ;
Erhan, Dumitru ;
Eck, Douglas ;
Kegl, Balazs .
MACHINE LEARNING, 2006, 65 (2-3) :473-484
[4]   Musical style identification using self-organising maps [J].
de León, PJP ;
Inesta, JM .
SECOND INTERNATIONAL CONFERENCE ON WEB DELIVERING OF MUSIC, PROCEEDINGS, 2002, :82-89
[5]  
DESHPANDE H, 2001, P COST G6 C DIG AUD
[6]  
Ellis D. P. W., 2007, DINS P ISMIR
[7]  
Hall M., 2009, SIGKDD Explorations, V11, P10, DOI DOI 10.1145/1656274.1656278
[8]  
Li T, 2003, 2003 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS PROCEEDINGS, P143
[9]  
Lidy T., P 6 INT C MUS INF RE, P34
[10]  
Lidy T., 2007, P ISMIR VIENN AUSTR