INSTRUMENTATION-BASED MUSIC SIMILARITY USING SPARSE REPRESENTATIONS

被引:0
作者
Fujihara, Hiromasa [1 ]
Klapuri, Anssi [1 ]
Plumbley, Mark D. [1 ]
机构
[1] Queen Mary Univ London, Ctr Digital Mus, London, England
来源
2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2012年
基金
英国工程与自然科学研究理事会;
关键词
Music similarity; Instrumentation; Sparse representation; Online dictionary learning;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper describes a novelmusic similarity calculation method that is based on the instrumentation of music pieces. The approach taken here is based on the idea that sparse representations of musical audio signals are a rich source of information regarding the elements that constitute the observed spectra. We propose a method to extract feature vectors based on sparse representations and use these to calculate a similarity measure between songs. To train a dictionary for sparse representations from a large amount of training data, a novel dictionary-initialization method based on agglomerative clustering is proposed. An objective evaluation shows that the new features improve the performance of similarity calculation compared to the standard mel-frequency cepstral coefficients features.
引用
收藏
页码:433 / 436
页数:4
相关论文
共 20 条
[1]   K-SVD: An algorithm for designing overcomplete dictionaries for sparse representation [J].
Aharon, Michal ;
Elad, Michael ;
Bruckstein, Alfred .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2006, 54 (11) :4311-4322
[2]  
[Anonymous], P 11 ISMIR
[3]  
[Anonymous], 2000, Pattern Classification
[4]   A large-scale evaluation of acoustic and subjective music-similarity measures [J].
Berenzweig, A ;
Logan, B ;
Ellis, DPW ;
Whitman, B .
COMPUTER MUSIC JOURNAL, 2004, 28 (02) :63-76
[5]  
Breese J. S., 1998, Uncertainty in Artificial Intelligence. Proceedings of the Fourteenth Conference (1998), P43
[6]   The music information retrieval evaluation exchange (2005-2007): A window into music information retrieval research [J].
Downie, J. Stephen .
ACOUSTICAL SCIENCE AND TECHNOLOGY, 2008, 29 (04) :247-255
[7]   Least angle regression - Rejoinder [J].
Efron, B ;
Hastie, T ;
Johnstone, I ;
Tibshirani, R .
ANNALS OF STATISTICS, 2004, 32 (02) :494-499
[8]   A Modeling of Singing Voice Robust to Accompaniment Sounds and Its Application to Singer Identification and Vocal-Timbre-Similarity-Based Music Information Retrieval [J].
Fujihara, Hiromasa ;
Goto, Masataka ;
Kitahara, Tetsuro ;
Okuno, Hiroshi G. .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (03) :638-648
[9]   Musical genre classification using nonnegative matrix factorization-based features [J].
Holzapfel, Andre ;
Stylianou, Yannis .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2008, 16 (02) :424-434
[10]  
Hoyer PO, 2002, NEURAL NETWORKS FOR SIGNAL PROCESSING XII, PROCEEDINGS, P557, DOI 10.1109/NNSP.2002.1030067