INSTRUMENTATION-BASED MUSIC SIMILARITY USING SPARSE REPRESENTATIONS

Cited by: 0

Authors:

Fujihara, Hiromasa [1]

Klapuri, Anssi [1]

Plumbley, Mark D. [1]

Affiliations:

[1] Queen Mary Univ London, Ctr Digital Mus, London, England

Source:

2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2012

Funding:

Engineering and Physical Sciences Research Council (EPSRC), UK;

Keywords:

Music similarity; Instrumentation; Sparse representation; Online dictionary learning;

DOI:

Not available

Chinese Library Classification (CLC):

O42 [Acoustics];

Discipline classification codes:

070206 ; 082403 ;

Abstract:

This paper describes a novel music similarity calculation method that is based on the instrumentation of music pieces. The approach taken here is based on the idea that sparse representations of musical audio signals are a rich source of information regarding the elements that constitute the observed spectra. We propose a method to extract feature vectors based on sparse representations and use these to calculate a similarity measure between songs. To train a dictionary for sparse representations from a large amount of training data, a novel dictionary-initialization method based on agglomerative clustering is proposed. An objective evaluation shows that the new features improve the performance of similarity calculation compared to standard mel-frequency cepstral coefficient (MFCC) features.
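The pipeline outlined in the abstract can be illustrated with a toy sketch. The abstract does not give the algorithmic details, so everything below is an assumption made for illustration: the dictionary is only initialized by centroid-merging agglomerative clustering (no subsequent dictionary learning step is run), sparse codes are computed with plain matching pursuit, a song-level feature is the mean absolute activation per atom, and songs are compared with cosine similarity. All function names and the toy spectra are hypothetical, not taken from the paper.

```python
import math

def normalize(v):
    """Scale a vector to unit Euclidean norm (zero vectors are returned unchanged)."""
    n = math.sqrt(sum(x * x for x in v))
    return [x / n for x in v] if n > 0 else list(v)

def agglomerative_init(frames, n_atoms):
    """Assumed initialization: repeatedly merge the two closest cluster
    centroids until n_atoms clusters remain, then use the unit-norm
    centroids as dictionary atoms."""
    clusters = [(list(f), 1) for f in frames]  # (centroid, cluster size)
    while len(clusters) > n_atoms:
        best = None
        for i in range(len(clusters)):
            for j in range(i + 1, len(clusters)):
                d = math.dist(clusters[i][0], clusters[j][0])
                if best is None or d < best[0]:
                    best = (d, i, j)
        _, i, j = best
        (ci, ni), (cj, nj) = clusters[i], clusters[j]
        merged = [(a * ni + b * nj) / (ni + nj) for a, b in zip(ci, cj)]
        clusters[i] = (merged, ni + nj)
        del clusters[j]
    return [normalize(c) for c, _ in clusters]

def matching_pursuit(frame, dictionary, n_nonzero):
    """Greedy sparse coding: pick the atom most correlated with the
    residual, record its coefficient, subtract its contribution."""
    residual = list(frame)
    codes = [0.0] * len(dictionary)
    for _ in range(n_nonzero):
        corr = [sum(r * a for r, a in zip(residual, atom)) for atom in dictionary]
        k = max(range(len(corr)), key=lambda idx: abs(corr[idx]))
        codes[k] += corr[k]
        residual = [r - corr[k] * a for r, a in zip(residual, dictionary[k])]
    return codes

def song_feature(frames, dictionary, n_nonzero=1):
    """Assumed pooling: mean absolute activation of each atom over all frames."""
    total = [0.0] * len(dictionary)
    for f in frames:
        for k, c in enumerate(matching_pursuit(f, dictionary, n_nonzero)):
            total[k] += abs(c)
    return [t / len(frames) for t in total]

def cosine(u, v):
    """Cosine similarity between two feature vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv) if nu > 0 and nv > 0 else 0.0

# Toy 4-bin "spectra": songs A and B share low-bin energy, song C is high-bin.
song_a = [[1.0, 0.9, 0.0, 0.1], [0.9, 1.0, 0.1, 0.0]]
song_b = [[1.0, 0.8, 0.1, 0.0], [0.8, 1.0, 0.0, 0.1]]
song_c = [[0.0, 0.1, 1.0, 0.9], [0.1, 0.0, 0.9, 1.0]]

dictionary = agglomerative_init(song_a + song_b + song_c, n_atoms=2)
feat_a, feat_b, feat_c = (song_feature(s, dictionary) for s in (song_a, song_b, song_c))
sim_ab = cosine(feat_a, feat_b)
sim_ac = cosine(feat_a, feat_c)
```

In this toy setup the two atoms end up representing the low-bin and high-bin spectral shapes, so songs A and B, which activate the same atom, score as more similar to each other than A and C. A real system would operate on spectra of actual audio and use a far larger dictionary.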

Pages: 433-436

Page count: 4
