Fusing similarity functions for cover song identification

Cited by: 11
Authors
Chen, Ning [1]
Li, Wei [2]
Xiao, Haidong [3]
Affiliations
[1] East China Univ Sci & Technol, Sch Informat Sci & Engn, 130 Meilong Rd, Shanghai 200237, Peoples R China
[2] Fudan Univ, Sch Comp Sci & Technol, Shanghai 201203, Peoples R China
[3] Chinese Acad Sci, Shanghai Adv Res Inst, Shanghai, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Cover song identification (CSI); Qmax; Dmax; Similarity network fusion (SNF); MUSIC INFORMATION-RETRIEVAL; AUDIO; FEATURES;
DOI
10.1007/s11042-017-4456-9
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Cover Song Identification (CSI) refers to the process of identifying an alternative version, performance, rendition, or recording of a previously recorded musical composition by quantitatively and objectively measuring and modeling the musical similarity between them. However, a single similarity function cannot describe the similarity between tracks comprehensively and reliably. In this paper, the Similarity Network Fusion (SNF) technique, which was originally proposed for combining different kernels for predicting drug-target interactions, is adopted to fuse similarities computed from the same descriptor with different similarity functions. First, the Harmonic Pitch Class Profile (HPCP) is extracted from each track. Next, the similarities between the HPCP descriptors of any two tracks are calculated in terms of the Qmax and Dmax measures. Then, track-by-track similarity networks based on the Qmax and Dmax similarities are constructed separately and fused into one network by SNF. Finally, the fused similarities obtained from the fused similarity network are used to train a classifier, which can then determine whether two input tracks form a reference/cover or a reference/non-cover pair. Experimental results on Covers80 (http://labrosa.ee.columbia.edu/projects/coversongs/covers80/), a subset of SecondHandSongs (SHS) (http://labrosa.ee.columbia.edu/millionsong/secondhand), and the Mixed Collection and Mazurka Cover Collection provided by MIREX (http://www.music-ir.org/mirex/wiki/2016:Audio_Cover_Song_Identification) demonstrate that the proposed scheme performs comparably with or even better than state-of-the-art CSI schemes.
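The fusion step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it follows the generic two-view SNF update (each view's normalized matrix is diffused through the other view's nearest-neighbor kernel), it assumes the Qmax and Dmax scores have already been scaled into non-negative, symmetric similarity matrices, and the function names, the toy values, and the parameters `k` and `n_iter` are hypothetical.

```python
import numpy as np

def full_kernel(W):
    """Row-normalize off-diagonal entries to sum to 1/2 and put 1/2 on the diagonal."""
    off = np.asarray(W, dtype=float).copy()
    np.fill_diagonal(off, 0.0)
    row_sums = off.sum(axis=1, keepdims=True)
    row_sums[row_sums == 0] = 1.0  # guard: a track with no similar neighbors
    P = off / (2.0 * row_sums)
    np.fill_diagonal(P, 0.5)
    return P

def local_kernel(W, k):
    """Keep each row's k largest off-diagonal similarities, then row-normalize."""
    W = np.asarray(W, dtype=float)
    off = W.copy()
    np.fill_diagonal(off, -np.inf)          # never pick a track as its own neighbor
    idx = np.argsort(-off, axis=1)[:, :k]   # indices of the k nearest neighbors
    rows = np.arange(W.shape[0])[:, None]
    S = np.zeros_like(W)
    S[rows, idx] = W[rows, idx]
    sums = S.sum(axis=1, keepdims=True)
    sums[sums == 0] = 1.0
    return S / sums

def snf_fuse(W_a, W_b, k=2, n_iter=10):
    """Fuse two similarity networks by cross-diffusion (simplified two-view SNF)."""
    P_a, P_b = full_kernel(W_a), full_kernel(W_b)
    S_a, S_b = local_kernel(W_a, k), local_kernel(W_b, k)
    for _ in range(n_iter):
        # Both updates use the previous iteration's matrices (tuple assignment).
        P_a, P_b = (full_kernel(S_a @ P_b @ S_a.T),
                    full_kernel(S_b @ P_a @ S_b.T))
    fused = (P_a + P_b) / 2.0
    return (fused + fused.T) / 2.0          # symmetrize the result

# Toy data (made-up values): tracks 0/1 are one cover pair, tracks 2/3 another.
W_qmax = np.array([[1.0, 0.9, 0.1, 0.2],
                   [0.9, 1.0, 0.2, 0.1],
                   [0.1, 0.2, 1.0, 0.8],
                   [0.2, 0.1, 0.8, 1.0]])
W_dmax = np.array([[1.0, 0.7, 0.3, 0.1],
                   [0.7, 1.0, 0.1, 0.2],
                   [0.3, 0.1, 1.0, 0.9],
                   [0.1, 0.2, 0.9, 1.0]])
fused = snf_fuse(W_qmax, W_dmax, k=2, n_iter=10)
print(np.round(fused, 3))
```

In the scheme the abstract outlines, the entries of such a fused matrix would then serve as the input to the classifier that separates reference/cover from reference/non-cover pairs.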
Pages: 2629-2652
Number of pages: 24
Related papers
50 in total
  • [31] MUSIC FINGERPRINT EXTRACTION FOR CLASSICAL MUSIC COVER SONG IDENTIFICATION
    Kim, Samuel
    Unal, Erdem
    Narayanan, Shrikanth
    2008 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-4, 2008, : 1261 - 1264
  • [32] On Accuracy and Time Processing Evaluation of Cover Song Identification Systems
    Ferreira, Martha Dais
    Correa, Debora Cristina
    Grivet, Marcos Antonio
    dos Santos, Geovan Tavares
    de Mello, Rodrigo Fernandes
    Nonato, Luis Gustavo
    JOURNAL OF NEW MUSIC RESEARCH, 2016, 45 (04) : 333 - 342
  • [33] Dynamic chroma feature vectors with applications to cover song identification
    Kim, Samuel
    Narayanan, Shrikanth
    2008 IEEE 10TH WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, VOLS 1 AND 2, 2008, : 988 - 991
  • [34] Pairwise Similarity Normalization Based on a Hubness Score for Improving Cover Song Retrieval Accuracy
    Seo, Jin S.
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2022, E105D (05) : 1130 - 1134
  • [35] A relevance-based pairwise chromagram similarity for improving cover song retrieval accuracy
    Seo, Jin Soo
    JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2024, 43 (02) : 200 - 206
  • [36] IDENTIFICATION OF COVER SONGS USING INFORMATION THEORETIC MEASURES OF SIMILARITY
    Foster, Peter
    Dixon, Simon
    Klapuri, Anssi
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 739 - 743
  • [37] Content-Based Cover Song Identification in Music Digital Libraries
    Miotto, Riccardo
    Montecchio, Nicola
    Orio, Nicola
    DIGITAL LIBRARIES, 2010, 91 : 195 - 204
  • [38] Karalk: a karaoke dataset for cover song identification and singing voice analysis
    Bayle, Yann
    Marsik, Ladislav
    Rusek, Martin
    Robine, Matthias
    Hanna, Pierre
    Slaninova, Katerina
    Martinovic, Jan
    Pokorny, Jaroslav
    2017 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM), 2017, : 177 - 184
  • [39] WideResNet with Joint Representation Learning and Data Augmentation for Cover Song Identification
    Hu, Shichao
    Zhang, Bin
    Lu, Jinhong
    Jiang, Yiliang
    Wang, Wucheng
    Kong, Lingcheng
    Zhao, Weifeng
    Jiang, Tao
    INTERSPEECH 2022, 2022, : 4187 - 4191
  • [40] LEARNING A REPRESENTATION FOR COVER SONG IDENTIFICATION USING CONVOLUTIONAL NEURAL NETWORK
    Yu, Zhesong
    Xu, Xiaoshuo
    Chen, Xiaoou
    Yang, Deshun
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 541 - 545