Fusing similarity functions for cover song identification

Cited by: 11
Authors
Chen, Ning [1]
Li, Wei [2]
Xiao, Haidong [3]
Affiliations
[1] East China Univ Sci & Technol, Sch Informat Sci & Engn, 130 Meilong Rd, Shanghai 200237, Peoples R China
[2] Fudan Univ, Sch Comp Sci & Technol, Shanghai 201203, Peoples R China
[3] Chinese Acad Sci, Shanghai Adv Res Inst, Shanghai, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Cover song identification (CSI); Qmax; Dmax; Similarity network fusion (SNF); MUSIC INFORMATION-RETRIEVAL; AUDIO; FEATURES;
DOI
10.1007/s11042-017-4456-9
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Subject classification code
0812;
Abstract
Cover Song Identification (CSI) refers to the process of identifying an alternative version, performance, rendition, or recording of a previously recorded musical composition by measuring and modeling the musical similarity between them quantitatively and objectively. However, a single similarity function cannot describe the similarity between tracks comprehensively and reliably. In this paper, the Similarity Network Fusion (SNF) technique, which was originally proposed for combining different kernels for predicting drug-target interactions, is adopted to fuse similarities computed from the same descriptor with different similarity functions. First, the Harmonic Pitch Class Profile (HPCP) is extracted from each track. Next, the similarities between the HPCP descriptors of every pair of tracks are calculated with the Qmax and Dmax measures, respectively. Then, track-by-track similarity networks based on the Qmax and Dmax similarities are constructed separately and fused into one network by SNF. Finally, the fused similarities obtained from the fused network are used to train a classifier, which can then identify whether two input tracks form a reference/cover or a reference/non-cover pair. Experimental results on Covers80 (http://labrosa.ee.columbia.edu/projects/coversongs/covers80/), a subset of SecondHandSongs (SHS) (http://labrosa.ee.columbia.edu/millionsong/secondhand), and the Mixed Collection and Mazurka Cover Collection provided by MIREX (http://www.music-ir.org/mirex/wiki/2016:Audio_Cover_Song_Identification) demonstrate that the proposed scheme performs comparably with or even better than state-of-the-art CSI schemes.
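The fusion step described in the abstract can be illustrated with a minimal sketch. It follows the commonly published form of SNF (a normalised full kernel, a k-nearest-neighbour local kernel, and cross-diffusion between the two views); the abstract does not give the authors' exact formulation or parameters, so the function names, `k`, `iterations`, and the toy matrices standing in for Qmax and Dmax similarities are all assumptions made for illustration.

```python
def matmul(A, B):
    """Plain-Python matrix product of two lists of lists."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)] for row in A]


def transpose(A):
    return [list(col) for col in zip(*A)]


def full_kernel(W):
    """Row-normalise W: 1/2 on the diagonal, the other 1/2 shared off-diagonal."""
    n = len(W)
    P = []
    for i in range(n):
        total = sum(W[i][j] for j in range(n) if j != i) or 1.0
        P.append([0.5 if j == i else W[i][j] / (2.0 * total) for j in range(n)])
    return P


def local_kernel(W, k):
    """Keep each row's k most similar other tracks, then row-normalise."""
    n = len(W)
    S = []
    for i in range(n):
        others = sorted((j for j in range(n) if j != i),
                        key=lambda j: W[i][j], reverse=True)
        keep = set(others[:k])
        total = sum(W[i][j] for j in keep) or 1.0
        S.append([W[i][j] / total if j in keep else 0.0 for j in range(n)])
    return S


def snf_two_views(W_a, W_b, k=2, iterations=10):
    """Fuse two track-by-track similarity matrices by cross-diffusion."""
    P_a, P_b = full_kernel(W_a), full_kernel(W_b)
    S_a, S_b = local_kernel(W_a, k), local_kernel(W_b, k)
    for _ in range(iterations):
        # Each view is diffused through its own local structure,
        # using the other view's current full kernel.
        new_a = matmul(matmul(S_a, P_b), transpose(S_a))
        new_b = matmul(matmul(S_b, P_a), transpose(S_b))
        # Re-normalise after every step (a simplification assumed here).
        P_a, P_b = full_kernel(new_a), full_kernel(new_b)
    n = len(P_a)
    avg = [[(P_a[i][j] + P_b[i][j]) / 2.0 for j in range(n)] for i in range(n)]
    # Symmetrise so the fused network is undirected.
    return [[(avg[i][j] + avg[j][i]) / 2.0 for j in range(n)] for i in range(n)]


def block(within, cross):
    """Hypothetical 6-track toy matrix: tracks 0-2 and 3-5 form two cover groups."""
    return [[1.0 if i == j else (within if i // 3 == j // 3 else cross)
             for j in range(6)] for i in range(6)]


qmax_like = block(0.9, 0.1)  # stand-in for a Qmax-based similarity network
dmax_like = block(0.8, 0.2)  # stand-in for a Dmax-based similarity network
fused = snf_two_views(qmax_like, dmax_like)
```

In this toy setting the fused matrix keeps tracks from the same group more similar to each other than to tracks from the other group. In the scheme the abstract describes, the fused similarities would then be passed to a classifier, a step this sketch does not cover.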
Pages: 2629 - 2652
Page count: 24
Related papers
50 in total
  • [1] Fusing similarity functions for cover song identification
    Ning Chen
    Wei Li
    Haidong Xiao
    Multimedia Tools and Applications, 2018, 77 : 2629 - 2652
  • [2] Similarity fusion scheme for cover song identification
    Chen, Ning
    Xiao, Hai-dong
    ELECTRONICS LETTERS, 2016, 52 (13) : 1173 - 1174
  • [3] Improved similarity fusion scheme for cover song identification
    Fan, Yanlan
    Chen, Ning
    ELECTRONICS LETTERS, 2018, 54 (24) : 1403 - 1404
  • [4] A code-based chromagram similarity for cover song identification
    Seo, Jin Soo
    JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2019, 38 (03) : 314 - 319
  • [5] Two-layer similarity fusion model for cover song identification
    Chen, Ning
    Li, Mingyu
    Xiao, Haidong
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2017
  • [6] Chroma binary similarity and local alignment applied to cover song identification
    Serra, Joan
    Gomez, Emilia
    Herrera, Perfecto
    Serra, Xavier
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2008, 16 (06) : 1138 - 1151
  • [7] Two-layer similarity fusion model for cover song identification
    Ning Chen
    Mingyu Li
    Haidong Xiao
    EURASIP Journal on Audio, Speech, and Music Processing, 2017
  • [8] COVER SONG IDENTIFICATION USING SONG-TO-SONG CROSS-SIMILARITY MATRIX WITH CONVOLUTIONAL NEURAL NETWORK
    Lee, Juheon
    Chang, Sungkyun
    Choe, Sang Keun
    Lee, Kyogu
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018 : 396 - 400
  • [9] A music similarity function based on probabilistic linear discriminant analysis for cover song identification
    Seo, Jin Soo
    Kim, Junghyun
    Kim, Hyemi
    JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2022, 41 (06) : 662 - 667
  • [10] SIMILARITY LEARNING FOR COVER SONG IDENTIFICATION USING CROSS-SIMILARITY MATRICES OF MULTI-LEVEL DEEP SEQUENCES
    Jiang, Chaoya
    Yang, Deshun
    Chen, Xiaoou
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020 : 26 - 30