Fusing similarity functions for cover song identification

被引:11
|
作者
Chen, Ning [1 ]
Li, Wei [2 ]
Xiao, Haidong [3 ]
机构
[1] East China Univ Sci & Technol, Sch Informat Sci & Engn, 130 Meilong Rd, Shanghai 200237, Peoples R China
[2] Fudan Univ, Sch Comp Sci & Technol, Shanghai 201203, Peoples R China
[3] Chinese Acad Sci, Shanghai Adv Res Inst, Shanghai, Peoples R China
基金
中国国家自然科学基金;
关键词
Cover song identification (CSI); Qmax; Dmax; Similarity network fusion (SNF); MUSIC INFORMATION-RETRIEVAL; AUDIO; FEATURES;
D O I
10.1007/s11042-017-4456-9
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Cover Song Identification (CSI) technique, refers to the process of identifying an alternative version, performance, rendition, or recording of a previously recorded musical composition by measuring and modeling the musical similarity between them quantitatively and objectively. However, it is not possible to describe the similarity between tracks comprehensively and reliably with only one similarity function. In this paper, the Similarity Network Fusion (SNF) technique, which was originally proposed for combining different kernels for predicting drug-target interactions, is adopted to fuse different similarities based on the same descriptor and different similarity functions. First, the Harmonic Pitch Class Profile (HPCP) is extracted from each track. Next, the similarities, in terms of Qmax and Dmax measures, between the HPCP descriptors of any two tracks are calculated, respectively. Then, the track-by-track similarity networks based on Qmax and on Dmax similarity are constructed separately and then fused into one network by SNF. Finally, the fused similarities obtained from the fused similarity network are adopted to train a classifier, which can then be used to identify whether the input two tracks belong to reference/cover or reference/non-cover pair. Experimental results on Covers80 (http:// labrosa. ee. columbia. edu/projects/coversongs/ covers80/), subset of SecondHandSongs (SHS) (http:// labrosa. ee. columbia. edu/millionsong/secondhand), and the Mixed Collection and Mazurka Cover Collection provided by MIREX (http:// www. music-ir.org/mirex/wiki/2016: Audio Cover Song Identification) demonstrate that the proposed scheme performs comparably with or even better than state-of-the-art CSI schemes.
引用
收藏
页码:2629 / 2652
页数:24
相关论文
共 50 条
  • [41] Deep learning of chroma representation for cover song identification in compression domain
    Jiunn-Tsair Fang
    Yu-Ruey Chang
    Pao-Chi Chang
    Multidimensional Systems and Signal Processing, 2018, 29 : 887 - 902
  • [42] BYTECOVER: COVER SONG IDENTIFICATION VIA MULTI-LOSS TRAINING
    Du, Xingjian
    Yu, Zhesong
    Zhu, Bilei
    Chen, Xiaoou
    Ma, Zejun
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 551 - 555
  • [43] CQTXNet: A Modified Xception Network with Attention Modules for Cover Song Identification
    Seo, Jinsoo
    Kim, Junghyun
    Kim, Hyemi
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2024, E107D (01) : 49 - 52
  • [44] Salient Chromagram Extraction Based on Trend Removal for Cover Song Identification
    Seo, Jin S.
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2021, E104D (01): : 51 - 54
  • [45] Efficient Two-Layer Model Towards Cover Song Identification
    Xu, Xiaoshuo
    Cheng, Yao
    Chen, Xiaoou
    Yang, Deshun
    MULTIMEDIA MODELING, MMM 2018, PT II, 2018, 10705 : 118 - 128
  • [46] Temporal Pyramid Pooling Convolutional Neural Network for Cover Song Identification
    Yu, Zhesong
    Xu, Xiaoshuo
    Chen, Xiaoou
    Yang, Deshun
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 4846 - 4852
  • [47] COVER SONG IDENTIFICATION WITH 2D FOURIER TRANSFORM SEQUENCES
    Seetharaman, Prem
    Rafii, Zajar
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 616 - 620
  • [48] Deep learning of chroma representation for cover song identification in compression domain
    Fang, Jiunn-Tsair
    Chang, Yu-Ruey
    Chang, Pao-Chi
    MULTIDIMENSIONAL SYSTEMS AND SIGNAL PROCESSING, 2018, 29 (03) : 887 - 902
  • [49] FUSING FEATURE AND SIMILARITY FOR MULTIMODAL SEARCH
    Song, Guoli
    Wang, Shuhui
    Tian, Qi
    2015 IEEE CHINA SUMMIT & INTERNATIONAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING, 2015, : 787 - 791
  • [50] Applying the Distance and Similarity Functions to Radar Signals Identification
    Dudczyk, Janusz
    Kawalec, Adam
    Cyrek, Jacek
    2008 PROCEEDINGS INTERNATIONAL RADAR SYMPOSIUM, 2008, : 298 - +