COVER SONG IDENTIFICATION USING SONG-TO-SONG CROSS-SIMILARITY MATRIX WITH CONVOLUTIONAL NEURAL NETWORK

被引:0
|
作者
Lee, Juheon [1 ,4 ]
Chang, Sungkyun [2 ,4 ]
Choe, Sang Keun [3 ,4 ]
Lee, Kyogu [2 ,4 ]
机构
[1] Seoul Natl Univ, Coll Liberal Studies, Seoul 08826, South Korea
[2] Seoul Natl Univ, Mus & Audio Res Grp, Seoul 08826, South Korea
[3] Seoul Natl Univ, Dept Elect & Comp Engn, Seoul 08826, South Korea
[4] Seoul Natl Univ, Ctr Superintelligence, Seoul 08826, South Korea
来源
2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2018年
基金
新加坡国家研究基金会;
关键词
Music Information Retrieval; Convolutional Neural Network; Cover Song Identification; Cross-similarity Matrix;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, we propose a cover song identification algorithm using a convolutional neural network (CNN). We first train the CNN model to classify any non-/cover relationship, by feeding a cross-similarity matrix that is generated from a pair of songs as an input. Our main idea is to use the CNN output-the cover-probabilities of one song to all other candidate songs-as a new representation vector for measuring the distance between songs. Based on this, the present algorithm searches cover songs by applying several ranking methods: 1. sorting without using the representation vectors; 2. the cosine distance between the representation vectors; and 3. the correlation between the vectors. In our experiment, the proposed algorithm significantly outperformed the algorithms used in recent studies, by achieving a mean average precision (MAP) of 93.18% in a dataset consisting of 3,300 cover-pairs and 496,200 non-cover-pairs.
引用
收藏
页码:396 / 400
页数:5
相关论文
共 50 条
  • [1] LEARNING A REPRESENTATION FOR COVER SONG IDENTIFICATION USING CONVOLUTIONAL NEURAL NETWORK
    Yu, Zhesong
    Xu, Xiaoshuo
    Chen, Xiaoou
    Yang, Deshun
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 541 - 545
  • [2] SIMILARITY LEARNING FOR COVER SONG IDENTIFICATION USING CROSS-SIMILARITY MATRICES OF MULTI-LEVEL DEEP SEQUENCES
    Jiang, Chaoya
    Yang, Deshun
    Chen, Xiaoou
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 26 - 30
  • [3] Cross-Similarity Measurement of Music Sections: A Framework for Large-scale Cover Song Identification
    Cai, Kang
    Yang, Deshun
    Chen, Xiaoou
    ADVANCES IN INTELLIGENT INFORMATION HIDING AND MULTIMEDIA SIGNAL PROCESSING, VOL 1, 2017, 63 : 151 - 158
  • [4] Temporal Pyramid Pooling Convolutional Neural Network for Cover Song Identification
    Yu, Zhesong
    Xu, Xiaoshuo
    Chen, Xiaoou
    Yang, Deshun
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 4846 - 4852
  • [5] KEY-INVARIANT CONVOLUTIONAL NEURAL NETWORK TOWARD EFFICIENT COVER SONG IDENTIFICATION
    Xu, Xiaoshuo
    Chen, Xiaoou
    Yang, Deshun
    2018 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2018,
  • [6] Similarity fusion scheme for cover song identification
    Chen, Ning
    Xiao, Hai-dong
    ELECTRONICS LETTERS, 2016, 52 (13) : 1173 - 1174
  • [7] Fusing similarity functions for cover song identification
    Ning Chen
    Wei Li
    Haidong Xiao
    Multimedia Tools and Applications, 2018, 77 : 2629 - 2652
  • [8] Fusing similarity functions for cover song identification
    Chen, Ning
    Li, Wei
    Xiao, Haidong
    MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (02) : 2629 - 2652
  • [9] Cross recurrence quantification for cover song identification
    Serra, Joan
    Serra, Xavier
    Andrzejak, Ralph G.
    NEW JOURNAL OF PHYSICS, 2009, 11
  • [10] Improved similarity fusion scheme for cover song identification
    Fan, Yanlan
    Chen, Ning
    ELECTRONICS LETTERS, 2018, 54 (24) : 1403 - 1404