Deep learning of chroma representation for cover song identification in compression domain

Cited by: 0
Authors
Jiunn-Tsair Fang
Yu-Ruey Chang
Pao-Chi Chang
Affiliations
[1] Ming Chuan University, Department of Electronic Engineering
[2] National Central University, Department of Communication Engineering
Keywords
Cover song; Music retrieval; Sparse autoencoder; Descriptor; Advanced audio coding;
DOI
Not available
Abstract
Methods for identifying a cover song typically compare the chroma features of the query song with those of each song in the data set. However, these pairwise comparisons require considerable time. In addition, to save disk space, most songs in the data set are stored in a compressed format. Therefore, to eliminate some decoding procedures, this study extracted music information directly from the modified discrete cosine transform (MDCT) coefficients of advanced audio coding (AAC) and then mapped these coefficients to 12-dimensional chroma features. The chroma features were segmented to preserve the melodies. A sparse autoencoder, a deep learning architecture of artificial neural networks, was trained on the chroma feature segments. The purpose of the deep learning procedure was to transform the chroma features into an intermediate representation of reduced dimension. Experimental results on the covers80 data set showed that the mean reciprocal rank increased to 0.5 and the matching time was reduced by over 94% compared with traditional approaches.
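The abstract does not give the mapping from MDCT coefficients to chroma, so the following is only a minimal sketch of one plausible way to do it, together with the mean reciprocal rank used as the evaluation metric. The function names, the 55 to 5000 Hz band limits, and the assumption of a 1024-coefficient AAC long-window frame at 44.1 kHz are all illustrative choices, not details taken from the paper.

```python
import math


def mdct_bin_frequency(k, sample_rate=44100, num_coeffs=1024):
    """Approximate center frequency (Hz) of MDCT bin k.

    Assumes num_coeffs bins spread evenly over 0..sample_rate/2.
    """
    return (k + 0.5) * sample_rate / (2 * num_coeffs)


def fold_to_chroma(coeffs, sample_rate=44100, f_min=55.0, f_max=5000.0):
    """Fold the energy of one frame of MDCT coefficients into 12 pitch classes.

    Index 0 is C and index 9 is A. The band limits are assumed values.
    """
    chroma = [0.0] * 12
    n = len(coeffs)
    for k, c in enumerate(coeffs):
        f = mdct_bin_frequency(k, sample_rate, n)
        if f < f_min or f > f_max:
            continue
        # MIDI note number of the bin frequency (A4 = 440 Hz = note 69).
        midi = 69 + 12 * math.log2(f / 440.0)
        pitch_class = int(round(midi)) % 12
        chroma[pitch_class] += c * c
    peak = max(chroma)
    # Normalize so the strongest pitch class equals 1 (if any energy exists).
    return [v / peak for v in chroma] if peak > 0 else chroma


def mean_reciprocal_rank(ranks):
    """Mean of 1/rank, where rank is the 1-based position of the first
    correct cover in each query's result list."""
    return sum(1.0 / r for r in ranks) / len(ranks)


# One synthetic frame with all energy in the bin closest to 440 Hz (bin 20).
frame = [0.0] * 1024
frame[20] = 3.0
chroma = fold_to_chroma(frame)
mrr = mean_reciprocal_rank([1, 2, 4])
```

In this sketch each frame yields one 12-dimensional vector; a sequence of such vectors would then be cut into segments and passed to the autoencoder, which is not shown here.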
Pages: 887 - 902
Page count: 15
Related papers
50 in total
  • [1] Deep learning of chroma representation for cover song identification in compression domain
    Fang, Jiunn-Tsair
    Chang, Yu-Ruey
    Chang, Pao-Chi
    MULTIDIMENSIONAL SYSTEMS AND SIGNAL PROCESSING, 2018, 29 (03) : 887 - 902
  • [2] Entropy per Chroma for Cover Song Identification
    Camarena-Ibarrola, Antonio
    Figueroa, Karina
    Tejeda-Villela, Hector
    2016 IEEE INTERNATIONAL AUTUMN MEETING ON POWER, ELECTRONICS AND COMPUTING (ROPEC), 2016,
  • [3] Deep feature learning for cover song identification
    Fang, Jiunn-Tsair
    Day, Chi-Ting
    Chang, Pao-Chi
    MULTIMEDIA TOOLS AND APPLICATIONS, 2017, 76 (22) : 23225 - 23238
  • [4] Deep feature learning for cover song identification
    Jiunn-Tsair Fang
    Chi-Ting Day
    Pao-Chi Chang
    Multimedia Tools and Applications, 2017, 76 : 23225 - 23238
  • [5] Dynamic chroma feature vectors with applications to cover song identification
    Kim, Samuel
    Narayanan, Shrikanth
    2008 IEEE 10TH WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, VOLS 1 AND 2, 2008, : 988 - 991
  • [6] DisCover: Disentangled Music Representation Learning for Cover Song Identification
    Xun, Jiahao
    Zhang, Shengyu
    Yang, Yanting
    Zhu, Jieming
    Deng, Liqun
    Zhao, Zhou
    Dong, Zhenhua
    Li, Ruiqi
    Zhang, Lichao
    Wu, Fei
    PROCEEDINGS OF THE 46TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2023, 2023, : 453 - 463
  • [7] Chroma binary similarity and local alignment applied to cover song identification
    Serra, Joan
    Gomez, Emilia
    Herrera, Perfecto
    Serra, Xavier
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2008, 16 (06) : 1138 - 1151
  • [8] LEARNING A REPRESENTATION FOR COVER SONG IDENTIFICATION USING CONVOLUTIONAL NEURAL NETWORK
    Yu, Zhesong
    Xu, Xiaoshuo
    Chen, Xiaoou
    Yang, Deshun
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 541 - 545
  • [9] WideResNet with Joint Representation Learning and Data Augmentation for Cover Song Identification
    Hu, Shichao
    Zhang, Bin
    Lu, Jinhong
    Jiang, Yiliang
    Wang, Wucheng
    Kong, Lingcheng
    Zhao, Weifeng
    Jiang, Tao
    INTERSPEECH 2022, 2022, : 4187 - 4191
  • [10] Multi-Scale Chroma n-Gram Indexing for Cover Song Identification
    Seo, Jin S.
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2020, E103D (01) : 59 - 62