Low-order auditory Zernike moment: a novel approach for robust music identification in the compressed domain

被引:3
|
作者
Li, Wei [1 ]
Xiao, Chuan [1 ]
Liu, Yaduo [2 ]
机构
[1] Fudan Univ, Sch Comp Sci & Technol, Shanghai 201203, Peoples R China
[2] China Elect Power Res Inst, Beijing 100192, Peoples R China
关键词
Music identification; MPEG; Compressed domain; Zernike moment; Robustness; Auditory image;
D O I
10.1186/1687-6180-2013-132
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Audio identification via fingerprint has been an active research field for years. However, most previously reported methods work on the raw audio format in spite of the fact that nowadays compressed format audio, especially MP3 music, has grown into the dominant way to store music on personal computers and/or transmit it over the Internet. It will be interesting if a compressed unknown audio fragment could be directly recognized from the database without decompressing it into the wave format at first. So far, very few algorithms run directly on the compressed domain for music information retrieval, and most of them take advantage of the modified discrete cosine transform coefficients or derived cepstrum and energy type of features. As a first attempt, we propose in this paper utilizing compressed domain auditory Zernike moment adapted from image processing techniques as the key feature to devise a novel robust audio identification algorithm. Such fingerprint exhibits strong robustness, due to its statistically stable nature, against various audio signal distortions such as recompression, noise contamination, echo adding, equalization, band-pass filtering, pitch shifting, and slight time scale modification. Experimental results show that in a music database which is composed of 21,185 MP3 songs, a 10-s long music segment is able to identify its original near-duplicate recording, with average top-5 hit rate up to 90% or above even under severe audio signal distortions.
引用
收藏
页数:15
相关论文
共 23 条
  • [1] Low-order auditory Zernike moment: a novel approach for robust music identification in the compressed domain
    Wei Li
    Chuan Xiao
    Yaduo Liu
    EURASIP Journal on Advances in Signal Processing, 2013
  • [2] Robust Music Identification Based on Low-Order Zernike Moment in the Compressed Domain
    Li, Wei
    Liu, Yaduo
    Xue, Xiangyang
    SIGIR 2010: PROCEEDINGS OF THE 33RD ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH DEVELOPMENT IN INFORMATION RETRIEVAL, 2010, : 739 - 740
  • [3] Robust audio watermarking based on low-order Zernike moments
    Xiang, Shijun
    Huang, Jiwu
    Yang, Rui
    Wang, Chuntao
    Liu, Hongmei
    DIGITAL WATERMARKING, PROCEEDINGS, 2006, 4283 : 226 - 240
  • [4] Robust online music identification using spectral entropy in the compressed domain
    Yin, Changqing
    Li, Wei
    Luo, Yuanqing
    Tseng, Li-Chuan
    2014 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE WORKSHOPS (WCNCW), 2014, : 128 - +
  • [5] Verification system robust to occlusion using low-order Zernike moments of palmprint sub-images
    G. S. Badrinath
    Naresh K. Kachhi
    Phalguni Gupta
    Telecommunication Systems, 2011, 47 : 275 - 290
  • [6] Robust tuning of low-order controllers via uncertainty model identification
    Canale, M
    Fiorio, G
    Malan, S
    Taragna, M
    EUROPEAN JOURNAL OF CONTROL, 1999, 5 (2-4) : 316 - 328
  • [7] Verification system robust to occlusion using low-order Zernike moments of palmprint sub-images
    Badrinath, G. S.
    Kachhi, Naresh K.
    Gupta, Phalguni
    TELECOMMUNICATION SYSTEMS, 2011, 47 (3-4) : 275 - 290
  • [8] Time-domain identification of low-order models for flexible structures
    Bauer, RJ
    Hughes, PC
    JOURNAL OF GUIDANCE CONTROL AND DYNAMICS, 1999, 22 (06) : 908 - 909
  • [9] Time-domain identification of low-order models for flexible structures
    Dalhousie University, Halifax, NS B3J 1Z1, Canada
    不详
    J Guid Control Dyn, 6 (908-909):
  • [10] Low-Order Automatic Domain Splitting Approach for Nonlinear Uncertainty Mapping
    Losacco, Matteo
    Fossa, Alberto
    Armellin, Roberto
    JOURNAL OF GUIDANCE CONTROL AND DYNAMICS, 2024, 47 (02) : 291 - 310