Robust audio fingerprinting based on GammaChirp frequency cepstral coefficients and chroma

被引:3
|
作者
Chen, N. [1 ]
Xiao, H. D. [2 ]
Zhu, J. [3 ]
机构
[1] E China Univ Sci & Technol, Sch Informat Sci & Technol, Shanghai 200237, Peoples R China
[2] Chinese Acad Sci, Shanghai Adv Res Inst, Shanghai 201210, Peoples R China
[3] Shanghai Jiao Tong Univ, Dept Elect Engn, Shanghai 200240, Peoples R China
基金
上海市自然科学基金; 中国国家自然科学基金;
关键词
D O I
10.1049/el.2013.3554
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
A novel auditory feature that combines an auditory model and music theory is proposed for audio fingerprinting. First, the input audio is filtered by a GammaChirp (GC) filterbank to model the cochlear frequency selectivity. Then, the output of the filterbank is downsampled and decorrelated by a discrete cosine transform to obtain the GammaChirp frequency cepstral coefficients (GCFCCs). Next, some lowest order GCFCCs are projected onto the chroma to characterise both melodic and harmonic information of the input. Finally, non-negative matrix factorisation is applied to the chroma matrix to reduce its dimension while maintaining its discriminative power. The experimental results illustrate that the proposed scheme achieves a stabler identification rate and lower computational complexity than the schemes based on the Mel-frequency cepstral coefficients. © The Institution of Engineering and Technology 2014.
引用
收藏
页码:241 / U174
页数:2
相关论文
共 50 条
  • [41] RETRACTED: Breathing site classification via joint mel frequency cepstral coefficients and gammatone frequency cepstral coefficients approach (Retracted Article)
    Zhang, Jiarui
    Ling, Bingo Wing-Kuen
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2024, 46 (02) : 3623 - 3634
  • [42] Audio fingerprinting based on analyzing time-frequency localization of signals
    Lu, CS
    PROCEEDINGS OF THE 2002 IEEE WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, 2002, : 174 - 177
  • [43] Multilayered convolutional neural network-based auto-CODEC for audio signal denoising using mel-frequency cepstral coefficients
    Raj, Shivangi
    Prakasam, P.
    Gupta, Shubham
    NEURAL COMPUTING & APPLICATIONS, 2021, 33 (16): : 10199 - 10209
  • [44] Modified Mel-frequency Cepstral Coefficients (MMFCC) in Robust Text-dependent Speaker Identification
    Islam, Md. Atiqul
    2017 4TH INTERNATIONAL CONFERENCE ON ADVANCES IN ELECTRICAL ENGINEERING (ICAEE), 2017, : 505 - 509
  • [45] Multilayered convolutional neural network-based auto-CODEC for audio signal denoising using mel-frequency cepstral coefficients
    Shivangi Raj
    P. Prakasam
    Shubham Gupta
    Neural Computing and Applications, 2021, 33 : 10199 - 10209
  • [46] Bone-conducted Speech Enhancement Using Vector-quantized Variational Autoencoder and Gammachirp Filterbank Cepstral Coefficients
    Quoc-Huy Nguyen
    Unoki, Masashi
    2022 30TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2022), 2022, : 21 - 25
  • [47] Wavelet-Based Audio Fingerprinting Algorithm Robust to Linear Speed Change
    Liu, Jixin
    Zhang, Tingxian
    COMPUTING AND INTELLIGENT SYSTEMS, PT IV, 2011, 234 : 360 - +
  • [48] Wavelet-based Audio Fingerprinting Algorithm Robust to Linear Speed Change
    Liu, Jixin
    Zhang, Tingxian
    2010 INTERNATIONAL CONFERENCE ON BIO-INSPIRED SYSTEMS AND SIGNAL PROCESSING (ICBSSP 2010), 2010, : 73 - 76
  • [49] Frequency-temporal filtering for a robust audio fingerprinting scheme in real-noise environments
    Park, Mansoo
    Kim, Hoi-Rin
    Yang, Seung Hyun
    ETRI JOURNAL, 2006, 28 (04) : 509 - 512
  • [50] DELTA-SPECTRAL CEPSTRAL COEFFICIENTS FOR ROBUST SPEECH RECOGNITION
    Kumar, Kshitiz
    Kim, Chanwoo
    Stern, Richard M.
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 4784 - 4787