Robust audio fingerprinting based on GammaChirp frequency cepstral coefficients and chroma

Cited by: 3
Authors
Chen, N. [1]
Xiao, H. D. [2]
Zhu, J. [3]
Affiliations
[1] E China Univ Sci & Technol, Sch Informat Sci & Technol, Shanghai 200237, Peoples R China
[2] Chinese Acad Sci, Shanghai Adv Res Inst, Shanghai 201210, Peoples R China
[3] Shanghai Jiao Tong Univ, Dept Elect Engn, Shanghai 200240, Peoples R China
Funding
Natural Science Foundation of Shanghai; National Natural Science Foundation of China
DOI
10.1049/el.2013.3554
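Chinese Library Classification
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Discipline classification codes
0808; 0809
Abstract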
A novel auditory feature that combines an auditory model with music theory is proposed for audio fingerprinting. First, the input audio is filtered by a GammaChirp (GC) filterbank to model cochlear frequency selectivity. Then, the output of the filterbank is downsampled and decorrelated by a discrete cosine transform to obtain the GammaChirp frequency cepstral coefficients (GCFCCs). Next, several of the lowest-order GCFCCs are projected onto the chroma to characterise both the melodic and the harmonic information of the input. Finally, non-negative matrix factorisation is applied to the chroma matrix to reduce its dimension while maintaining its discriminative power. The experimental results show that the proposed scheme achieves a more stable identification rate and lower computational complexity than schemes based on Mel-frequency cepstral coefficients. © The Institution of Engineering and Technology 2014.
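The four stages named in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the abstract gives no parameter values, so the filter order `n`, bandwidth factor `b`, chirp factor `c`, the semitone-spaced centre frequencies, the hop size, the number of retained coefficients, and the NMF rank are all hypothetical choices. In particular, the chroma projection here (truncate the cepstrum, invert the DCT, fold channels into 12 pitch classes by nominal centre frequency) is one plausible reading of "projected onto the chroma"; it ignores the shift of the filter's magnitude peak that a non-zero `c` introduces. The NMF uses the standard multiplicative updates for the Frobenius norm.

```python
import numpy as np

def erb(f):
    # Glasberg-Moore equivalent rectangular bandwidth in Hz
    return 24.7 + 0.108 * f

def gammachirp_ir(fc, sr, n=4, b=1.019, c=-2.0, dur=0.05):
    # g(t) = t^(n-1) exp(-2 pi b ERB(fc) t) cos(2 pi fc t + c ln t)
    # n, b, c and dur are illustrative values, not taken from the paper.
    t = np.arange(1, int(dur * sr) + 1) / sr  # start at 1/sr so ln(t) is finite
    g = t ** (n - 1) * np.exp(-2 * np.pi * b * erb(fc) * t)
    g = g * np.cos(2 * np.pi * fc * t + c * np.log(t))
    return g / np.sqrt(np.sum(g ** 2))  # unit-energy impulse response

def dct_matrix(m):
    # Orthonormal DCT-II matrix (m x m); its transpose is its inverse.
    k = np.arange(m)[:, None]
    j = np.arange(m)[None, :]
    d = np.sqrt(2.0 / m) * np.cos(np.pi * (j + 0.5) * k / m)
    d[0] /= np.sqrt(2.0)
    return d

def gcfcc(x, sr, centres, hop=512):
    # Filter each channel, downsample to per-frame log energy,
    # then decorrelate across channels with a DCT.
    n_frames = len(x) // hop
    log_e = np.empty((len(centres), n_frames))
    for i, fc in enumerate(centres):
        y = np.convolve(x, gammachirp_ir(fc, sr), mode="same")
        frames = y[: n_frames * hop].reshape(n_frames, hop)
        log_e[i] = np.log(np.sum(frames ** 2, axis=1) + 1e-10)
    return dct_matrix(len(centres)) @ log_e

def chroma_from_gcfcc(coeffs, centres, keep=12):
    # Assumption: keep the lowest-order coefficients, invert the DCT to get a
    # smoothed band spectrum, then fold channels into 12 pitch classes.
    m = coeffs.shape[0]
    trunc = np.zeros_like(coeffs)
    trunc[:keep] = coeffs[:keep]
    smooth = np.exp(dct_matrix(m).T @ trunc)  # non-negative band energies
    # Pitch class 0 is A here (reference 440 Hz), 1 is A#, and so on.
    pitch_class = np.round(12 * np.log2(centres / 440.0)).astype(int) % 12
    chroma = np.zeros((12, coeffs.shape[1]))
    for ch, pc in enumerate(pitch_class):
        chroma[pc] += smooth[ch]
    return chroma / (chroma.sum(axis=0, keepdims=True) + 1e-10)

def nmf(v, rank, iters=200, seed=0):
    # Multiplicative-update NMF: v (12 x T) is approximated by w (12 x rank) @ h (rank x T).
    rng = np.random.default_rng(seed)
    w = rng.random((v.shape[0], rank)) + 1e-3
    h = rng.random((rank, v.shape[1])) + 1e-3
    for _ in range(iters):
        h *= (w.T @ v) / (w.T @ w @ h + 1e-10)
        w *= (v @ h.T) / (w @ h @ h.T + 1e-10)
    return w, h

# Toy input: two seconds of a two-tone signal at 8 kHz.
sr = 8000
t = np.arange(2 * sr) / sr
x = np.sin(2 * np.pi * 440 * t) + 0.5 * np.sin(2 * np.pi * 660 * t)

centres = 440.0 * 2.0 ** (np.arange(-24, 24) / 12.0)  # 48 semitone-spaced channels
coeffs = gcfcc(x, sr, centres)
chroma = chroma_from_gcfcc(coeffs, centres)
w, h = nmf(chroma, rank=4)
print(coeffs.shape, chroma.shape, h.shape)
```

In this sketch the reduced matrix `h` (rank x frames) would serve as the compact feature from which a fingerprint is derived; how the paper turns the factorisation into a fingerprint and matches it is not stated in the abstract and is not modelled here.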
Pages: 241 / U174
Page count: 2
Related papers
50 in total
  • [21] Phase Based Mel Frequency Cepstral Coefficients for Speaker Identification
    Srivastava, Sumit
    Chandra, Mahesh
    Sahoo, G.
    INFORMATION SYSTEMS DESIGN AND INTELLIGENT APPLICATIONS, VOL 3, INDIA 2016, 2016, 435 : 309 - 316
  • [22] Speaker identification based on normalized pitch frequency and Mel Frequency Cepstral Coefficients
    Nasr, Marwa A.
    Abd-Elnaby, Mohammed
    El-Fishawy, Adel S.
    El-Rabaie, S.
    Abd El-Samie, Fathi E.
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2018, 21 (04) : 941 - 951
  • [23] DCT BASED MULTIPLE HASHING TECHNIQUE FOR ROBUST AUDIO FINGERPRINTING
    Liu, Yu
    Cho, Kiho
    Yun, Hwan Sik
    Shin, Jong Won
    Kim, Nain Soo
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-8, PROCEEDINGS, 2009, : 61 - +
  • [24] Mel Frequency Cepstral Coefficients Based Similar Albanian Phonemes Recognition
    Karahoda, Bertan
    Pireva, Krenare
    Imran, Ali Shariq
    HUMAN INTERFACE AND THE MANAGEMENT OF INFORMATION: INFORMATION, DESIGN AND INTERACTION, PT I, 2016, 9734 : 491 - 500
  • [25] Palmprint recognition based on Mel frequency Cepstral coefficients feature extraction
    Fahmy, Maged M. M.
    AIN SHAMS ENGINEERING JOURNAL, 2010, 1 (01) : 39 - 47
  • [26] A New Approach for Fingerprint Recognition Based on Mel Frequency Cepstral Coefficients
    Hashad, F. G.
    Halim, T. M.
    Diab, S. M.
    Sallam, B. M.
    2009 INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING AND SYSTEMS (ICCES 2009), 2009, : 263 - +
  • [27] Perceptual MVDR-based cepstral coefficients (PMCCs) for robust speech recognition
    Yapanel, UH
    Dharanipragada, S
    2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I, 2003, : 644 - 647
  • [28] Chroma-based statistical audio features for audio matching
    Müller, M
    Kurth, F
    Clausen, M
    2005 WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2005, : 275 - 278
  • [29] Gammatone Wavelet Cepstral Coefficients for Robust Speech Recognition
    Adiga, Aniruddha
    Magimai-Doss, Mathew
    Seelamantula, Chandra Sekhar
    2013 IEEE INTERNATIONAL CONFERENCE OF IEEE REGION 10 (TENCON), 2013,
  • [30] Robust Audio Fingerprinting Based on Local Spectral Luminance Maxima Scheme
    Shi, Yong-zhe
    Zhang, Wei-Qiang
    Liu, Jia
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2496 - 2499