Robust audio fingerprinting based on GammaChirp frequency cepstral coefficients and chroma

Cited by: 3

Authors:

Chen, N. [1]

Xiao, H. D. [2]

Zhu, J. [3]

Affiliations:

[1] E China Univ Sci & Technol, Sch Informat Sci & Technol, Shanghai 200237, Peoples R China

[2] Chinese Acad Sci, Shanghai Adv Res Inst, Shanghai 201210, Peoples R China

[3] Shanghai Jiao Tong Univ, Dept Elect Engn, Shanghai 200240, Peoples R China

Source:

ELECTRONICS LETTERS | 2014 / Vol. 50 / Issue 04

Funding:

Natural Science Foundation of Shanghai; National Natural Science Foundation of China

Keywords:

DOI:

10.1049/el.2013.3554

Chinese Library Classification (CLC):

TM [Electrical engineering]; TN [Electronics and communication technology]

Discipline classification codes:

0808; 0809

Abstract:

A novel auditory feature that combines an auditory model with music theory is proposed for audio fingerprinting. First, the input audio is filtered by a GammaChirp (GC) filterbank to model cochlear frequency selectivity. Then, the output of the filterbank is downsampled and decorrelated by a discrete cosine transform to obtain the GammaChirp frequency cepstral coefficients (GCFCCs). Next, a few of the lowest-order GCFCCs are projected onto the chroma to characterise both the melodic and the harmonic information of the input. Finally, non-negative matrix factorisation is applied to the chroma matrix to reduce its dimension while maintaining its discriminative power. The experimental results show that the proposed scheme achieves a more stable identification rate and lower computational complexity than schemes based on Mel-frequency cepstral coefficients. © The Institution of Engineering and Technology 2014.
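
The four stages named in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the abstract gives no parameter values, so the filter order, bandwidth and chirp constants, the semitone-spaced centre frequencies, the frame and hop sizes, the number of retained coefficients, and the NMF rank are all placeholders. In particular, the abstract does not say how the low-order GCFCCs are projected onto the chroma; the sketch assumes one plausible reading (discard the high-order coefficients, invert the DCT, and fold the channels into 12 pitch classes). All function names are hypothetical.

```python
import numpy as np

def erb(fc):
    # Equivalent rectangular bandwidth in Hz (Glasberg-Moore style approximation).
    return 24.7 + 0.108 * fc

def gammachirp_ir(fc, fs, dur=0.025, n=4, b=1.019, c=-2.0):
    # Gammachirp impulse response: t^(n-1) exp(-2*pi*b*ERB(fc)*t) cos(2*pi*fc*t + c*ln t).
    # n, b and c are assumed values, not taken from the paper.
    t = np.arange(1, int(dur * fs) + 1) / fs   # start at 1/fs so ln(t) stays finite
    g = t ** (n - 1) * np.exp(-2 * np.pi * b * erb(fc) * t)
    g = g * np.cos(2 * np.pi * fc * t + c * np.log(t))
    return g / np.sqrt(np.sum(g ** 2))          # normalise to unit energy

def dct_matrix(k):
    # Orthonormal DCT-II matrix (k x k); its transpose is the inverse transform.
    idx = np.arange(k)
    m = np.sqrt(2.0 / k) * np.cos(np.pi * (idx[None, :] + 0.5) * idx[:, None] / k)
    m[0] /= np.sqrt(2.0)
    return m

def gcfcc(x, fs, centres, frame=1024, hop=512):
    # Stage 1: filter with each gammachirp channel.
    # Stage 2: downsample to one log-energy per frame, then decorrelate with a DCT.
    n_frames = 1 + (len(x) - frame) // hop
    log_e = np.empty((len(centres), n_frames))
    for i, fc in enumerate(centres):
        y = np.convolve(x, gammachirp_ir(fc, fs), mode="same")
        for j in range(n_frames):
            seg = y[j * hop : j * hop + frame]
            log_e[i, j] = np.log(np.sum(seg ** 2) + 1e-12)
    return dct_matrix(len(centres)) @ log_e

def chroma_from_gcfcc(coeffs, centres, keep=13):
    # Stage 3 (ASSUMED mapping): keep the lowest-order coefficients, invert the DCT,
    # and sum the smoothed channel energies into 12 pitch classes (0 = A).
    low = coeffs.copy()
    low[keep:] = 0.0
    energy = np.exp(dct_matrix(coeffs.shape[0]).T @ low)   # non-negative energies
    pitch_class = np.round(12 * np.log2(centres / 440.0)).astype(int) % 12
    chroma = np.zeros((12, coeffs.shape[1]))
    for i, pc in enumerate(pitch_class):
        chroma[pc] += energy[i]
    return chroma

def nmf(v, rank, iters=200, seed=0):
    # Stage 4: NMF with Lee-Seung multiplicative updates (Euclidean cost), v ~ w @ h.
    rng = np.random.default_rng(seed)
    w = rng.random((v.shape[0], rank)) + 1e-3
    h = rng.random((rank, v.shape[1])) + 1e-3
    for _ in range(iters):
        h *= (w.T @ v) / (w.T @ w @ h + 1e-12)
        w *= (v @ h.T) / (w @ h @ h.T + 1e-12)
    return w, h

# Usage on a synthetic one-second signal (two sinusoids).
fs = 8000
t = np.arange(fs) / fs
x = np.sin(2 * np.pi * 440 * t) + 0.5 * np.sin(2 * np.pi * 660 * t)
centres = 110.0 * 2 ** (np.arange(36) / 12)   # three octaves, semitone spacing (assumed)
coeffs = gcfcc(x, fs, centres)
chroma = chroma_from_gcfcc(coeffs, centres)
w, h = nmf(chroma, rank=4)                    # h is the reduced (4 x frames) feature
```

Here the reduced activation matrix `h` stands in for the compact fingerprint feature. How the fingerprint is actually quantised and matched is not described in the abstract, so it is left out.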


Pages: 241 / U174

Number of pages: 2
