A Novel Windowing Technique for Efficient Computation of MFCC for Speaker Recognition

Cited by: 65

Authors:

Sahidullah, Md [1]

Saha, Goutam [1]

Affiliations:

[1] Indian Inst Technol, Dept Elect & Elect Commun Engn, Kharagpur 721302, W Bengal, India

Source:

IEEE SIGNAL PROCESSING LETTERS | 2013 / Vol. 20 / No. 02

Keywords:

Differentiation in frequency; power spectrum estimation; speaker recognition; tapered window; mel-frequency cepstral coefficients (MFCC)

DOI:

10.1109/LSP.2012.2235067

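Chinese Library Classification (CLC) number:

TM [Electrical engineering]; TN [Electronic technology, communication technology]

Discipline classification code:

0808; 0809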

Abstract:

In this letter, we propose a novel family of windowing techniques for computing mel-frequency cepstral coefficients (MFCCs) for automatic speaker recognition from speech. The proposed method is based on a fundamental property of the discrete-time Fourier transform (DTFT) related to differentiation in the frequency domain. A classical windowing scheme, such as the Hamming window, is modified to obtain derivatives of the DTFT coefficients. It is shown mathematically that this technique takes into account the slope of the power spectrum and phase information. Speaker recognition systems based on the proposed family of window functions are shown to attain substantial and consistent performance improvement over the baseline single-tapered Hamming window as well as the recently proposed multitaper windowing technique.
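The DTFT property the abstract refers to can be illustrated with a short sketch. For a windowed frame y[n] = w[n] x[n] with DTFT Y(ω), multiplying by the sample index gives n·y[n] ↔ j·dY/dω, so an n-weighted window yields the frequency derivative of the spectrum directly. The code below is only an illustration of that property under assumed choices (a Hamming base window, first-order weighting, and the hypothetical function names `hamming`, `dtft`, and `dtft_derivative_via_window`); it is not the authors' window family or their MFCC pipeline, neither of which is specified in this record.

```python
import cmath
import math


def hamming(N):
    """Standard Hamming window of length N."""
    return [0.54 - 0.46 * math.cos(2 * math.pi * n / (N - 1)) for n in range(N)]


def dtft(x, omega):
    """Evaluate the DTFT X(omega) = sum_n x[n] * exp(-j*omega*n) at one frequency."""
    return sum(xn * cmath.exp(-1j * omega * n) for n, xn in enumerate(x))


def dtft_derivative_via_window(x, window, omega):
    """Return dY/d(omega) for y[n] = window[n] * x[n] using an n-weighted window.

    Relies on the DTFT property n*y[n] <-> j*dY/d(omega),
    hence dY/d(omega) = -j * DTFT{n * y[n]}.
    Illustrative assumption: first-order weighting of the base window.
    """
    weighted = [n * w * xn for n, (w, xn) in enumerate(zip(window, x))]
    return -1j * dtft(weighted, omega)


if __name__ == "__main__":
    # Synthetic test frame (an assumption; any real-valued frame works).
    N = 64
    frame = [math.sin(0.3 * n) + 0.5 * math.cos(1.1 * n) for n in range(N)]
    deriv = dtft_derivative_via_window(frame, hamming(N), omega=0.7)
    print("dY/domega at omega = 0.7:", deriv)
```

A quick sanity check is to compare the result against a central finite difference of `dtft` applied to the plainly windowed frame; the two should agree closely.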

Pages: 149-152

Page count: 4

50 items in total

[31] An Anti-noise MFCC Extraction Algorithm for Speaker Recognition
Wang, Hong
PROCEEDINGS OF INTERNATIONAL CONFERENCE ON RESOURCE ENVIRONMENT AND INFORMATION TECHNOLOGY IN 2010 (REIT' 2010), 2010, : 154 - 158
[32] Research of Speaker Recognition Based on the Weighted Fisher Ratio of MFCC
Huang, Chenchen
Gong, Wei
Fu, Wenlong
Feng, Dongyu
PROCEEDINGS 2013 INTERNATIONAL CONFERENCE ON MECHATRONIC SCIENCES, ELECTRIC ENGINEERING AND COMPUTER (MEC), 2013, : 904 - 907
[33] Analysis of Throat Microphone Using MFCC Features for Speaker Recognition
Visalakshi, R.
Dhanalakshmi, P.
Palanivel, S.
COMPUTATIONAL INTELLIGENCE, CYBER SECURITY AND COMPUTATIONAL MODELS, ICC3 2015, 2016, 412 : 35 - 41
[34] Speaker recognition using MFCC and hybrid model of VQ and GMM
1600, Springer Verlag (235):
[35] Bionic optimization of MFCC features based on speaker fast recognition
Lin, Zhaodong
Di, Changan
Chen, Xiong
APPLIED ACOUSTICS, 2021, 173
[36] A Speaker Identification System using MFCC Features with VQ Technique
Zulfiqar, Ali
Muhammad, Aslam
Enriquez A M, Martinez
2009 THIRD INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION TECHNOLOGY APPLICATION, VOL 3, PROCEEDINGS, 2009, : 115 - +
[37] Efficient Window for Monolingual and Crosslingual Speaker Identification using MFCC
Nagaraja, B. G.
Jayanna, H. S.
PROCEEDINGS OF THE 2013 INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING & COMMUNICATION SYSTEMS (ICACCS), 2013,
[38] A Speaker Identification Performance Comparison Based on The Classifier, The Computation Time and The Number Of MFCC
Ozcan, Zubeyir
Kayikcioglu, Temel
2017 25TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2017,
[39] Robust analysis and weighting on MFCC components for speech recognition and speaker identification
Zhou, Xi
Fu, Yun
Liu, Ming
Hasegawa-Johnson, Mark
Huang, Thomas S.
2007 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-5, 2007, : 188 - 191
[40] Speaker gender recognition based on combining the contribution of MFCC and pitch features
Engineering Lab on Intelligent Perception for Internet of Things, Shenzhen Graduate School, Peking University, Shenzhen 518055, Guangdong, China
Huazhong Ligong Daxue Xuebao, 2013, SUPPL.I (108-111+120):
