A Novel Windowing Technique for Efficient Computation of MFCC for Speaker Recognition

被引：65

作者：

Sahidullah, Md ^{[1
]}

Saha, Goutam ^{[1
]}

机构：

[1] Indian Inst Technol, Dept Elect & Elect Commun Engn, Kharagpur 721302, W Bengal, India

来源：

IEEE SIGNAL PROCESSING LETTERS | 2013年 / 20卷 / 02期

关键词：

Differentiation in frequency; power spectrum estimation; speaker recognition; tapered window; mel-frequency cepstral coefficients (MFCC);

D O I：

10.1109/LSP.2012.2235067

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

In this letter, we propose a novel family of windowing technique to compute mel frequency cepstral coefficient (MFCC) for automatic speaker recognition from speech. The proposed method is based on fundamental property of discrete time Fourier transform (DTFT) related to differentiation in frequency domain. Classical windowing scheme such as Hamming window is modified to obtain derivatives of discrete time Fourier transform coefficients. It is mathematically shown that this technique takes into account slope of power spectrum and phase information. Speaker recognition systems based on our proposed family of window functions are shown to attain substantial and consistent performance improvement over baseline single tapered Hamming window as well as recently proposed multitaper windowing technique.

引用

页码：149 / 152

页数：4

共 50 条

[1] NOVEL WINDOWING TECHNIQUE OF MFCC FOR SPEAKER IDENTIFICATION WITH MODIFIED POLYNOMIAL CLASSIFIERS
Bakshi, Aarti
Kopparapu, Sunil Kumar
Pawar, Sanjay
Nema, Shikha
2014 5TH INTERNATIONAL CONFERENCE CONFLUENCE THE NEXT GENERATION INFORMATION TECHNOLOGY SUMMIT (CONFLUENCE), 2014, : 292 - 297
[2] A Modified MFCC Feature Extraction Technique For Robust Speaker Recognition
Sharma, Diksha
Ali, Israj
2015 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2015, : 1052 - 1057
[3] An Efficient Text Dependent Speaker Recognition using Fusion of MFCC and SBC
Kishore, K. V. Krishna
Sharrefaunnisa, Syed.
Venkatramaphanikumar, S.
2015 1ST INTERNATIONAL CONFERENCE ON FUTURISTIC TRENDS ON COMPUTATIONAL ANALYSIS AND KNOWLEDGE MANAGEMENT (ABLAZE), 2015, : 18 - 22
[4] Design, analysis and experimental evaluation of block based transformation in MFCC computation for speaker recognition
Sahidullah, Md.
Saha, Goutam
SPEECH COMMUNICATION, 2012, 54 (04) : 543 - 565
[5] On the Use of MFCC Feature Vector Clustering for Efficient Text Dependent Speaker Recognition
Samal, Ankit
Parida, Deebyadeep
Satapathy, Mihir Ranjan
Mohanty, Mihir Narayan
PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON FRONTIERS OF INTELLIGENT COMPUTING: THEORY AND APPLICATIONS (FICTA) 2013, 2014, 247 : 305 - 312
[6] Speaker Recognition using MFCC, shifted MFCC with Vector Quantization and Fuzzy
Bansal, Priyanka
Imam, Syed Akhtar
Bharti, Roma
2015 INTERNATIONAL CONFERENCE ON SOFT COMPUTING TECHNIQUES AND IMPLEMENTATIONS (ICSCTI), 2015,
[7] The speaker recognition system based on the dynamic MFCC
Dong, Zhi-Feng
Wang, Zeng-Fu
Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2005, 18 (05): : 596 - 601
[8] On the importance of components of the MFCC in speech and speaker recognition
Zhen, B.
Wu, X.
Liu, Z.
Chi, H.
Beijing Daxue Xuebao Ziran Kexue Ban/Acta Scientiarum uaturalium Universitatis Pekinensis, 2001, 37 (03): : 371 - 378
[9] Speaker Recognition by Combining MFCC and Phase Information
Nakagawa, Seiichi
Asakawa, Kouhei
Wang, Longbiao
INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 1065 - 1068
[10] Speaker recognition by combining MFCC and phase information
Department of Information and Computer Sciences, Toyohashi University of Technology, Japan
Int. Speech Commun. Assoc. - Annu. Conf. Int. Speech Commun. Assoc., Interspeech, (1065-1068):

← 1 2 3 4 5 →