A Novel Windowing Technique for Efficient Computation of MFCC for Speaker Recognition

被引:65
|
作者
Sahidullah, Md [1 ]
Saha, Goutam [1 ]
机构
[1] Indian Inst Technol, Dept Elect & Elect Commun Engn, Kharagpur 721302, W Bengal, India
关键词
Differentiation in frequency; power spectrum estimation; speaker recognition; tapered window; mel-frequency cepstral coefficients (MFCC);
D O I
10.1109/LSP.2012.2235067
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this letter, we propose a novel family of windowing technique to compute mel frequency cepstral coefficient (MFCC) for automatic speaker recognition from speech. The proposed method is based on fundamental property of discrete time Fourier transform (DTFT) related to differentiation in frequency domain. Classical windowing scheme such as Hamming window is modified to obtain derivatives of discrete time Fourier transform coefficients. It is mathematically shown that this technique takes into account slope of power spectrum and phase information. Speaker recognition systems based on our proposed family of window functions are shown to attain substantial and consistent performance improvement over baseline single tapered Hamming window as well as recently proposed multitaper windowing technique.
引用
收藏
页码:149 / 152
页数:4
相关论文
共 50 条
  • [1] NOVEL WINDOWING TECHNIQUE OF MFCC FOR SPEAKER IDENTIFICATION WITH MODIFIED POLYNOMIAL CLASSIFIERS
    Bakshi, Aarti
    Kopparapu, Sunil Kumar
    Pawar, Sanjay
    Nema, Shikha
    2014 5TH INTERNATIONAL CONFERENCE CONFLUENCE THE NEXT GENERATION INFORMATION TECHNOLOGY SUMMIT (CONFLUENCE), 2014, : 292 - 297
  • [2] A Modified MFCC Feature Extraction Technique For Robust Speaker Recognition
    Sharma, Diksha
    Ali, Israj
    2015 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2015, : 1052 - 1057
  • [3] An Efficient Text Dependent Speaker Recognition using Fusion of MFCC and SBC
    Kishore, K. V. Krishna
    Sharrefaunnisa, Syed.
    Venkatramaphanikumar, S.
    2015 1ST INTERNATIONAL CONFERENCE ON FUTURISTIC TRENDS ON COMPUTATIONAL ANALYSIS AND KNOWLEDGE MANAGEMENT (ABLAZE), 2015, : 18 - 22
  • [4] Design, analysis and experimental evaluation of block based transformation in MFCC computation for speaker recognition
    Sahidullah, Md.
    Saha, Goutam
    SPEECH COMMUNICATION, 2012, 54 (04) : 543 - 565
  • [5] On the Use of MFCC Feature Vector Clustering for Efficient Text Dependent Speaker Recognition
    Samal, Ankit
    Parida, Deebyadeep
    Satapathy, Mihir Ranjan
    Mohanty, Mihir Narayan
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON FRONTIERS OF INTELLIGENT COMPUTING: THEORY AND APPLICATIONS (FICTA) 2013, 2014, 247 : 305 - 312
  • [6] Speaker Recognition using MFCC, shifted MFCC with Vector Quantization and Fuzzy
    Bansal, Priyanka
    Imam, Syed Akhtar
    Bharti, Roma
    2015 INTERNATIONAL CONFERENCE ON SOFT COMPUTING TECHNIQUES AND IMPLEMENTATIONS (ICSCTI), 2015,
  • [7] The speaker recognition system based on the dynamic MFCC
    Dong, Zhi-Feng
    Wang, Zeng-Fu
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2005, 18 (05): : 596 - 601
  • [8] On the importance of components of the MFCC in speech and speaker recognition
    Zhen, B.
    Wu, X.
    Liu, Z.
    Chi, H.
    Beijing Daxue Xuebao Ziran Kexue Ban/Acta Scientiarum uaturalium Universitatis Pekinensis, 2001, 37 (03): : 371 - 378
  • [9] Speaker Recognition by Combining MFCC and Phase Information
    Nakagawa, Seiichi
    Asakawa, Kouhei
    Wang, Longbiao
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 1065 - 1068
  • [10] Speaker recognition by combining MFCC and phase information
    Department of Information and Computer Sciences, Toyohashi University of Technology, Japan
    Int. Speech Commun. Assoc. - Annu. Conf. Int. Speech Commun. Assoc., Interspeech, (1065-1068):