A Novel Windowing Technique for Efficient Computation of MFCC for Speaker Recognition

被引:65
|
作者
Sahidullah, Md [1 ]
Saha, Goutam [1 ]
机构
[1] Indian Inst Technol, Dept Elect & Elect Commun Engn, Kharagpur 721302, W Bengal, India
关键词
Differentiation in frequency; power spectrum estimation; speaker recognition; tapered window; mel-frequency cepstral coefficients (MFCC);
D O I
10.1109/LSP.2012.2235067
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this letter, we propose a novel family of windowing technique to compute mel frequency cepstral coefficient (MFCC) for automatic speaker recognition from speech. The proposed method is based on fundamental property of discrete time Fourier transform (DTFT) related to differentiation in frequency domain. Classical windowing scheme such as Hamming window is modified to obtain derivatives of discrete time Fourier transform coefficients. It is mathematically shown that this technique takes into account slope of power spectrum and phase information. Speaker recognition systems based on our proposed family of window functions are shown to attain substantial and consistent performance improvement over baseline single tapered Hamming window as well as recently proposed multitaper windowing technique.
引用
收藏
页码:149 / 152
页数:4
相关论文
共 50 条
  • [31] An Anti-noise MFCC Extraction Algorithm for Speaker Recognition
    Wang, Hong
    PROCEEDINGS OF INTERNATIONAL CONFERENCE ON RESOURCE ENVIRONMENT AND INFORMATION TECHNOLOGY IN 2010 (REIT' 2010), 2010, : 154 - 158
  • [32] Research of Speaker Recognition Based on the Weighted Fisher Ratio of MFCC
    Huang, Chenchen
    Gong, Wei
    Fu, Wenlong
    Feng, Dongyu
    PROCEEDINGS 2013 INTERNATIONAL CONFERENCE ON MECHATRONIC SCIENCES, ELECTRIC ENGINEERING AND COMPUTER (MEC), 2013, : 904 - 907
  • [33] Analysis of Throat Microphone Using MFCC Features for Speaker Recognition
    Visalakshi, R.
    Dhanalakshmi, P.
    Palanivel, S.
    COMPUTATIONAL INTELLIGENCE, CYBER SECURITY AND COMPUTATIONAL MODELS, ICC3 2015, 2016, 412 : 35 - 41
  • [35] Bionic optimization of MFCC features based on speaker fast recognition
    Lin, Zhaodong
    Di, Changan
    Chen, Xiong
    APPLIED ACOUSTICS, 2021, 173
  • [36] A Speaker Identification System using MFCC Features with VQ Technique
    Zulfiqar, Ali
    Muhammad, Aslam
    Enriquez A M, Martinez
    2009 THIRD INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION TECHNOLOGY APPLICATION, VOL 3, PROCEEDINGS, 2009, : 115 - +
  • [37] Efficient Window for Monolingual and Crosslingual Speaker Identification using MFCC
    Nagaraja, B. G.
    Jayanna, H. S.
    PROCEEDINGS OF THE 2013 INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING & COMMUNICATION SYSTEMS (ICACCS), 2013,
  • [38] A Speaker Identification Performance Comparison Based on The Classifier, The Computation Time and The Number Of MFCC
    Ozcan, Zubeyir
    Kayikcioglu, Temel
    2017 25TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2017,
  • [39] Robust analysis and weighting on MFCC components for speech recognition and speaker identification
    Zhou, Xi
    Fu, Yun
    Liu, Ming
    Hasegawa-Johnson, Mark
    Huang, Thomas S.
    2007 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-5, 2007, : 188 - 191
  • [40] Speaker gender recognition based on combining the contribution of MFCC and pitch features
    Engineering Lab on Intelligent Perception for Internet of Things, Shenzhen Graduate School, Peking University, Shenzhen 518055, Guangdong, China
    Huazhong Ligong Daxue Xuebao, 2013, SUPPL.I (108-111+120):