Features Extracted Using Frequency-Time Analysis Approach from Nyquist Filter Bank and Gaussian Filter Bank for Text-Independent Speaker Identification

Cited by: 0
Authors
Sen, Nirmalya [1]
Basu, T. K. [2]
Affiliations
[1] IIT Kharagpur, CET, Signal Proc Res Grp, Kharagpur, W Bengal, India
[2] IIT Kharagpur, Dept Elect Engn, Kharagpur, W Bengal, India
Source
BIOMETRICS AND ID MANAGEMENT | 2011 / Vol. 6583
Keywords
Speaker identification; Feature extraction; Frequency-time analysis; RECOGNITION;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This paper compares feature sets extracted using the frequency-time analysis approach and the time-frequency analysis approach for text-independent speaker identification. The impetus for the frequency-time analysis approach comes from the band-pass filtering view of the STFT. Both a Nyquist filter bank and a Gaussian filter bank were used to extract features with the frequency-time analysis approach. Experimental evaluation was conducted on the POLYCOST database with 130 speakers using a Gaussian mixture speaker model. Results reveal that the feature sets extracted using the frequency-time analysis approach perform significantly better than the feature set extracted using the time-frequency analysis approach.
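The band-pass filtering view of the STFT mentioned in the abstract can be illustrated with a minimal sketch. It checks numerically that one STFT frequency channel, X(n, ω) = Σ_m x[m] w[n-m] e^{-jωm}, equals the output of a band-pass filter h[k] = w[k] e^{jωk} followed by demodulation with e^{-jωn}. The Gaussian window, its length and width, the test signal, and all function names are assumptions chosen for illustration; they are not the Nyquist or Gaussian filter banks used in the paper.

```python
import numpy as np


def gaussian_window(length, sigma):
    """Hypothetical Gaussian analysis window, centered on the window midpoint."""
    k = np.arange(length)
    return np.exp(-0.5 * ((k - (length - 1) / 2.0) / sigma) ** 2)


def stft_direct(x, w, omega):
    """One STFT channel from the definition:
    X(n, omega) = sum_m x[m] * w[n - m] * exp(-j * omega * m).
    The signal is demodulated first, then smoothed by the low-pass window w.
    """
    m = np.arange(len(x))
    return np.convolve(x * np.exp(-1j * omega * m), w)


def stft_bandpass(x, w, omega):
    """The same channel via the band-pass filtering view:
    filter x with h[k] = w[k] * exp(j * omega * k), a band-pass filter
    centered at omega, then shift the output down by exp(-j * omega * n).
    """
    k = np.arange(len(w))
    h = w * np.exp(1j * omega * k)
    y = np.convolve(x, h)
    n = np.arange(len(y))
    return np.exp(-1j * omega * n) * y


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    x = rng.standard_normal(400)        # stand-in for a speech signal
    w = gaussian_window(65, 10.0)       # assumed window length and width
    omega = 2 * np.pi * 0.1             # channel center, radians per sample

    a = stft_direct(x, w, omega)
    b = stft_bandpass(x, w, omega)
    print("channels agree:", np.allclose(a, b))
```

Running one such filter per center frequency gives a filter bank whose outputs are time sequences per band, which is the sense in which the analysis proceeds over frequency first and time second.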
Pages: 125 / +
Page count: 2
Related papers
11 in total
  • [1] Subband architecture for automatic speaker recognition
    Besacier, L
    Bonastre, JF
    [J]. SIGNAL PROCESSING, 2000, 80 (07) : 1245 - 1259
  • [2] CHAKROBORTY S, 2007, INT J SIGNAL PROCESS, V4, P1304
  • [3] DAVIS S, 1980, SIGNAL PROCESS, V4, P357
  • [4] HAYAKAWA S, 1994, INT CONF ACOUST SPEE, P137
  • [5] HAYKIN SS, 2001, SIGNALS SYSTEMS
  • [6] An investigation of dependencies between frequency components and speaker characteristics for text-independent speaker identification
    Lu, Xugang
    Dang, Jianwu
    [J]. SPEECH COMMUNICATION, 2008, 50 (04) : 312 - 322
  • [7] Petrovska D, 1998, RLA2C, P211
  • [8] Quatieri T. F., DISCRETE TIME SPEECH
  • [9] ROBUST TEXT-INDEPENDENT SPEAKER IDENTIFICATION USING GAUSSIAN MIXTURE SPEAKER MODELS
    REYNOLDS, DA
    ROSE, RC
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1995, 3 (01) : 72 - 83
  • [10] SEN N, 2010, 5 INT C IND INF SYST, P61