Features Extracted Using Frequency-Time Analysis Approach from Nyquist Filter Bank and Gaussian Filter Bank for Text-Independent Speaker Identification

Cited by: 0

Authors:

Sen, Nirmalya [1]

Basu, T. K. [2]

Affiliations:

[1] IIT Kharagpur, CET, Signal Proc Res Grp, Kharagpur, W Bengal, India

[2] IIT Kharagpur, Dept Elect Engn, Kharagpur, W Bengal, India

Source:

BIOMETRICS AND ID MANAGEMENT | 2011 / Vol. 6583

Keywords:

Speaker identification; Feature extraction; Frequency-time analysis; RECOGNITION;

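DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract: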

This paper compares feature sets extracted using the frequency-time analysis approach with those extracted using the time-frequency analysis approach for text-independent speaker identification. The impetus for the frequency-time analysis approach comes from the band-pass filtering view of the STFT. Both a Nyquist filter bank and a Gaussian filter bank were used to extract features with the frequency-time analysis approach. Experimental evaluation was conducted on the POLYCOST database with 130 speakers using a Gaussian mixture speaker model. The results show that the feature sets extracted using the frequency-time analysis approach perform significantly better than the feature set extracted using the time-frequency analysis approach.
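The band-pass filtering view of the STFT mentioned in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the Gaussian window length and width, the band centers, the frame sizes, and all function names below are hypothetical choices made only for illustration, and the Nyquist filter bank is not shown. Each band is obtained by shifting the signal down by the band's center frequency and low-pass filtering it with a Gaussian window; per-frame log energies of the band envelopes then serve as a toy feature vector.

```python
import cmath
import math


def gaussian_window(length, sigma):
    """Gaussian low-pass prototype, normalized to unit DC gain (assumed shape)."""
    mid = (length - 1) / 2.0
    w = [math.exp(-0.5 * ((n - mid) / sigma) ** 2) for n in range(length)]
    total = sum(w)
    return [v / total for v in w]


def band_envelopes(signal, centers, window):
    """Filter-bank view of the STFT: for each center frequency (cycles/sample),
    demodulate the signal to baseband, low-pass it with the window, and keep
    the magnitude at every sample where the window fits entirely."""
    length = len(window)
    envelopes = []
    for fc in centers:
        shifted = [x * cmath.exp(-2j * math.pi * fc * n)
                   for n, x in enumerate(signal)]
        env = []
        for start in range(len(signal) - length + 1):
            acc = sum(shifted[start + k] * window[k] for k in range(length))
            env.append(abs(acc))
        envelopes.append(env)
    return envelopes


def frame_log_energies(envelopes, frame_len, hop):
    """Per-frame log mean energy of each band envelope (one vector per frame)."""
    n_frames = 1 + (len(envelopes[0]) - frame_len) // hop
    feats = []
    for i in range(n_frames):
        seg = slice(i * hop, i * hop + frame_len)
        feats.append([
            math.log(sum(v * v for v in env[seg]) / frame_len + 1e-12)
            for env in envelopes
        ])
    return feats


# Toy input: a pure tone at 0.1 cycles/sample (hypothetical test signal).
tone = [math.cos(2 * math.pi * 0.1 * n) for n in range(400)]
centers = [0.05, 0.1, 0.2]  # hypothetical band centers
envelopes = band_envelopes(tone, centers, gaussian_window(65, 10.0))
features = frame_log_energies(envelopes, frame_len=80, hop=40)
```

With this toy tone, the band centered at 0.1 cycles/sample carries the most energy in every frame, as expected. In a speaker identification system along the lines the abstract describes, vectors of this kind would be passed to a Gaussian mixture speaker model; that stage is omitted here.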

Citation

Page: 125 / +

Number of pages: 2
