On enhancing feature sequence filtering with filter-bank energy transformation in speaker verification with telephone speech

被引：0

作者：

Garreton, Claudio ^{[1
]}

Becerra Yoma, Nestor ^{[1
]}

机构：

[1] Univ Chile, Dept Elect Engn, Speech Proc & Transmiss Lab, Santiago, Chile

来源：

11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2 | 2010年

关键词：

robust features; speaker recognition; text-dependent speaker verification; telephone speech; BIAS REMOVAL;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper a novel feature enhancing method for channel robustness with short utterances is employed. The transform reduces the time-varying component of the channel distortion by applying a band-pass filter along the filter-bank domain on a frame-by-frame basis. This procedure enhances the channel cancelling effect given by techniques based on feature trajectory filtering. The transformation parameters are defined employing relative importance analysis based on a discriminant function. In text-dependent speaker verification with telephone speech the transform leads to a reduction in the EER of 10.8%, and further improvements of 23.5% and 40% when combined with RASTA or CMN, respectively.

引用

页码：1461 / 1464

页数：4

共 15 条

[1]

Campbell J., 1994, YOHO SPEAKER VERIFIC

[2] CEPSTRAL ANALYSIS TECHNIQUE FOR AUTOMATIC SPEAKER VERIFICATION [J].

FURUI, S .

IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1981, 29 (02) :254-272

[3] Channel Robust Feature Transformation Based on Filter-Bank Energy Filtering [J].

Garreton, Claudio ;

Becerra Yoma, Nestor ;

Torres, Matias .

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (05) :1082-1086

[4] Maximum a Posteriori Estimation for Multivariate Gaussian Mixture Observations of Markov Chains [J].

Gauvain, Jean-Luc ;

Lee, Chin-Hui .

IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1994, 2 (02) :291-298

[5] Filtering of filter-bank energies for robust speech recognition [J].

Jung, HY .

ETRI JOURNAL, 2004, 26 (03) :273-276

[6] On the relative importance of various components of the modulation spectrum for automatic speech recognition [J].

Kanedera, N ;

Arai, T ;

Hermansky, H ;

Pavel, M .

SPEECH COMMUNICATION, 1999, 28 (01) :43-55

[7] MAXIMUM-LIKELIHOOD LINEAR-REGRESSION FOR SPEAKER ADAPTATION OF CONTINUOUS DENSITY HIDDEN MARKOV-MODELS [J].

LEGGETTER, CJ ;

WOODLAND, PC .

COMPUTER SPEECH AND LANGUAGE, 1995, 9 (02) :171-185

[8] Stochastic feature transformation with divergence-based out-of-handset rejection for robust speaker verification [J].

Mak, MW ;

Tsang, CL ;

Kung, SY .

EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2004, 2004 (04) :452-465

[9] Time and frequency filtering of filter-bank energies for robust HMM speech recognition [J].

Nadeu, C ;

Macho, D ;

Hernando, J .

SPEECH COMMUNICATION, 2001, 34 (1-2) :93-114

[10]

Rahim MG, 1996, IEEE T SPEECH AUDI P, V4, P19

← 1 2 →