On enhancing feature sequence filtering with filter-bank energy transformation in speaker verification with telephone speech

被引:0
作者
Garreton, Claudio [1 ]
Becerra Yoma, Nestor [1 ]
机构
[1] Univ Chile, Dept Elect Engn, Speech Proc & Transmiss Lab, Santiago, Chile
来源
11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2 | 2010年
关键词
robust features; speaker recognition; text-dependent speaker verification; telephone speech; BIAS REMOVAL;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper a novel feature enhancing method for channel robustness with short utterances is employed. The transform reduces the time-varying component of the channel distortion by applying a band-pass filter along the filter-bank domain on a frame-by-frame basis. This procedure enhances the channel cancelling effect given by techniques based on feature trajectory filtering. The transformation parameters are defined employing relative importance analysis based on a discriminant function. In text-dependent speaker verification with telephone speech the transform leads to a reduction in the EER of 10.8%, and further improvements of 23.5% and 40% when combined with RASTA or CMN, respectively.
引用
收藏
页码:1461 / 1464
页数:4
相关论文
共 15 条
[1]  
Campbell J., 1994, YOHO SPEAKER VERIFIC
[2]   CEPSTRAL ANALYSIS TECHNIQUE FOR AUTOMATIC SPEAKER VERIFICATION [J].
FURUI, S .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1981, 29 (02) :254-272
[3]   Channel Robust Feature Transformation Based on Filter-Bank Energy Filtering [J].
Garreton, Claudio ;
Becerra Yoma, Nestor ;
Torres, Matias .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (05) :1082-1086
[4]   Maximum a Posteriori Estimation for Multivariate Gaussian Mixture Observations of Markov Chains [J].
Gauvain, Jean-Luc ;
Lee, Chin-Hui .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1994, 2 (02) :291-298
[5]   Filtering of filter-bank energies for robust speech recognition [J].
Jung, HY .
ETRI JOURNAL, 2004, 26 (03) :273-276
[6]   On the relative importance of various components of the modulation spectrum for automatic speech recognition [J].
Kanedera, N ;
Arai, T ;
Hermansky, H ;
Pavel, M .
SPEECH COMMUNICATION, 1999, 28 (01) :43-55
[7]   MAXIMUM-LIKELIHOOD LINEAR-REGRESSION FOR SPEAKER ADAPTATION OF CONTINUOUS DENSITY HIDDEN MARKOV-MODELS [J].
LEGGETTER, CJ ;
WOODLAND, PC .
COMPUTER SPEECH AND LANGUAGE, 1995, 9 (02) :171-185
[8]   Stochastic feature transformation with divergence-based out-of-handset rejection for robust speaker verification [J].
Mak, MW ;
Tsang, CL ;
Kung, SY .
EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2004, 2004 (04) :452-465
[9]   Time and frequency filtering of filter-bank energies for robust HMM speech recognition [J].
Nadeu, C ;
Macho, D ;
Hernando, J .
SPEECH COMMUNICATION, 2001, 34 (1-2) :93-114
[10]  
Rahim MG, 1996, IEEE T SPEECH AUDI P, V4, P19