A discriminative training algorithm for VQ-based speaker identification

被引:29
作者
He, JL [1 ]
Liu, L [1 ]
Palm, G [1 ]
机构
[1] Univ Ulm, Abt Neuroinformat, D-89069 Ulm, Germany
来源
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING | 1999年 / 7卷 / 03期
关键词
neural networks; speaker identification; training algorithms; vector quantization;
D O I
10.1109/89.759047
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
A novel method, referred to as group vector quantization (GVQ), is proposed to train VQ codebooks for closed-set speaker identification, In GVQ training, speaker codebooks are optimized for vector groups rather than for individual vectors. An evaluation experiment has been conducted to compare the codebooks trained by the Linde-Buzo-Grey (LBG), the learning vector quantization (LVQ), and the GVQ algorithms. It is shown that the frame scores from the GVQ trained codebooks are Less correlated, therefore, the sentence level speaker identification rate increases more quickly with the length of test sentences.
引用
收藏
页码:353 / 356
页数:4
相关论文
共 10 条
[1]  
[Anonymous], P ESCA WORKSH AUT SP
[2]  
CHEN MS, 1993, IEEE T SIGNAL PROCES, V41, P398, DOI 10.1109/TSP.1993.193163
[3]  
Fukunaga K., 1990, INTRO STAT PATTERN R, P51
[4]  
HE J, 1994, P INT C SPOK LANG PR, P1851
[5]  
HE J, P EUROSPEECH 97 RHOD, V2, P959
[6]   THE SELF-ORGANIZING MAP [J].
KOHONEN, T .
PROCEEDINGS OF THE IEEE, 1990, 78 (09) :1464-1480
[7]   ALGORITHM FOR VECTOR QUANTIZER DESIGN [J].
LINDE, Y ;
BUZO, A ;
GRAY, RM .
IEEE TRANSACTIONS ON COMMUNICATIONS, 1980, 28 (01) :84-95
[8]   SIGNAL MODELING TECHNIQUES IN SPEECH RECOGNITION [J].
PICONE, JW .
PROCEEDINGS OF THE IEEE, 1993, 81 (09) :1215-1247
[9]   ROBUST TEXT-INDEPENDENT SPEAKER IDENTIFICATION USING GAUSSIAN MIXTURE SPEAKER MODELS [J].
REYNOLDS, DA ;
ROSE, RC .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1995, 3 (01) :72-83
[10]  
ROSENBERG AE, 1987, COMPUTER SPEECH LANG, V22, P143