COMPARISON OF HIDDEN MARKOV MODEL TECHNIQUES FOR AUTOMATIC SPEAKER VERIFICATION IN REAL-WORLD CONDITIONS

Cited by: 5
Authors
DEVETH, J
BOURLARD, H
Affiliations
[1] INT COMP SCI INST, BERKELEY, CA 94704 USA
[2] FAC POLYTECH MONS, B-7000 MONS, BELGIUM
Keywords
SPEAKER VERIFICATION; TELEPHONE SPEECH; LIMITED TRAINING DATA; TIED MULTI-GAUSSIAN HMMS; SINGLE GAUSSIAN HMMS;
DOI
10.1016/0167-6393(95)00015-G
Chinese Library Classification
O42 [Acoustics];
Discipline classification codes
070206 ; 082403 ;
Abstract
In this paper, we compare two alternative approaches for speaker verification based on hidden Markov model (HMM) technology: single Gaussian HMMs and different types of tied multi-Gaussian HMMs. To assess performance under real-world constraints, we tested each system on a database of connected digit strings recorded over local and long-distance telephone lines. In our experiments, tied-mixture models outperformed the single Gaussian approach provided that sufficient training data were available. However, the experiments also indicate that the single Gaussian HMM approach is to be preferred for real-world speaker verification when only limited amounts of training data are available. Results are discussed for both text-dependent and text-independent speaker verification.
Pages: 81-90
Page count: 10