Construction of model-space constraints

被引:0
作者
Nguyen, P [1 ]
Rigazio, L [1 ]
Wellekens, C [1 ]
Junqua, JC [1 ]
机构
[1] Panason Speech Technol Lab, Santa Barbara, CA USA
来源
ASRU 2001: IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, CONFERENCE PROCEEDINGS | 2001年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
HMM systems exhibit a large amount of redundancy. To this end, a technique called Eigenvoices was found to be very effective for speaker adaptation. The correlation between HMM parameters is exploited via a linear constraint called eigenspace. This constraint is obtained through a PCA analysis of the training speakers. In this paper, we show how PCA can be linked to the maximum-likelihood criterion. Then, we extend the method to LDA transformations and piecewise linear constraints. On the Wall Street Journal (WSJ) dictation task, we obtain 1.7% WER improvement (15% relative) when using self-adaptation.
引用
收藏
页码:69 / 72
页数:4
相关论文
共 8 条
[1]  
BACCHIAN M, 2000, P ICSLP BEIJ CHIN OC, V4, P536
[2]   Cluster adaptive training of hidden Markov models [J].
Gales, MJF .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2000, 8 (04) :417-428
[3]  
Hart P.E., 1973, Pattern recognition and scene analysis
[4]   Rapid speaker adaptation in eigenvoice space [J].
Kuhn, R ;
Junqua, JC ;
Nguyen, P ;
Niedzielski, N .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2000, 8 (06) :695-707
[5]   MAXIMUM-LIKELIHOOD LINEAR-REGRESSION FOR SPEAKER ADAPTATION OF CONTINUOUS DENSITY HIDDEN MARKOV-MODELS [J].
LEGGETTER, CJ ;
WOODLAND, PC .
COMPUTER SPEECH AND LANGUAGE, 1995, 9 (02) :171-185
[6]  
NGUYEN P, 1999, P EUR SEP, V6, P2519
[7]  
NGUYEN P, 2000, P ICSLP BEIJ CHIN OC, V4, P286
[8]  
WANG N, 2001, P ICASSP, V1, P317