Minimum Kullback-Leibler distance based multivariate Gaussian feature adaptation for distant-talking speech recognition

被引:0
作者
Pan, Y [1 ]
Waibel, A [1 ]
机构
[1] Carnegie Mellon Univ, Interact Syst Labs, Pittsburgh, PA 15213 USA
来源
2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING | 2004年
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Multivariate Gaussian based speech compensation or mapping has been developed to reduce the mismatch between training and deployment conditions for robust speech recognition. The acoustic mapping procedure can be formulated as a feature space adaptation where input noisy signal is transformed by a multivariate Gaussian network. We propose a novel algorithm to update the network parameters based on minimizing the Kullback-Leibler distance between the core recognizer's acoustic model and transformed features. It is designed to achieve optimal overall system performance rather than MMSE on a specific feature domain. An online stochastic gradient descent learning rule is derived. We evaluate the performance of the new algorithm using JRTk Broadcast news system on a distance-talking speech corpus and compare its performance with that of previous MMSE based approaches. The experiments show the KL based approach is more effective for a large vocabulary continuous speech recognition (LVCSR) system.
引用
收藏
页码:1029 / 1032
页数:4
相关论文
共 4 条
[1]  
DENG L, 2000, P ICSLP
[2]  
LEGETTER CJ, 1995, COMPUTER SPEECH LANG, V9, P171
[3]  
MORENO PJ, 1995, P ICASSP
[4]  
Yu H., 2000, P ICSLP