MULTI-STYLE MLP FEATURES FOR BN TRANSCRIPTION

被引:6
|
作者
Le, Viet-Bac [1 ]
Lamel, Lori [1 ]
Gauvain, Jean-Luc [1 ]
机构
[1] LIMSI CNRS, Spoken Language Proc Grp, F-91403 Orsay, France
来源
2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING | 2010年
关键词
MLP features; condition-specific adaptation; BN transcription;
D O I
10.1109/ICASSP.2010.5495116
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
It has become common practice to adapt acoustic models to specific-conditions (gender, accent, bandwidth) in order to improve the performance of speech-to-text (STT) transcription systems. With the growing interest in the use of discriminative features produced by a multi layer perceptron (MLP) in such systems, the question arise of whether it is necessary to specialize the MLP to particular conditions, and if so, how to incorporate the condition-specific MLP features in the system. This paper explores three approaches (adaptation, full training, and feature merging) to use condition-specific MLP features in a state-of-the-art BN STT system for French. The third approach without condition-specific adaptation was found to outperform the original models with condition-specific adaptation, and was found to perform almost as well as full training of multiple condition-specific HMMs.
引用
收藏
页码:4866 / 4869
页数:4
相关论文
empty
未找到相关数据