Language-independent and language-adaptive acoustic modeling for speech recognition

Cited by: 215
Authors
Schultz, T [1]
Waibel, A
Affiliations
[1] Univ Karlsruhe, Interact Syst Labs, D-76131 Karlsruhe, Germany
[2] Carnegie Mellon Univ, Interact Syst Labs, Pittsburgh, PA 15213 USA
Keywords
language portability; multilingual acoustic models; large vocabulary continuous speech recognition; polyphone decision tree specialization (PDTS); GlobalPhone;
DOI
10.1016/S0167-6393(00)00094-7
Chinese Library Classification (CLC) number
O42 [Acoustics];
Discipline classification code
070206; 082403;
Abstract
With the distribution of speech technology products all over the world, the portability to new target languages becomes a practical concern. As a consequence, our research focuses on the question of how to port large vocabulary continuous speech recognition (LVCSR) systems in a fast and efficient way. More specifically, we want to estimate acoustic models for a new target language using speech data from varied source languages, but only limited data from the target language. For this purpose, we introduce different methods for multilingual acoustic model combination and a polyphone decision tree specialization procedure. Recognition results using language-dependent, language-independent, and language-adaptive acoustic models are presented and discussed in the framework of our GlobalPhone project, which investigates LVCSR systems in 15 languages. © 2001 Published by Elsevier Science B.V.
Pages: 31-51
Number of pages: 21