RAPID BOOTSTRAPPING OF A UKRAINIAN LARGE VOCABULARY CONTINUOUS SPEECH RECOGNITION SYSTEM

被引:0
作者
Schlippe, Tim [1 ]
Volovyk, Mykola [1 ]
Yurchenko, Kateryna [1 ]
Schultz, Tanja [1 ]
机构
[1] Karlsruhe Inst Technol, Cognit Syst Lab, D-76021 Karlsruhe, Germany
来源
2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2013年
关键词
speech recognition; rapid language adaptation; Ukrainian; Slavic language; pronunciation dictionary;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We report on our efforts toward an LVCSR system for the Slavic language Ukrainian. We describe the Ukrainian text and speech database recently collected as a part of our GlobalPhone corpus [1] with our Rapid Language Adaptation Toolkit [2]. The data was complemented by a large collection of text data crawled from various Ukrainian websites. For the production of the pronunciation dictionary, we investigate strategies using grapheme-to-phoneme (g2p) models derived from existing dictionaries of other languages, thereby reducing severely the necessary manual effort. Russian and Bulgarian g2p models even decrease the number of pronunciation rules to one fifth. We achieve significant improvement by applying state-of-the art techniques for acoustic modeling and our day-wise text collection and language model interpolation strategy [3]. Our best system achieves a word error rate of 11.21 % on the test set on read newspaper speech.
引用
收藏
页码:7329 / 7333
页数:5
相关论文
共 30 条
  • [1] [Anonymous], 2013, ICASSP
  • [2] [Anonymous], 2001, Ukrainian Population Census
  • [3] [Anonymous], ICASSP
  • [4] [Anonymous], 2010, INTERSPEECH
  • [5] [Anonymous], ICASSP
  • [6] [Anonymous], 2010, INTERSPEECH
  • [7] [Anonymous], PRASA
  • [8] Besling S., 1994, KONVENS
  • [9] Bilous T., 2005, IPA UKRAINIAN
  • [10] Bisani M., 2008, SPEECH COMMUNICATION