MODELING CHARACTERS VERSUS WORDS FOR MANDARIN SPEECH RECOGNITION

被引:7
作者
Luo, Jun [1 ]
Lamel, Lori [1 ]
Gauvain, Jean-Luc [1 ]
机构
[1] LIMSI, CNRS, Spoken Language Proc Grp, F-91403 Orsay, France
来源
2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS | 2009年
关键词
Speech recognition; language modeling; Mandarin Chinese; speech-to-text transcription;
D O I
10.1109/ICASSP.2009.4960586
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Word based models are widely used in speech recognition since they typically perform well. However, the question of whether it is better to use a word-based or a character-based model warrants being for the Mandarin Chinese language. Since Chinese is written without any spaces or word delimiters, a word segmentation algorithm is applied in a pre-processing step prior to training a word-based language model. Chinese characters carry meaning and speakers are free to combine characters to construct new words. This suggests that character information can also be useful in communication. This paper explores both word-based and character-based models, and their complementarity. Although word-based modeling is found to outperform character-based modeling, increasing the vocabulary size from 56k to 160k words did not lead to a gain in performance. Results are reported for the Gale Mandarin speech-to-text task.
引用
收藏
页码:4325 / 4328
页数:4
相关论文
共 10 条
  • [1] [Anonymous], 2006, Multilingual speech processing
  • [2] CHEN L, ICSLP 2000, V2, P1015
  • [3] CHEN SF, 1996, 34 ANN M ACL SOM NEW, P310
  • [4] Cheng KS, 1999, J AM SOC INFORM SCI, V50, P218, DOI 10.1002/(SICI)1097-4571(1999)50:3<218::AID-ASI4>3.0.CO
  • [5] 2-1
  • [6] FISCUS J, 1997, POSTPROCESSING SYSTE
  • [7] GREZL F, 2008, ICASSP 08
  • [8] Continuous space language models
    Schwenk, Holger
    [J]. COMPUTER SPEECH AND LANGUAGE, 2007, 21 (03) : 492 - 518
  • [9] SPROAT R, 1996, COMPUT LINGUIST, V22, P218
  • [10] WU D, ANLP 94, P180