A usage of the syllable unit based on morphological statistics in Korean large vocabulary continuous speech recognition system

被引:0
作者
Hyok-Chol Ri
机构
[1] KIM IL SUNG University,College of Information Science
来源
International Journal of Speech Technology | 2019年 / 22卷
关键词
Recognition unit; Language model; Morpheme; Syllable;
D O I
暂无
中图分类号
学科分类号
摘要
In large vocabulary continuous speech recognition (LVCSR), it is important in improving the system’s performance to determine reasonably the recognition unit. In Korean continuous speech recognition, a morph rather than a word is used basically as the recognition unit due to Korean’s agglutinative property and a good performance is provided by combining high-frequency morph sequences, which leading to an increase of vocabulary size and high out-of-vocabulary (OOV) rate. Sub-lexical units such as a syllable and a graphone are widely used for inflectional languages, while they have not been introduced successfully for Korean speech recognition, due to a weakness of their linguistic information. In this paper, we investigate a usage of a syllable unit to resolve a mismatch problem between the recognition unit and vocabulary size that have occurred frequently in Korean large vocabulary speech recognition. We apply the local segmentation into syllables based on morphological statistics and perform experiments using the language model (LM) constructed from mixed unit types of morpheme, combined morpheme and syllable. By the proposed model, an absolute reduction of around 0.4% in word error rate (WER) is obtained compared to a traditional LM consisting of morphemes and combined morphemes.
引用
收藏
页码:971 / 977
页数:6
相关论文
共 22 条
  • [1] Creutz M(2007)Morph-based speech recognition and modeling of out of- vocabulary words across languages ACM Transactions on Speech and Language Processing 5 3-243
  • [2] Hirsimäki T(2012)Morphological decomposition in Arabic ASR systems Computer Speech and Language 26 229-541
  • [3] Kurimo M(2006)Unlimited vocabulary speech recognition with morph language models applied to Finish Computer Speech and Language 20 515-684
  • [4] Puurula A(2010)Morpho-syntactic post-processing of N-best lists for improved French automatic speech recognition Computer Speech and Language 24 663-401
  • [5] Pylkkönen J(2008)Syllable based language model for large vocabulary continuous speech recognition of polish Text, Speech and Dialogue, ser. Lecture Notes in Computer Science 5246 397-537
  • [6] Siivola V(2007)Large vocabulary continuous speech recognition of an inflected language using stems and endings Speech Communication 49 452-608
  • [7] Varjokallio M(2006)Morphology-based language modeling for conversational Arabic speech recognition Computer Speech and Language 20 589-41
  • [8] Arisoy E(2003)Statistical language modeling based on variable-length sequences Computer Speech and Language 17 27-undefined
  • [9] Saraclar M(undefined)undefined undefined undefined undefined-undefined
  • [10] Stolcke A(undefined)undefined undefined undefined undefined-undefined