SPEECH RECOGNITION OF FOREIGN OUT-OF-VOCABULARY WORDS USING A HIERARCHICAL LANGUAGE MODEL

被引:0
|
作者
Yamamoto, Hirofumi [1 ]
Kikui, Genichiro [2 ]
Nakamura, Satoshi [1 ,2 ]
Sagisaka, Yoshinori [1 ,3 ]
机构
[1] Natl Inst Informat & Commun Technol, 2-2-2 Hikaridai, Seika, Kyoto, Japan
[2] ATR Spoken Language Commun Res Labs, Kyoto, Japan
[3] Waseda Univ, GITI, Tokyo, Japan
来源
INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5 | 2006年
关键词
Speech Recognition; Language model; Foreign word; Out-of-Vocabulalry word; Hierarchical language model;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes a new speech recognition scheme for foreign out-of-vocabulary words embedded in native-language speech. To recognize foreign names frequently observed in news speech or in translation speech, we adopted a hierarchical language model that had been successfully applied to OOV words covering native vocabularies. In this hierarchical language model, OOV vocabularies are modeled as a word-class model in the upper-layered model, and their statistical phonotactic constraints are modeled in the lower-layered model. Since extra statistics are needed to cover foreign words and their pronunciation differences, we have introduced two techniques. The first is to combine translation target language models and translation source statistics of OOVs using the hierarchical language model. The second is to automatically generate recognition target pronunciations from original pronunciations by syllable-to-syllable mapping. To confirm the validity of this recognition scheme, we have conducted speech recognition experiments using English speech including Japanese personal names as OOV words. The proposed method outperformed the existing algorithm using a lexicon consisting of all the words in the training set. Surprisingly, it achieved better OOV recognition results than the non-OOV condition where all the proper names in the test set are registered in the lexicon.
引用
收藏
页码:1870 / +
页数:2
相关论文
共 50 条
  • [11] Out-of-vocabulary Words Detection with Attention and CTC Alignments in an End-to-End ASR System
    Egorova, Ekaterina
    Vydana, Hari Krishna
    Burget, Lukas
    Cernocky, Jan
    INTERSPEECH 2021, 2021, : 2901 - 2905
  • [12] USING SYNTACTIC AND CONFUSION NETWORK STRUCTURE FOR OUT-OF-VOCABULARY WORD DETECTION
    Marin, Alex
    Kwiatkowski, Tom
    Ostendorf, Mari
    Zettlemoyer, Luke
    2012 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2012), 2012, : 159 - 164
  • [13] On the Adaptation of Foreign Language Speech Recognition Engines for Lithuanian Speech Recognition
    Rudzionis, Vytautas
    Maskeliunas, Rytis
    Rudzionis, Algimantas
    Ratkevicius, Kastytis
    BUSINESS INFORMATION SYSTEMS WORKSHOPS, 2009, 37 : 113 - +
  • [14] Improving the Performance of Out-of-vocabulary Word Rejection by Using Support Vector Machines
    Huang Shilei
    Xie Xiang
    Kuang Jingming
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1618 - 1621
  • [15] Automatic construction of FSA language model for middle-size vocabulary speech recognition
    Morimoto, Tsuyoshi
    Takahashi, Shin-ya
    IMECS 2007: INTERNATIONAL MULTICONFERENCE OF ENGINEERS AND COMPUTER SCIENTISTS, VOLS I AND II, 2007, : 614 - +
  • [16] Japanese Personal Name and Location Search for Spoken Utterances by Using Hierarchical Language Model of Speech Recognition
    Hu, Xinhui
    Wu, Youzheng
    Kashioka, Hideki
    RECENT ADVANCES OF ASIAN LANGUAGE PROCESSING TECHNOLOGIES, 2008, : 193 - 198
  • [17] Large vocabulary speech recognition with multispan statistical language models
    Bellegarda, JR
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2000, 8 (01): : 76 - 84
  • [18] Dialogue speech recognition by combining hierarchical topic classification and language model switching
    Lane, IR
    Kawahara, T
    Matsui, T
    Nakamura, S
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2005, E88D (03): : 446 - 454
  • [19] A large vocabulary continuous speech recognition system for Persian language
    Hossein Sameti
    Hadi Veisi
    Mohammad Bahrani
    Bagher Babaali
    Khosro Hosseinzadeh
    EURASIP Journal on Audio, Speech, and Music Processing, 2011
  • [20] Topic tracking language model for speech recognition
    Watanabe, Shinji
    Iwata, Tomoharu
    Hori, Takaaki
    Sako, Atsushi
    Ariki, Yasuo
    COMPUTER SPEECH AND LANGUAGE, 2011, 25 (02) : 440 - 461