SPEECH RECOGNITION OF FOREIGN OUT-OF-VOCABULARY WORDS USING A HIERARCHICAL LANGUAGE MODEL

被引:0
作者
Yamamoto, Hirofumi [1 ]
Kikui, Genichiro [2 ]
Nakamura, Satoshi [1 ,2 ]
Sagisaka, Yoshinori [1 ,3 ]
机构
[1] Natl Inst Informat & Commun Technol, 2-2-2 Hikaridai, Seika, Kyoto, Japan
[2] ATR Spoken Language Commun Res Labs, Kyoto, Japan
[3] Waseda Univ, GITI, Tokyo, Japan
来源
INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5 | 2006年
关键词
Speech Recognition; Language model; Foreign word; Out-of-Vocabulalry word; Hierarchical language model;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes a new speech recognition scheme for foreign out-of-vocabulary words embedded in native-language speech. To recognize foreign names frequently observed in news speech or in translation speech, we adopted a hierarchical language model that had been successfully applied to OOV words covering native vocabularies. In this hierarchical language model, OOV vocabularies are modeled as a word-class model in the upper-layered model, and their statistical phonotactic constraints are modeled in the lower-layered model. Since extra statistics are needed to cover foreign words and their pronunciation differences, we have introduced two techniques. The first is to combine translation target language models and translation source statistics of OOVs using the hierarchical language model. The second is to automatically generate recognition target pronunciations from original pronunciations by syllable-to-syllable mapping. To confirm the validity of this recognition scheme, we have conducted speech recognition experiments using English speech including Japanese personal names as OOV words. The proposed method outperformed the existing algorithm using a lexicon consisting of all the words in the training set. Surprisingly, it achieved better OOV recognition results than the non-OOV condition where all the proper names in the test set are registered in the lexicon.
引用
收藏
页码:1870 / +
页数:2
相关论文
共 50 条
[41]   Language Model for Speech Recognition of Power Grid Dispatching Based on BERT [J].
Chen L. ;
Zheng W. ;
Yu H. ;
Fu J. ;
Liu H. ;
Xia J. .
Dianwang Jishu/Power System Technology, 2021, 45 (08) :2955-2961
[42]   LANGUAGE MODEL BOOTSTRAPPING USING NEURAL MACHINE TRANSLATION FOR CONVERSATIONAL SPEECH RECOGNITION [J].
Punjabi, Surabhi ;
Arsikere, Harish ;
Garimella, Sri .
2019 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU 2019), 2019, :487-493
[43]   Research on Syllable-Based Language Model in Malay Speech Recognition [J].
Wei, Xiangfeng ;
Zhang, Quan ;
Yuan, Yi .
2022 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP 2022), 2022, :150-155
[44]   A study of speech recognition based on RNN-RBM language model [J].
Li, Yaxiong ;
Zhang, Jianqiang ;
Pan, Deng ;
Hu, Dan .
Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2014, 51 (09) :1936-1944
[45]   Feature Extraction Techniques with Analysis of Confusing Words for Speech Recognition in the Hindi Language [J].
Bhatt, Shobha ;
Jain, Anurag ;
Dev, Amita .
WIRELESS PERSONAL COMMUNICATIONS, 2021, 118 (04) :3303-3333
[46]   Feature Extraction Techniques with Analysis of Confusing Words for Speech Recognition in the Hindi Language [J].
Shobha Bhatt ;
Anurag Jain ;
Amita Dev .
Wireless Personal Communications, 2021, 118 :3303-3333
[47]   Subspace Gaussian mixture based language modeling for large vocabulary continuous speech recognition [J].
Sun, Ri Hyon ;
Chol, Ri Jong .
SPEECH COMMUNICATION, 2020, 117 :21-27
[48]   A language model using variable length tokens for open-vocabulary Hangul text recognition [J].
Ryu, SH ;
Kim, JH .
PATTERN RECOGNITION, 2004, 37 (07) :1549-1552
[49]   Recognition of voice commands using adaptation of foreign language speech recognizer via selection of phonetic transcriptions [J].
Maskeliunas, Rytis ;
Rudzionis, Vytautas .
OPEN ENGINEERING, 2011, 1 (02) :181-188
[50]   Hiligaynon Language 5-Word Vocabulary Speech Recognition Using Mel Frequency Cepstrum Coefficients and Genetic Algorithm [J].
Billones, Robert Kerwin C. ;
Dadios, Elmer P. .
2014 INTERNATIONAL CONFERENCE ON HUMANOID, NANOTECHNOLOGY, INFORMATION TECHNOLOGY, COMMUNICATION AND CONTROL, ENVIRONMENT AND MANAGEMENT (HNICEM), 2014,