Syllable Based Language Model for Large Vocabulary Continuous Speech Recognition of Polish

被引:0
作者
Majewski, Piotr [1 ]
机构
[1] Univ Lodz, Fac Math & Comp Sci, PL-90238 Lodz, Poland
来源
TEXT, SPEECH AND DIALOGUE, PROCEEDINGS | 2008年 / 5246卷
关键词
Polish; large vocabulary continuous speech recognition; language modeling; sub-word units; syllable-based units;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Most of state-of-the-art large vocabulary continuous speech recognition systems use word-based n-gram language models. Such models are not optimal solution for inflectional or agglutinative languages. The Polish language is highly inflectional one and requires a very large corpora to create a sufficient language model with the small out-of-vocabulary ratio. We propose a syllable-based language model. which is better suited to highly inflectional language like Polish. In case of lack of resources (i.e. small corpora) syllable-based model outperforms word-based models in terms of number of out-of-vocabulary units (syllables in our model). Such model is an approximation of the morphene-based model for Polish. In our paper, we show results of evaluation of syllable based model and its usefulness in speech recognition tasks.
引用
收藏
页码:397 / 401
页数:5
相关论文
共 50 条
[31]   Qualitative Evaluation of Language Model Rescoring in Automatic Speech Recognition [J].
Roux, Thibault Baneras ;
Rouvier, Mickael ;
Wottawa, Jane ;
Dufour, Richard .
INTERSPEECH 2022, 2022, :3968-3972
[32]   A SYNCHRONIZED PRUNING COMPOSITION ALGORITHM OF WEIGHTED FINITE STATE TRANSDUCERS FOR LARGE VOCABULARY SPEECH RECOGNITION [J].
He, Zhiyang ;
Lv, Ping ;
Li, Wei ;
Wu, Ji .
2012 8TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING, 2012, :11-15
[33]   MULTI-LEVEL LANGUAGE MODELING AND DECODING FOR OPEN VOCABULARY END-TO-END SPEECH RECOGNITION [J].
Hori, Takaaki ;
Watanabe, Shinji ;
Hershey, John R. .
2017 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), 2017, :287-293
[34]   Large-vocabulary continuous speech recognition using linear lexicon search and 1-best approximation tree-structured lexicon search [J].
Kitaoka, Norihide ;
Takahashi, Nobutoshi ;
Nakagawa, Seiichi .
Systems and Computers in Japan, 2005, 36 (07) :31-39
[35]   DISCRIMINATIVELY ESTIMATED JOINT ACOUSTIC, DURATION, AND LANGUAGE MODEL FOR SPEECH RECOGNITION [J].
Lehr, Maider ;
Shafran, Izhak .
2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, :5542-5545
[36]   Web-based possibilistic language models for automatic speech recognition [J].
Oger, Stanislas ;
Linares, Georges .
COMPUTER SPEECH AND LANGUAGE, 2014, 28 (04) :923-939
[37]   ANALYSIS OF MORPH-BASED LANGUAGE MODELING AND SPEECH RECOGNITION IN SLOVAK [J].
Stas, Jan ;
Hladek, Daniel ;
Juhar, Jozef ;
Zlacky, Daniel .
ADVANCES IN ELECTRICAL AND ELECTRONIC ENGINEERING, 2012, 10 (04) :291-296
[38]   Memory Augmented Lookup Dictionary based Language Modeling for Automatic Speech Recognition [J].
Feng, Yukun ;
Tu, Ming ;
Xia, Rui ;
Huang, Chuanzeng ;
Wang, Yuxuan .
INTERSPEECH 2023, 2023, :481-485
[39]   Multi-Domain Recurrent Neural Network Language Model for Medical Speech Recognition [J].
Tilk, Ottokar ;
Alumaee, Tanel .
HUMAN LANGUAGE TECHNOLOGIES - THE BALTIC PERSPECTIVE, BALTIC HLT 2014, 2014, 268 :149-+
[40]   Discriminatively trained continuous Hindi speech recognition system using interpolated recurrent neural network language modeling [J].
Mohit Dua ;
R. K. Aggarwal ;
Mantosh Biswas .
Neural Computing and Applications, 2019, 31 :6747-6755