Syllable Based Language Model for Large Vocabulary Continuous Speech Recognition of Polish

被引:0
作者
Majewski, Piotr [1 ]
机构
[1] Univ Lodz, Fac Math & Comp Sci, PL-90238 Lodz, Poland
来源
TEXT, SPEECH AND DIALOGUE, PROCEEDINGS | 2008年 / 5246卷
关键词
Polish; large vocabulary continuous speech recognition; language modeling; sub-word units; syllable-based units;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Most of state-of-the-art large vocabulary continuous speech recognition systems use word-based n-gram language models. Such models are not optimal solution for inflectional or agglutinative languages. The Polish language is highly inflectional one and requires a very large corpora to create a sufficient language model with the small out-of-vocabulary ratio. We propose a syllable-based language model. which is better suited to highly inflectional language like Polish. In case of lack of resources (i.e. small corpora) syllable-based model outperforms word-based models in terms of number of out-of-vocabulary units (syllables in our model). Such model is an approximation of the morphene-based model for Polish. In our paper, we show results of evaluation of syllable based model and its usefulness in speech recognition tasks.
引用
收藏
页码:397 / 401
页数:5
相关论文
共 50 条
[41]   Discriminatively trained continuous Hindi speech recognition system using interpolated recurrent neural network language modeling [J].
Dua, Mohit ;
Aggarwal, R. K. ;
Biswas, Mantosh .
NEURAL COMPUTING & APPLICATIONS, 2019, 31 (10) :6747-6755
[42]   Integrated exemplar-based template matching and statistical modeling for continuous speech recognition [J].
Sun, Xie ;
Zhao, Yunxin .
EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2014,
[43]   Integrated exemplar-based template matching and statistical modeling for continuous speech recognition [J].
Xie Sun ;
Yunxin Zhao .
EURASIP Journal on Audio, Speech, and Music Processing, 2014
[44]   END-TO-END SPEECH RECOGNITION WITH WORD-BASED RNN LANGUAGE MODELS [J].
Hori, Takaaki ;
Cho, Jaejin ;
Watanabe, Shinji .
2018 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2018), 2018, :389-396
[45]   Language modeling in speech recognition for grammatical error detection based on neural machine translation [J].
Fu, Jiang ;
Chiba, Yuya ;
Nose, Takashi ;
Ito, Akinori .
ACOUSTICAL SCIENCE AND TECHNOLOGY, 2020, 41 (05) :788-791
[46]   MORPHOLOGY-BASED AND SUB-WORD LANGUAGE MODELING FOR TURKISH SPEECH RECOGNITION [J].
Sak, Hasim ;
Saraclar, Murat ;
Gungor, Tunga .
2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, :5402-5405
[47]   Unsupervised Language Model Adaptation for Automatic Speech Recognition of Broadcast News Using Web 2.0 [J].
Schlippe, Tim ;
Gren, Lukasz ;
Vu, Ngoc Thang ;
Schultz, Tanja .
14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, :2697-2701
[48]   Using Dependency Grammar Features in Whole Sentence Maximum Entropy Language Model for Speech Recognition [J].
Ruokolainen, Teemu ;
Alumaee, Tanel ;
Dobrinkat, Marcus .
HUMAN LANGUAGE TECHNOLOGIES - THE BALTIC PERSPECTIVE, 2010, 219 :73-79
[49]   EXPLOITING DIFFERENT WORD CLUSTERINGS FOR CLASS-BASED RNN LANGUAGE MODELING IN SPEECH RECOGNITION [J].
Song, Minguang ;
Zhao, Yunxin ;
Wang, Shaojun .
2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, :5735-5739
[50]   Bag-of-Words Input for Long History Representation in Neural Network-based Language Models for Speech Recognition [J].
Irie, Kazuki ;
Schlueter, Ralf ;
Ney, Hermann .
16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, :2371-2375