Syllable Based Language Model for Large Vocabulary Continuous Speech Recognition of Polish

Cited by: 0

Authors:

Majewski, Piotr [1]

Affiliations:

[1] Univ Lodz, Fac Math & Comp Sci, PL-90238 Lodz, Poland

Source:

TEXT, SPEECH AND DIALOGUE, PROCEEDINGS | 2008 / Volume 5246

Keywords:

Polish; large vocabulary continuous speech recognition; language modeling; sub-word units; syllable-based units;

DOI:

Not available

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Most state-of-the-art large vocabulary continuous speech recognition systems use word-based n-gram language models. Such models are not an optimal solution for inflectional or agglutinative languages. Polish is a highly inflectional language and requires very large corpora to build an adequate language model with a small out-of-vocabulary ratio. We propose a syllable-based language model, which is better suited to a highly inflectional language like Polish. When resources are scarce (i.e., small corpora), the syllable-based model outperforms word-based models in terms of the number of out-of-vocabulary units (syllables in our model). Such a model is an approximation of the morpheme-based model for Polish. In this paper, we present evaluation results for the syllable-based model and its usefulness in speech recognition tasks.
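The out-of-vocabulary comparison described in the abstract can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the paper: the function names `oov_rate` and `naive_chunks`, the tiny word lists, and above all the splitter, which simply cuts after each vowel letter and is not a real Polish syllabification algorithm. The sketch only shows why sub-word units tend to cover unseen inflected forms better than whole words when the training data is small.

```python
# Hypothetical illustration only: not the author's method or data.

VOWELS = set("aąeęioóuy")  # Polish vowel letters (assumed sufficient for this toy)


def oov_rate(train_units, test_units):
    """Return the fraction of test tokens whose unit never occurs in training."""
    vocab = set(train_units)
    test_units = list(test_units)
    if not test_units:
        return 0.0
    missing = sum(1 for unit in test_units if unit not in vocab)
    return missing / len(test_units)


def naive_chunks(word):
    """Toy splitter: close a chunk after every vowel letter.

    Trailing consonants are attached to the last chunk. This is a crude
    stand-in for syllabification, not a linguistically correct one.
    """
    chunks, current = [], ""
    for ch in word:
        current += ch
        if ch in VOWELS:
            chunks.append(current)
            current = ""
    if current:
        if chunks:
            chunks[-1] += current
        else:
            chunks.append(current)
    return chunks


def to_chunks(words):
    """Flatten a list of words into a list of sub-word chunks."""
    return [chunk for word in words for chunk in naive_chunks(word)]


# Tiny made-up "corpora" of Polish word forms.
train_words = ["kota", "domy", "mama"]
test_words = ["mamy", "tama"]

word_oov = oov_rate(train_words, test_words)
chunk_oov = oov_rate(to_chunks(train_words), to_chunks(test_words))
print(f"word-level OOV rate:  {word_oov:.2f}")
print(f"chunk-level OOV rate: {chunk_oov:.2f}")
```

In this constructed example neither test word occurs in the training list, yet every chunk of the test words does, so the chunk-level rate is lower. Real corpora and a proper syllabifier would give different figures; the numbers here say nothing about the paper's results.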


Pages: 397-401

Page count: 5

50 items in total

[31] Qualitative Evaluation of Language Model Rescoring in Automatic Speech Recognition [J].

Roux, Thibault Baneras ;

Rouvier, Mickael ;

Wottawa, Jane ;

Dufour, Richard .

INTERSPEECH 2022, 2022, :3968-3972

[32] A SYNCHRONIZED PRUNING COMPOSITION ALGORITHM OF WEIGHTED FINITE STATE TRANSDUCERS FOR LARGE VOCABULARY SPEECH RECOGNITION [J].

He, Zhiyang ;

Lv, Ping ;

Li, Wei ;

Wu, Ji .

2012 8TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING, 2012, :11-15

[33] MULTI-LEVEL LANGUAGE MODELING AND DECODING FOR OPEN VOCABULARY END-TO-END SPEECH RECOGNITION [J].

Hori, Takaaki ;

Watanabe, Shinji ;

Hershey, John R. .

2017 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), 2017, :287-293

[34] Large-vocabulary continuous speech recognition using linear lexicon search and 1-best approximation tree-structured lexicon search [J].

Kitaoka, Norihide ;

Takahashi, Nobutoshi ;

Nakagawa, Seiichi .

Systems and Computers in Japan, 2005, 36 (07) :31-39

[35] DISCRIMINATIVELY ESTIMATED JOINT ACOUSTIC, DURATION, AND LANGUAGE MODEL FOR SPEECH RECOGNITION [J].

Lehr, Maider ;

Shafran, Izhak .

2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, :5542-5545

[36] Web-based possibilistic language models for automatic speech recognition [J].

Oger, Stanislas ;

Linares, Georges .

COMPUTER SPEECH AND LANGUAGE, 2014, 28 (04) :923-939

[37] ANALYSIS OF MORPH-BASED LANGUAGE MODELING AND SPEECH RECOGNITION IN SLOVAK [J].

Stas, Jan ;

Hladek, Daniel ;

Juhar, Jozef ;

Zlacky, Daniel .

ADVANCES IN ELECTRICAL AND ELECTRONIC ENGINEERING, 2012, 10 (04) :291-296

[38] Memory Augmented Lookup Dictionary based Language Modeling for Automatic Speech Recognition [J].

Feng, Yukun ;

Tu, Ming ;

Xia, Rui ;

Huang, Chuanzeng ;

Wang, Yuxuan .

INTERSPEECH 2023, 2023, :481-485

[39] Multi-Domain Recurrent Neural Network Language Model for Medical Speech Recognition [J].

Tilk, Ottokar ;

Alumaee, Tanel .

HUMAN LANGUAGE TECHNOLOGIES - THE BALTIC PERSPECTIVE, BALTIC HLT 2014, 2014, 268 :149-+

[40] Discriminatively trained continuous Hindi speech recognition system using interpolated recurrent neural network language modeling [J].

Mohit Dua ;

R. K. Aggarwal ;

Mantosh Biswas .

Neural Computing and Applications, 2019, 31 :6747-6755
