Syllable Based Language Model for Large Vocabulary Continuous Speech Recognition of Polish

Cited by: 0

Authors:

Majewski, Piotr [1]

Affiliations:

[1] Univ Lodz, Fac Math & Comp Sci, PL-90238 Lodz, Poland

Source:

TEXT, SPEECH AND DIALOGUE, PROCEEDINGS | 2008 / Vol. 5246

Keywords:

Polish; large vocabulary continuous speech recognition; language modeling; sub-word units; syllable-based units;

DOI:

Not available

Chinese Library Classification:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Most state-of-the-art large vocabulary continuous speech recognition systems use word-based n-gram language models. Such models are not an optimal solution for inflectional or agglutinative languages. Polish is a highly inflectional language and requires very large corpora to build an adequate language model with a small out-of-vocabulary ratio. We propose a syllable-based language model, which is better suited to a highly inflectional language like Polish. When resources are scarce (i.e., corpora are small), the syllable-based model outperforms word-based models in terms of the number of out-of-vocabulary units (syllables in our model). Such a model is an approximation of the morpheme-based model for Polish. In this paper, we present the results of an evaluation of the syllable-based model and its usefulness in speech recognition tasks.
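The out-of-vocabulary comparison described in the abstract can be illustrated with a small sketch. This is not the paper's method: the syllabifier below is a deliberately naive, hypothetical rule (each chunk is a run of consonants followed by a run of vowels, with any trailing consonants attached to the last chunk), and the toy word lists are invented for illustration only. It shows only how an OOV ratio might be computed at the word level and at the syllable level on the same data.

```python
import re

# Assumption: a crude vowel set for Polish orthography, used only for illustration.
VOWELS = "aąeęioóuy"
_CHUNK = re.compile(rf"[^{VOWELS}]*[{VOWELS}]+")

def naive_syllabify(word):
    """Hypothetical syllabifier: consonant run + vowel run per chunk.

    Trailing consonants are attached to the last chunk. Real Polish
    syllabification is more involved; this is only a stand-in.
    """
    chunks = _CHUNK.findall(word)
    if not chunks:
        return [word]  # no vowel found: keep the word as a single unit
    tail = word[len("".join(chunks)):]
    chunks[-1] += tail
    return chunks

def to_syllables(words):
    """Flatten a list of words into a list of syllable tokens."""
    return [s for w in words for s in naive_syllabify(w)]

def oov_rate(train_units, test_units):
    """Fraction of test tokens that never occur in the training units."""
    vocab = set(train_units)
    test_units = list(test_units)
    if not test_units:
        return 0.0
    unseen = sum(1 for u in test_units if u not in vocab)
    return unseen / len(test_units)

# Toy data (invented): inflected forms in the test set are unseen as words.
train_words = "dom domu kot kota".split()
test_words = "domami kotami".split()

word_oov = oov_rate(train_words, test_words)
syll_oov = oov_rate(to_syllables(train_words), to_syllables(test_words))
print(f"word OOV: {word_oov:.2f}, syllable OOV: {syll_oov:.2f}")
```

On this toy data both test words are unseen, while half of their syllables already occur in the training set, which mirrors the small-corpus effect the abstract describes. The numbers carry no empirical weight.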


Pages: 397-401

Number of pages: 5
