GENERATING COMPOUND WORDS WITH HIGH ORDER N-GRAM INFORMATION IN LARGE VOCABULARY SPEECH RECOGNITION SYSTEMS

Cited by: 0

Authors:

Zhau, Lie

Shi, Qin

Qin, Yang

Affiliation:

Source:

2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING | 2011

Keywords:

speech recognition; compound words; high order; vocabulary; gradient criterion;

DOI:

Not available

Chinese Library Classification (CLC) number:

O42 [Acoustics];

Discipline classification code:

070206 ; 082403 ;

Abstract:

In this work we concentrate on generating compound words with high order n-gram information for speech recognition. Most existing compound word generation methods consider only bi-gram information. They succeed in improving the performance of bi-gram models but do not work well in higher order n-gram cases. Since 3-gram and 4-gram language models are now commonly used, we present a high order n-gram based computation, called the gradient criterion, that generates compound words automatically in an exact way. We test this method on the Mandarin Open Voice Search (OVS) task and obtain a 0.62% absolute improvement over the 16.44% baseline. This result also outperforms the traditional mutual information based methods. Furthermore, we test the history effect and the prediction effect of this criterion and find that the history effect plays a more important role in the decoding task.
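For orientation, the bi-gram baseline that the abstract contrasts against can be sketched as follows. This is not the paper's gradient criterion, whose formula the abstract does not give. It is a minimal illustration that assumes the "traditional mutual information based methods" rank adjacent word pairs by pointwise mutual information (PMI) and merge the top pairs into compound tokens. The function names, the `min_count` threshold, and the `_` joiner are all hypothetical.

```python
import math
from collections import Counter


def pmi_compound_candidates(sentences, min_count=2):
    """Rank adjacent word pairs by pointwise mutual information (PMI).

    Assumption: a bi-gram-only baseline; this is NOT the gradient criterion.
    `sentences` is a list of token lists.
    """
    unigrams = Counter()
    bigrams = Counter()
    for words in sentences:
        unigrams.update(words)
        bigrams.update(zip(words, words[1:]))

    total_unigrams = sum(unigrams.values())
    total_bigrams = sum(bigrams.values())

    scored = []
    for (w1, w2), count in bigrams.items():
        if count < min_count:
            continue  # skip rare pairs, whose PMI estimates are unreliable
        p_pair = count / total_bigrams
        p_w1 = unigrams[w1] / total_unigrams
        p_w2 = unigrams[w2] / total_unigrams
        scored.append(((w1, w2), math.log(p_pair / (p_w1 * p_w2))))

    # Highest PMI first: the strongest candidates for merging.
    scored.sort(key=lambda item: item[1], reverse=True)
    return scored


def merge_pair(words, pair, joiner="_"):
    """Replace each occurrence of `pair` in `words` with one compound token."""
    merged = []
    i = 0
    while i < len(words):
        if i + 1 < len(words) and (words[i], words[i + 1]) == pair:
            merged.append(words[i] + joiner + words[i + 1])
            i += 2
        else:
            merged.append(words[i])
            i += 1
    return merged


corpus = [
    ["new", "york", "is", "big"],
    ["i", "love", "new", "york"],
    ["new", "york", "city"],
]
candidates = pmi_compound_candidates(corpus, min_count=2)
best_pair = candidates[0][0]
retokenized = [merge_pair(sentence, best_pair) for sentence in corpus]
```

Because the score depends only on unigram and bi-gram counts, a criterion of this kind cannot see how a merge changes 3-gram or 4-gram contexts, which is the limitation the abstract describes.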


Pages: 5560-5563

Number of pages: 4
