GENERATING COMPOUND WORDS WITH HIGH ORDER N-GRAM INFORMATION IN LARGE VOCABULARY SPEECH RECOGNITION SYSTEMS

被引:0
|
作者
Zhau, Lie
Shi, Qin
Qin, Yang
机构
来源
2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING | 2011年
关键词
speech recognition; compound words; high order; vocabulary; gradient criterion;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this work we concentrate on generating compound words with high order n-gram information for speech recognition. In most existing compound words generation methods, only bi-gram information is considered. They are successful for improving the performance of bi-gram models but doesn't work well in higher order n-gram cases. Since nowadays 3-gram and 4-gram language models are commonly used, here we present a high order n-gram based computation to generate compound words automatically in an exact way which is called gradient criterion. We have this method tested on Mandarin Open Voice Search (OVS) task and make 0.62% absolute improvement over the 16.44% baseline. This result also outperforms the traditional mutual information based methods. Further the history effect and prediction effect of this criterion are tested and we find history effect plays a more important role in the decoding task.
引用
收藏
页码:5560 / 5563
页数:4
相关论文
共 21 条
  • [1] Class n-Gram Models for Very Large Vocabulary Speech Recognition of Finnish and Estonian
    Varjokallio, Matti
    Kurimo, Mikko
    Virpioja, Sami
    STATISTICAL LANGUAGE AND SPEECH PROCESSING, SLSP 2016, 2016, 9918 : 133 - 144
  • [2] Importance of High-Order N-Gram Models in Morph-Based Speech Recognition
    Hirsimaki, Teemu
    Pylkkonen, Janne
    Kurimo, Mikko
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2009, 17 (04): : 724 - 732
  • [3] TOPIC N-GRAM COUNT LANGUAGE MODEL ADAPTATION FOR SPEECH RECOGNITION
    Haidar, Md. Akmal
    O'Shaughnessy, Douglas
    2012 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2012), 2012, : 165 - 169
  • [4] Investigation on LSTM Recurrent N-gram Language Models for Speech Recognition
    Tueske, Zoltan
    Schlueter, Ralf
    Ney, Hermann
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 3358 - 3362
  • [5] Improving Mandarin End-to-End Speech Recognition With Word N-Gram Language Model
    Tian, Jinchuan
    Yu, Jianwei
    Weng, Chao
    Zou, Yuexian
    Yu, Dong
    IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 812 - 816
  • [6] Boosting systems for large vocabulary continuous speech recognition
    Saon, George
    Soltau, Hagen
    SPEECH COMMUNICATION, 2012, 54 (02) : 212 - 218
  • [7] Class-Based N-Gram Language Model for New Words Using Out-of-Vocabulary to In-Vocabulary Similarity
    Naptali, Welly
    Tsuchiya, Masatoshi
    Nakagawa, Seiichi
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2012, E95D (09) : 2308 - 2317
  • [8] Integrating Stress Information in Large Vocabulary Continuous Speech Recognition
    Ludusan, Bogdan
    Ziegler, Stefan
    Gravier, Guillaume
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 2641 - 2644
  • [9] Part of Speech Tagging Approach to Designing Compound Words for Arabic Continuous Speech Recognition Systems
    AbuZeina, Dia
    Elshafei, Moustafa
    Al-Khatib, Wasfi
    INFORMATICS ENGINEERING AND INFORMATION SCIENCE, PT IV, 2011, 254 : 330 - 338
  • [10] An N-gram Based Chinese Syllable Evaluation Approach for Speech Recognition Error Detection
    Wang, Xingjian
    Li, Lei
    IEEE NLP-KE 2009: PROCEEDINGS OF INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING, 2009, : 224 - 229