EFFICIENT LATTICE RESCORING USING RECURRENT NEURAL NETWORK LANGUAGE MODELS

被引:0
|
作者
Liu, X. [1 ]
Wang, Y. [1 ]
Chen, X. [1 ]
Gales, M. J. F. [1 ]
Woodland, P. C. [1 ]
机构
[1] Univ Cambridge, Dept Engn, Cambridge CB2 1PZ, England
来源
2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2014年
关键词
recurrent neural network; language model; speech recognition;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Recurrent neural network language models (RNNLM) have become an increasingly popular choice for state-of-the-art speech recognition systems due to their inherently strong generalization performance. As these models use a vector representation of complete history contexts, RNNLMs are normally used to rescore N-best lists. Motivated by their intrinsic characteristics, two novel lattice rescoring methods for RNNLMs are investigated in this paper. The first uses an n-gram style clustering of history contexts. The second approach directly exploits the distance measure between hidden history vectors. Both methods produced 1-best performance comparable with a 10k-best rescoring baseline RNNLM system on a large vocabulary conversational telephone speech recognition task. Significant lattice size compression of over 70% and consistent improvements after confusion network (CN) decoding were also obtained over the N-best rescoring approach.
引用
收藏
页数:5
相关论文
共 50 条
  • [1] Two Efficient Lattice Rescoring Methods Using Recurrent Neural Network Language Models
    Liu, Xunying
    Chen, Xie
    Wang, Yongqiang
    Gales, Mark J. F.
    Woodland, Philip C.
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, 24 (08) : 1438 - 1449
  • [2] Lattice Decoding and Rescoring with Long-Span Neural Network Language Models
    Sundermeyer, Martin
    Tueske, Zoltcin
    Schlueter, Ralf
    Ney, Hermann
    15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 661 - 665
  • [3] Unsupervised and Efficient Vocabulary Expansion for Recurrent Neural Network Language Models in ASR
    Khassanov, Yerbolat
    Chng, Eng Siong
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 3343 - 3347
  • [4] DISCRIMINATIVE METHOD FOR RECURRENT NEURAL NETWORK LANGUAGE MODELS
    Tachioka, Yuuki
    Watanabe, Shinji
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 5386 - 5390
  • [5] RECURRENT NEURAL NETWORK LANGUAGE MODELS FOR KEYWORD SEARCH
    Chen, X.
    Ragni, A.
    Vasilakes, J.
    Liu, X.
    Knill, K.
    Gales, M. J. F.
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 5775 - 5779
  • [6] SCALING RECURRENT NEURAL NETWORK LANGUAGE MODELS
    Williams, Will
    Prasad, Niranjani
    Mrva, David
    Ash, Tom
    Robinson, Tony
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 5391 - 5395
  • [7] Efficient GPU-based Training of Recurrent Neural Network Language Models Using Spliced Sentence Bunch
    Chen, X.
    Wang, Y.
    Liu, X.
    Gales, M. J. F.
    Woodland, P. C.
    15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 641 - 645
  • [8] Efficient Training and Evaluation of Recurrent Neural Network Language Models for Automatic Speech Recognition
    Chen, Xie
    Liu, Xunying
    Wang, Yongqiang
    Gales, Mark J. F.
    Woodland, Philip C.
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, 24 (11) : 2146 - 2157
  • [9] PARAPHRASTIC RECURRENT NEURAL NETWORK LANGUAGE
    Liu, X.
    Chen, X.
    Gales, M. J. F.
    Woodland, P. C.
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 5406 - 5410
  • [10] On Efficient Training of Word Classes and Their Application to Recurrent Neural Network Language Models
    Botros, Rami
    Irie, Kazuki
    Sundermeyer, Martin
    Ney, Hermann
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 1443 - 1447