Lattice Decoding and Rescoring with Long-Span Neural Network Language Models

Cited by: 0
Authors
Sundermeyer, Martin [1]
Tueske, Zoltan [1]
Schlueter, Ralf [1]
Ney, Hermann [1, 2]
Affiliations
[1] Rhein Westfal TH Aachen, Human Language Technol & Pattern Recognit, Dept Comp Sci, Aachen, Germany
[2] LIMSI CNRS, Spoken Language Proc Grp, Paris, France
Source
15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4 | 2014
Keywords
speech recognition; language modeling; recurrent neural networks; long short-term memory; word lattices;
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
With long-span neural network language models, considerable improvements have been obtained in speech recognition. However, it is difficult to apply these models if the underlying search space is large. In this paper, we combine previous work on lattice decoding with long short-term memory (LSTM) neural network language models. By adding refined pruning techniques, we are able to reduce the search effort by a factor of three. Furthermore, we introduce two novel approximations for full lattice rescoring, which opens the potential of lattice-based speech recognition techniques. Compared to 1000-best lists, we find that we can increase the word error rate improvements obtained with LSTMs from 8.2 % to 10.7 % relative over a state-of-the-art baseline, while the resulting lattices are even considerably smaller. In addition, we investigate the use of LSTMs for Babel Assamese keyword search, obtaining significant improvements of 2.5 % relative.
Pages: 661-665
Number of pages: 5