COMPARISON OF FEEDFORWARD AND RECURRENT NEURAL NETWORK LANGUAGE MODELS

Cited by: 0
Authors
Sundermeyer, M. [1]
Oparin, I. [2]
Gauvain, J.-L. [2]
Freiberg, B. [1]
Schlueter, R. [1]
Ney, H. [1, 2]
Affiliations
[1] Rhein Westfal TH Aachen, Dept Comp Sci, Human Language Technol & Pattern Recognit, Aachen, Germany
[2] CNRS, LIMSI, Spoken Language Proc Grp, F-75700 Paris, France
Source
2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2013
Keywords
Automatic speech recognition; feedforward neural networks; recurrent neural networks
DOI
Not available
Chinese Library Classification
O42 [Acoustics];
Discipline classification codes
070206 ; 082403 ;
Abstract
Research on language modeling for speech recognition has increasingly focused on the application of neural networks. Two competing concepts have been developed: On the one hand, feedforward neural networks representing an n-gram approach, on the other hand recurrent neural networks that may learn context dependencies spanning more than a fixed number of predecessor words. To the best of our knowledge, no comparison has been carried out between feedforward and state-of-the-art recurrent networks when applied to speech recognition. This paper analyzes this aspect in detail on a well-tuned French speech recognition task. In addition, we propose a simple and efficient method to normalize language model probabilities across different vocabularies, and we show how to speed up training of recurrent neural networks by parallelization.
Pages: 8430-8434
Page count: 5