Investigating Long Short-Term Memory Networks for Various Pattern Recognition Problems

被引:0
作者
Otte, Sebastian [1 ]
Liwicki, Marcus [2 ]
Krechel, Dirk [1 ]
机构
[1] Univ Appl Sci Wiesbaden, Wiesbaden, Germany
[2] German Res Ctr, Kaiserslautern, Germany
来源
MACHINE LEARNING AND DATA MINING IN PATTERN RECOGNITION, MLDM 2014 | 2014年 / 8556卷
关键词
RECURRENT NETWORKS; MODEL; LSTM;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The purpose of this paper is to further investigate how and why long short-term memory networks (LSTM) perform so well on several pattern recognition problems. Our contribution is three-fold. First, we describe the main highlights of the LSTM architecture, especially when compared to standard recurrent neural networks (SRN). Second, we give an overview of previous studies to analyze the behavior of LSTMs on toy problems and some realistic data in the speech recognition domain. Third, the behavior of LSTMs is analyzed on novel problems which are relevant for pattern recognition research. Thereby, we analyze the ability of LSTMs to classify long sequences containing specific patterns at an arbitrary position on iteratively increasing the complexity of the problem under constant training conditions. We also compare the behavior of LSTMs to SRNs for text vs. non-text sequence classification on a real-world problem with significant non-local time-dependencies where the features are computed only locally. Finally, we discuss why LSTMs with standard training methods are not suited for the task of signature verification.
引用
收藏
页码:484 / 497
页数:14
相关论文
共 20 条
[1]  
Eck D, 2002, NEURAL NETWORKS FOR SIGNAL PROCESSING XII, PROCEEDINGS, P747, DOI 10.1109/NNSP.2002.1030094
[2]   Learning precise timing with LSTM recurrent networks [J].
Gers, FA ;
Schraudolph, NN ;
Schmidhuber, J .
JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (01) :115-143
[3]   LSTM recurrent networks learn simple context-free and context-sensitive languages [J].
Gers, FA ;
Schtmidhuber, J .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2001, 12 (06) :1333-1340
[4]  
Gers FA, 1999, IEE CONF PUBL, P850, DOI [10.1049/cp:19991218, 10.1162/089976600300015015]
[5]   Framewise phoneme classification with bidirectional LSTM and other neural network architectures [J].
Graves, A ;
Schmidhuber, J .
NEURAL NETWORKS, 2005, 18 (5-6) :602-610
[6]  
Graves A., 2008, THESIS
[7]  
Graves A, 2012, STUD COMPUT INTELL, V385, P47
[8]   A Novel Connectionist System for Unconstrained Handwriting Recognition [J].
Graves, Alex ;
Liwicki, Marcus ;
Fernandez, Santiago ;
Bertolami, Roman ;
Bunke, Horst ;
Schmidhuber, Juergen .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2009, 31 (05) :855-868
[9]  
Hochreiter S, 1997, NEURAL COMPUT, V9, P1735, DOI [10.1162/neco.1997.9.1.1, 10.1007/978-3-642-24797-2]
[10]   Fast model-based protein homology detection without alignment [J].
Hochreiter, Sepp ;
Heusel, Martin ;
Obermayer, Klaus .
BIOINFORMATICS, 2007, 23 (14) :1728-1736