Boosting the deep multidimensional long-short-term memory network for handwritten recognition systems

被引:14
作者
Castro, Dayvid [1 ]
Bezerra, Byron L. D. [1 ]
Valenca, Meuser [1 ]
机构
[1] Univ Pernambuco, Polytech Sch Pernambuco, Recife, PE, Brazil
来源
PROCEEDINGS 2018 16TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR) | 2018年
关键词
MDLSTM; handwriting recognition; recurrent neural networks;
D O I
10.1109/ICFHR-2018.2018.00031
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
One of the main challenges in the handwriting recognition area lies in identifying complete lines of handwritten text. In this paper, we propose a handwriting recognition system based on a deep multidimensional long-short-term memory (MDLSTM) network within a hybrid hidden Markov model framework. The MDLSTM architecture was elaborated to enhance the recognition performance and decrease the recognition time. Accordingly, we present modifications regarding the layers order and the number of pooling layers compared to a standard MDLSTM model. Since the results reported in the literature for deeper MDLSTM architectures relies on optimizing the network width with a fixed depth, we investigate the trade-off between both these properties to obtain an optimal topology. The system was evaluated with English handwritten text lines from the IAM database and the experiments demonstrated that the proposed MDLSTM architecture was able to maintain a robust recognition performance (around 3.6% CER and 10.5% WER) and present significant speedups, approximately 48% and 32% faster than the state-of-the-art MDLSTM optical model, regarding the learning and classification times, respectively. The full system including a decoder with linguistic knowledge presents competitive results with the state-of-the-art.
引用
收藏
页码:127 / 132
页数:6
相关论文
共 30 条
[1]  
Sanchez JA, 2016, INT CONF FRONT HAND, P630, DOI [10.1109/ICFHR.2016.0120, 10.1109/ICFHR.2016.112]
[2]  
[Anonymous], 2008, Advances in neural information processing systems, DOI DOI 10.1007/978-1-4471-4072-6_12
[3]  
[Anonymous], 2016, P 4 INT C LEARN REPR
[4]  
[Anonymous], 2011, WORKSH AUT SPEECH RE
[5]  
[Anonymous], 2015, THESIS
[6]  
Bauer L., 1993, MANUAL INFORM ACCOMP
[7]  
Bluche T., 2017, P 13 INT C DOC AN RE, P13
[8]  
Bluche T., 2017, HANDWRITING RECOGNIT, P113
[9]   The A2iA Arabic Handwritten Text Recognition System at the OpenHaRT2013 Evaluation [J].
Bluche, Theodore ;
Louradour, Jerome ;
Knibbe, Maxime ;
Moysset, Bastien ;
Benzeghiba, Mohamed Faouzi ;
Kermorvant, Christopher .
2014 11TH IAPR INTERNATIONAL WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS (DAS 2014), 2014, :161-165
[10]   An empirical study of smoothing techniques for language modeling [J].
Chen, SF ;
Goodman, J .
COMPUTER SPEECH AND LANGUAGE, 1999, 13 (04) :359-394