Offline continuous handwriting recognition using sequence to sequence neural networks

被引:91
作者
Sueiras, Jorge [1 ]
Ruiz, Victoria [1 ]
Sanchez, Angel [1 ]
Velez, Jose F. [1 ]
机构
[1] Rey Juan Carlos Univ, Madrid, Spain
关键词
Artificial intelligence; Handwriting recognition; Convolutional Neural Networks; Recurrent Neural Networks; Sequence to sequence networks;
D O I
10.1016/j.neucom.2018.02.008
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes the use of a new neural network architecture that combines a deep convolutional neural network with an encoder-decoder, called sequence to sequence, to solve the problem of recognizing isolated handwritten words. The proposed architecture aims to identify the characters and contextualize them with their neighbors to recognize any given word. Our model proposes a novel way to extract relevant visual features from a word image. It combines the use of a horizontal sliding window, to extract image patches, and the application of the LeNet-5 convolutional architecture to identify the characters. Extracted features are modeled using a sequence-to-sequence architecture to encode the visual characteristics and then to decode the sequence of characters in the handwritten text image. We test the proposed model on two handwritten databases (IAM and RIMES) under several experiments to determine the optimal parameterization of the model. Competitive results above those presented in the current state-of-the-art, on handwriting models, are achieved. Without using any language model and with closed dictionary, we obtain a word error rate in the test set of 12.7% in IAM and 6.6% in RIMES. (C) 2018 Elsevier B.V. All rights reserved.
引用
收藏
页码:119 / 128
页数:10
相关论文
共 41 条
[1]  
Abadi M., 2015, TensorFlow: Large-scale machine learning on heterogeneous systems.
[2]  
Aiquan Yuan, 2012, Proceedings of the 10th IAPR International Workshop on Document Analysis Systems (DAS 2012), P125, DOI 10.1109/DAS.2012.61
[3]   Word Spotting and Recognition with Embedded Attributes [J].
Almazan, Jon ;
Gordo, Albert ;
Fornes, Alicia ;
Valveny, Ernest .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2014, 36 (12) :2552-2566
[4]  
[Anonymous], CORR
[5]  
[Anonymous], P INT C INT SYST DES
[6]  
[Anonymous], 2016, INT, DOI DOI 10.5120/IJCA2016908349
[7]  
[Anonymous], 2008, Advances in neural information processing systems, DOI DOI 10.1007/978-1-4471-4072-6_12
[8]  
[Anonymous], 2007, P 2007 INT C ART NEU
[9]  
[Anonymous], DOCUMENT RECOGNITION
[10]  
[Anonymous], 1989, Manual of information to accompany a standard corpus of present-day edited American English, for use with digital computers