Knowledge transfer using Neural network based approach for handwritten text recognition

被引:8
作者
Nair, Rathin Radhakrishnan [1 ]
Sankaran, Nishant [1 ]
Kota, Bharagava Urala [1 ]
Tulyakov, Sergey [1 ]
Setlur, Srirangaraj [1 ]
Govindaraju, Venu [1 ]
机构
[1] SUNY Buffalo, Dept Comp Sci & Engn, Buffalo, NY 14260 USA
来源
2018 13TH IAPR INTERNATIONAL WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS (DAS) | 2018年
关键词
handwriting recognition; transfer learning; adaptive recognition; lstm; cnn;
D O I
10.1109/DAS.2018.75
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The goal of a writer adaptive handwriting recognition system is to build a model that improves the recognition of a generic recognition model for a specific author. In this work, we show how structural representation learned from a generic writer-independent handwriting recognition model can be customized to individual authors. Convolutional Neural Network have shown outstanding performance in learning image-based representation that were used for classification. Additionally, they have been used along with Recurrent Neural Network (RNN) or its variations like, LSTM and GRU layers to analyze and understand sequences in handwriting recognition, sentence analysis, voice recognition etc. In most cases, the CNNs serve as a feature extractor instead of low-level hand-designed features that were used previously for the above mentioned classification tasks. We design a method to reuse weights from layers trained on the IAM offline handwritten dataset to compute mid-level image representation for text in the Washington and Moore dataset. We show that despite differences in the writing style, fonts across these datasets, the transferred representation is able to capture a spatio-temporal representation leading to significantly improved recognition results. We hypothesize that the performance is solely not dependent on the number of samples and the model is evaluated with varying amount of fine-tuning samples showing promising results backing the hypothesis.
引用
收藏
页码:441 / 446
页数:6
相关论文
共 25 条
[1]  
[Anonymous], 2007, INT C NEUR INF PROC
[2]  
[Anonymous], 2008, Advances in neural information processing systems, DOI DOI 10.1007/978-1-4471-4072-6_12
[3]  
[Anonymous], 2013, ICML
[4]  
Bahdanau D, 2016, Arxiv, DOI arXiv:1409.0473
[5]   A Comparison of Sequence-Trained Deep Neural Networks and Recurrent Neural Networks Optical Modeling for Handwriting Recognition [J].
Bluche, Theodore ;
Ney, Hermann ;
Kermorvant, Christopher .
STATISTICAL LANGUAGE AND SPEECH PROCESSING, SLSP 2014, 2014, 8791 :199-210
[6]  
Caruana R., 1995, Advances in Neural Information Processing Systems 7, P657
[7]  
Connell S. D., 1999, Proceedings of the Fifth International Conference on Document Analysis and Recognition. ICDAR '99 (Cat. No.PR00318), P434, DOI 10.1109/ICDAR.1999.791817
[8]  
Doetsch P., 2016, ARXIV160800895
[9]  
Graves A., 2006, PROC INT C MACH LEAR, P369, DOI DOI 10.1145/1143844.1143891
[10]  
Graves A, 2007, LECT NOTES COMPUT SC, V4668, P549