Page Level Input for Handwritten Text Recognition in Document Images

被引:0
作者
Kumari, Lalita [1 ]
Singh, Sukhdeep [2 ,3 ]
Sharma, Anuj [1 ]
机构
[1] Panjab Univ, Dept Comp Sci & Applicat, Chandigarh, India
[2] DM Coll, Moga, Punjab, India
[3] Panjab Univ, Chandigarh, India
来源
PROCEEDINGS OF 7TH INTERNATIONAL CONFERENCE ON HARMONY SEARCH, SOFT COMPUTING AND APPLICATIONS (ICHSA 2022) | 2022年 / 140卷
关键词
Handwritten text recognition; Segmentation; Word attention; Convolutional neural network; Word beam search;
D O I
10.1007/978-981-19-2948-9_17
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Important information present all over the world in the form of libraries need to be digitized. An end-to-end handwritten text recognition system covers every aspect, from capturing an image as input to predict text written inside it. In the present study, we have presented a document image-based handwritten text recognition system with three parts as: page level image as input with multiple lines, recognition of text of input image and attention level of text. This model is trained and tested on various split ratios of labelled words of the JAM handwriting dataset. The recognition model evaluation is performed on two publicly available datasets that are JAM handwriting dataset and handwritten short answer scoring dataset. We have used popular CTC word beam search technique by Scheidl et al. [1] in recognition phase, and attention of text has been incorporated using established sentic data by Ref. [2]. Our findings at recognition level result in 8.99% validation character error rate on JAM handwriting dataset and 35.92 BLEU score on the handwritten short answering scoring dataset. The post recognition findings suggest text attention role in acceptance of recognized words useful for real life business documents.
引用
收藏
页码:171 / 183
页数:13
相关论文
共 22 条
  • [1] Arivazhagan M, 2007, P SPIE
  • [2] Efficient illumination compensation techniques for text images
    Chen, Kuo-Nan
    Chen, Chin-Hao
    Chang, Chin-Chen
    [J]. DIGITAL SIGNAL PROCESSING, 2012, 22 (05) : 726 - 733
  • [3] Simultaneous Script Identification and Handwriting Recognition via Multi-Task Learning of Recurrent Neural Networks
    Chen, Zhuo
    Wu, Yichao
    Yin, Pei
    Liu, Cheng-Lin
    [J]. 2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1, 2017, : 525 - 530
  • [4] Exploring the Impact of Handwriting Recognition on the Automated Scoring of Handwritten Student Answers
    Gold, Christian
    Zesch, Torsten
    [J]. 2020 17TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR 2020), 2020, : 252 - 257
  • [5] Graves A., 2006, PROC 23 INT C MACHIN, P369
  • [6] Word Spotting and Recognition using Deep Embedding
    Krishnan, Praveen
    Dutta, Kartik
    Jawahar, C. V.
    [J]. 2018 13TH IAPR INTERNATIONAL WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS (DAS), 2018, : 1 - 6
  • [7] A Review of Deep Learning Techniques in Document Image Word Spotting
    Kumari, Lalita
    Sharma, Anuj
    [J]. ARCHIVES OF COMPUTATIONAL METHODS IN ENGINEERING, 2022, 29 (02) : 1085 - 1106
  • [8] Liwicki M, 2012, STUD COMPUT INTELL, V386, P5
  • [9] Curriculum Learning for Handwritten Text Line Recognition
    Louradour, Jerome
    Kermorvant, Christopher
    [J]. 2014 11TH IAPR INTERNATIONAL WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS (DAS 2014), 2014, : 56 - 60
  • [10] Manmatha R, 1999, LECT NOTES COMPUT SC, V1682, P22