Recognizing Handwritten Text Lines in Ancient Document Images Based on a Gated Residual Recurrent Neural Network

Times cited: 0
Authors
Mechi, Olfa [1]
Mehri, Maroua [1]
Ben Amara, Najoua Essoukri [1]
Affiliations
[1] Univ Sousse, LATIS Lab Adv Technol & Intelligent Syst, Ecole Natl Ingn Sousse, Sousse 4023, Tunisia
Source
ADVANCES IN COMPUTATIONAL COLLECTIVE INTELLIGENCE, ICCCI 2022 | 2022 / Vol. 1653
Keywords
Text line recognition; Historical handwritten documents; Gated mechanism; Skip connection; BLSTM; CTC; RECOGNITION; SEQUENCE;
DOI
10.1007/978-3-031-16210-7_20
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Over several decades, many archives and libraries have highlighted the growing need to assist them in the preservation and enrichment of the huge mass of digitized documentary heritage by using efficient handwritten text recognition (HTR) frameworks. To address this issue, we propose in this paper a deep learning based framework for recognizing handwritten text lines in historical document images. The proposed framework is based on a gated residual recurrent neural network, called G2R2N. G2R2N is composed of two modules: encoder and decoder. The encoder module is based on merging the gated and skip connection layers, while the decoder module is composed of the bidirectional long short-term memory (BLSTM), followed by the connectionist temporal classification (CTC) architectures. The proposed framework is evaluated using the same evaluation metrics computed in the context of the ICDAR2017 competition. Numerical and qualitative observations are reported on different benchmark datasets used in the most well-known HTR contests.
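The abstract names CTC as the final stage of the decoder but does not say how label sequences are read out of it. As a minimal sketch, and not the authors' implementation, the snippet below shows greedy (best-path) CTC decoding, one common readout: take the most likely label per frame, merge consecutive repeats, then drop the blank label. The function name `ctc_greedy_decode`, the blank index 0, and the toy alphabet are assumptions made for illustration only.

```python
from itertools import groupby

BLANK = 0  # assumed index of the CTC blank label (hypothetical convention)


def ctc_greedy_decode(frame_scores, alphabet, blank=BLANK):
    """Best-path CTC decoding over a list of per-frame score vectors.

    frame_scores: list of lists, one score per label for each time step.
    alphabet: list mapping label index to character; the entry at `blank`
              is a placeholder and is never emitted.
    """
    # 1. Most likely label index at each time step.
    best_path = [
        max(range(len(scores)), key=scores.__getitem__)
        for scores in frame_scores
    ]
    # 2. Merge consecutive repeats of the same label.
    collapsed = [label for label, _ in groupby(best_path)]
    # 3. Drop blanks and map the remaining indices to characters.
    return "".join(alphabet[label] for label in collapsed if label != blank)


# Toy example: 3 labels (blank, "a", "b") over 7 frames.
alphabet = ["", "a", "b"]
frame_scores = [
    [0.10, 0.80, 0.10],
    [0.20, 0.70, 0.10],
    [0.90, 0.05, 0.05],
    [0.10, 0.60, 0.30],
    [0.10, 0.20, 0.70],
    [0.10, 0.10, 0.80],
    [0.90, 0.05, 0.05],
]
decoded = ctc_greedy_decode(frame_scores, alphabet)
```

In the toy input, the blank frame between the two runs of "a" is what lets the repeated character survive the merge step; without it the two runs would collapse into one.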
Pages: 250-263
Number of pages: 14