Data Augmentation and Text Recognition on Khmer Historical Manuscripts

Cited by: 6
Authors
Valy, Dona [1]
Verleysen, Michel [2]
Chhun, Sophea [1]
Affiliations
[1] Inst Technol Cambodia, Dept Informat & Commun Engn, Phnom Penh, Cambodia
[2] Catholic Univ Louvain, ICTEAM Inst, Ottignies, Belgium
Source
2020 17TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR 2020) | 2020
Keywords
historical document analysis; palm leaf manuscript; neural network; data augmentation; CHARACTER;
DOI
10.1109/ICFHR2020.2020.00024
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Analysis and recognition of historical documents face many challenges, one of which is the scarcity of the ground truth data needed for most machine learning techniques, deep learning in particular. In this paper, we present a novel approach which significantly augments the word image samples generated from an existing dataset of Khmer ancient palm leaf manuscripts. Instead of segmenting real Khmer words, we combine the annotated glyphs into groups called sub-syllables. A new text recognition method is also proposed to take into account the spatially complex structure of Khmer writing. The proposed method is composed of two main modules: a feature generator and a decoder. The generator uses convolutional blocks, inception blocks, and a bi-directional LSTM to encode information extracted from the input image, which the attention-based decoder then decodes to predict the final text transcription. Experiments are conducted on a new dataset of groups of sub-syllables constructed from annotated glyphs of the SleukRith Set.
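The augmentation idea in the abstract, building new samples by combining annotated glyphs rather than segmenting real words, can be illustrated with a toy sketch. This is not the authors' procedure: the abstract does not describe how glyphs are positioned, and real Khmer glyphs also stack vertically, which this sketch ignores. The names `compose_group`, `pad_to_height`, and `glyph_bank`, the binary list-of-lists image format, and the left-to-right layout are all assumptions made for illustration.

```python
import random
from typing import Dict, List, Sequence, Tuple

Image = List[List[int]]  # rows of 0/1 pixels (hypothetical toy format)


def pad_to_height(img: Image, height: int) -> Image:
    """Append blank rows at the bottom so the image has `height` rows."""
    width = len(img[0])
    blank = [[0] * width for _ in range(height - len(img))]
    return [row[:] for row in img] + blank


def compose_group(
    labels: Sequence[str],
    glyph_bank: Dict[str, List[Image]],
    rng: random.Random,
    gap: int = 1,
) -> Tuple[Image, List[str]]:
    """Build one synthetic sample from one randomly chosen image per label.

    Assumption: glyphs are simply laid out left to right with `gap` blank
    columns between them; the source does not specify the real layout rule.
    """
    chosen = [rng.choice(glyph_bank[label]) for label in labels]
    height = max(len(img) for img in chosen)
    padded = [pad_to_height(img, height) for img in chosen]

    rows: Image = []
    for y in range(height):
        row: List[int] = []
        for i, img in enumerate(padded):
            if i > 0:
                row.extend([0] * gap)  # blank spacing between glyphs
            row.extend(img[y])
        rows.append(row)
    return rows, list(labels)


# Hypothetical glyph bank: two handwriting variants of "ka", one of "aa".
glyph_bank = {
    "ka": [[[1, 1], [1, 0]], [[1, 1], [0, 1]]],
    "aa": [[[1], [1], [1]]],
}
image, target = compose_group(["ka", "aa"], glyph_bank, random.Random(0))
```

Because each label can be drawn from several annotated variants, the number of distinct synthetic images grows multiplicatively with the number of variants per glyph, which is one way a small annotated set could yield many training samples.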
Pages: 73-78
Page count: 6