Data Augmentation and Text Recognition on Khmer Historical Manuscripts

被引:7
作者
Valy, Dona [1 ]
Verleysen, Michel [2 ]
Chhun, Sophea [1 ]
机构
[1] Inst Technol Cambodia, Dept Informat & Commun Engn, Phnom Penh, Cambodia
[2] Catholic Univ Louvain, ICTEAM Inst, Ottignies, Belgium
来源
2020 17TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR 2020) | 2020年
关键词
historical document analysis; palm leaf manuscript; neural network; data augmentation; CHARACTER;
D O I
10.1109/ICFHR2020.2020.00024
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Analysis and recognition of historical documents faces many challenges, one of which is the scarcity of the ground truth data needed for most machine learning techniques, deep learning in particular. In this paper, we present a novel approach which significantly augments the word image samples generated from an existing dataset of Khmer ancient palm leaf manuscripts. Instead of segmenting real Khmer words, we combine the annotated glyphs into groups called sub-syllables. A new text recognition method is also proposed to take into account the spatially complex structure of Khmer writing. The proposed method is composed of two main modules: a feature generator and a decoder. The generator utilizes convolutional blocks, inception blocks, and also a bi-directional LSTM to encode information extracted from the input image so that it can be decoded by the attention-based decoder to predict the final text transcription. Experiments are conducted on a new dataset of groups of sub-syllables constructed from annotated glyphs of the SleukRith Set.
引用
收藏
页码:73 / 78
页数:6
相关论文
共 16 条
[1]  
Sanchez JA, 2016, INT CONF FRONT HAND, P630, DOI [10.1109/ICFHR.2016.0120, 10.1109/ICFHR.2016.112]
[2]  
[Anonymous], 2017, 4 INT WORKSH HIST DO
[3]  
[Anonymous], 2015, Int. J. Signal Process, DOI DOI 10.14257/IJSIP.2015.8.2.37
[4]   Scan, Attend and Read: End-to-End Handwritten Paragraph Recognition with MDLSTM Attention [J].
Bluche, Theodore ;
Louradour, Jerome ;
Messina, Ronaldo .
2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1, 2017, :1050-1055
[5]   Boosting the deep multidimensional long-short-term memory network for handwritten recognition systems [J].
Castro, Dayvid ;
Bezerra, Byron L. D. ;
Valenca, Meuser .
PROCEEDINGS 2018 16TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2018, :127-132
[6]   ICFHR 2018 Competition on Recognition of Historical Arabic Scientific Manuscripts-RASM2018 [J].
Clausner, Christian ;
Antonacopoulos, Apostolos ;
McGregor, Nora ;
Wilson-Nunn, Daniel .
PROCEEDINGS 2018 16TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2018, :471-476
[7]   A Compact CNN-DBLSTM Based Character Model For Offline Handwriting Recognition with Tucker Decomposition [J].
Ding, Haisong ;
Chen, Kai ;
Yuan, Ye ;
Cai, Meng ;
Sun, Lei ;
Liang, Sen ;
Huo, Qiang .
2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1, 2017, :507-512
[8]   Training an End-to-End Model for Offline Handwritten Japanese Text Recognition by Generated Synthetic Patterns [J].
Nam Tuan Ly ;
Cuong Tuan Nguyen ;
Nakagawa, Masaki .
PROCEEDINGS 2018 16TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2018, :74-79
[9]  
Nguyen C. K., 2017, 4 INT WORKSH HIST DO
[10]   Are Multidimensional Recurrent Layers Really Necessary for Handwritten Text Recognition? [J].
Puigcerver, Joan .
2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1, 2017, :67-72