Improving Recurrent Neural Networks for Offline Arabic Handwriting Recognition by Combining Different Language Models

Cited by: 8
Authors
Jemni, Sana Khamekhem [1]
Kessentini, Yousri [1, 2, 3]
Kanoun, Slim [1]
Affiliations
[1] Univ Sfax, MIRACL Lab, Sfax, Tunisia
[2] Digital Res Ctr Sfax, BP 275, Sfax 3021, Tunisia
[3] Univ Rouen, LITIS Lab, EA 4108, St Etienne Du Rouvray, France
Keywords
Convolutional neural network; multi-dimensional long-short term memory network; histogram of oriented gradients; bidirectional long-short term memory network; hybrid language model; out of vocabulary word; SYSTEM;
DOI
10.1142/S0218001420520072
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In handwriting recognition, designing relevant features is very important but daunting. Deep neural networks can extract pertinent features automatically from the input image, which removes the dependency on handcrafted features, whose design is typically a trial-and-error process. In this paper, we perform an exhaustive experimental evaluation of learned versus handcrafted features for the Arabic handwriting recognition task. Moreover, we focus on optimizing the competing full-word-based language models by incorporating different character and sub-word models. We extensively investigate the use of different sub-word-based language models, mainly characters, pseudo-words, morphemes and hybrid units, in order to enhance the full-word handwriting recognition system for Arabic script. The proposed method allows any out-of-vocabulary word to be recognized as an arbitrary sequence of sub-word units. The KHATT database is used as a benchmark for Arabic handwriting recognition. We show that combining multiple language models considerably enhances recognition performance for a morphologically rich language like Arabic. We achieve state-of-the-art performance on the KHATT dataset.
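The idea of representing an out-of-vocabulary word as a sequence of sub-word units can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the trailing `+` continuation marker, and the Latin placeholder words are all assumptions made for illustration, and characters stand in for whichever sub-word unit (pseudo-word, morpheme) a real system would use. The sketch also assumes that `+` never occurs in the text itself.

```python
from typing import Iterable, List, Set


def to_hybrid_units(words: Iterable[str], vocabulary: Set[str]) -> List[str]:
    """Keep in-vocabulary words whole; spell other words as character units.

    Hypothetical convention: a trailing "+" on a unit means the word continues.
    """
    units: List[str] = []
    for word in words:
        if not word:
            continue  # ignore empty tokens
        if word in vocabulary:
            units.append(word)
        else:
            chars = list(word)
            units.extend(c + "+" for c in chars[:-1])
            units.append(chars[-1])  # last unit carries no marker
    return units


def from_hybrid_units(units: Iterable[str]) -> List[str]:
    """Rebuild full words from a hybrid unit sequence."""
    words: List[str] = []
    pending = ""
    for unit in units:
        if unit.endswith("+"):
            pending += unit[:-1]  # word continues with the next unit
        else:
            words.append(pending + unit)
            pending = ""
    return words


# Placeholder vocabulary and text; "zorg" plays the out-of-vocabulary word.
vocabulary = {"the", "cat"}
units = to_hybrid_units("the zorg cat".split(), vocabulary)
restored = from_hybrid_units(units)
```

A language model trained on such unit sequences can assign a probability to any word, since every word has at least a character-level spelling; the cost is longer sequences for rare words.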
Pages: 29