OCR-independent and Segmentation-free Word-Spotting in Handwritten Arabic Archive Documents

被引:0
|
作者
Aouadi, N. [1 ]
Kacem, A. [1 ]
机构
[1] LaTICE, Res Lab Technol Informat & Commun & Elect Engn, Tunis, Tunisia
来源
2013 INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING AND SOFTWARE APPLICATIONS (ICEESA) | 2013年
关键词
OCR; Word-spotting; Generalized Hough Transform; Clustering; Handwritten Recognition; Historical document; RETRIEVAL;
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
In this paper, a word-spotting approach is presented that can help in reading handwritten Arabic Archive Documents. Because of the low quality of these documents, the proposed approach is free segmentation, independent of OCR, using a global transformation of word images. It is a based learning approach which employs Generalized Hough Transform (GHT) technique. It detects words, described by their models, in documents images by finding the model's position in the image. With the GHT, the problem of finding the model's position is transformed to a problem of finding the transformation's parameter that maps the model into the image. Parameters such as Hough threshold and distance between voting points are considered for a better location and recognition of words. We tested our system on registers from the 19th century onwards, held in the National Archives of Tunisia. Our first experiments reach an average of 94% of well-spotted words.
引用
收藏
页码:36 / 41
页数:6
相关论文
共 50 条
  • [21] Word Hypotheses for Segmentation-free Word Spotting in Historic Document Images
    Rothacker, Leonard
    Sudholt, Sebastian
    Rusakov, Eugen
    Kasperidus, Matthias
    Fink, Gernot A.
    2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1, 2017, : 1174 - 1179
  • [22] Onmilingual segmentation-free word spotting for ancient manuscripts indexation
    Leydier, Y
    Le Bourgeois, F
    Emptoz, H
    EIGHTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS 1 AND 2, PROCEEDINGS, 2005, : 533 - 537
  • [23] An old greek handwritten OCR system based on an efficient segmentation-free approach
    K. Ntzios
    B. Gatos
    I. Pratikakis
    T. Konidaris
    S. J. Perantonis
    International Journal of Document Analysis and Recognition (IJDAR), 2007, 9 : 179 - 192
  • [24] An efficient segmentation-free approach to assist old Greek handwritten manuscript OCR
    B. Gatos
    K. Ntzios
    I. Pratikakis
    S. Petridis
    T. Konidaris
    S. J. Perantonis
    Pattern Analysis and Applications, 2006, 8 : 305 - 320
  • [25] Learning-free handwritten word spotting method for historical handwritten documents
    Mohammed, Hanadi Hassen
    Subramanian, Nandhini
    Al-Madeed, Somaya
    IET IMAGE PROCESSING, 2021, 15 (10) : 2332 - 2341
  • [26] A segmentation-free recognition technique to assist old Greek handwritten manuscript OCR
    Gatos, B
    Ntzios, K
    Pratikakis, I
    Petridis, S
    Konidaris, T
    Perantonis, SJ
    DOCUMENT ANALYSIS SYSTEMS VI, PROCEEDINGS, 2004, 3163 : 63 - 74
  • [27] Fusion of explicit segmentation based system and segmentation-free based system for on-line Arabic handwritten word recognition
    Khlif, Hanen
    Prum, Sophea
    Kessentini, Yousri
    Kanoun, Slim
    Ogier, Jean Marc
    PROCEEDINGS OF 2016 15TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2016, : 399 - 404
  • [28] An efficient segmentation-free approach to assist old Greek handwritten manuscript OCR
    Gatos, B
    Ntzios, K
    Pratikakis, I
    Petridis, S
    Konidaris, T
    Perantonis, S
    PATTERN ANALYSIS AND APPLICATIONS, 2006, 8 (04) : 305 - 320
  • [29] An old greek handwritten OCR system based on an efficient segmentation-free approach
    Ntzios, K.
    Gatos, B.
    Pratikakis, I.
    Konidaris, T.
    Perantonis, S. J.
    INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2007, 9 (2-4) : 179 - 192
  • [30] On Evaluation of Segmentation-Free Word Spotting Approaches Without Hard Decisions
    Pantke, Werner
    Maergner, Volker
    Fingscheidt, Tim
    2013 12TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2013, : 1300 - 1304