A segmentation-free word spotting method for historical printed documents

被引:7
作者
Konidaris, Thomas [1 ]
Kesidis, Anastasios L. [2 ]
Gatos, Basilis [1 ]
机构
[1] Natl Ctr Sci Res Demokritos, Inst Informat & Telecommun, Computat Intelligence Lab, Patriarchou Grigoriou St, Athens 15310, Greece
[2] Technol Educ Inst Athens, Dept Surveying Engn, Athens 12210, Greece
关键词
Segmentation-free; Word spotting; Historical documents; RETRIEVAL; ALIGNMENT;
D O I
10.1007/s10044-015-0476-0
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, a two-step segmentation-free word spotting method for historical printed documents is presented. The first step involves a minimum distance matching between a query keyword image and a document page image using keypoint correspondences. In the second step of the method, the matched keypoints on the document image serve as indicators for creating candidate image areas. The query keyword image is matched against the candidate image areas in order to properly estimate the bounding boxes of the detected word instances. The method is evaluated using two datasets of different languages and is compared against segmentation-free state-of-the-art methods. The experimental results show that the proposed method outperforms significantly the competitive approaches.
引用
收藏
页码:963 / 976
页数:14
相关论文
共 36 条
  • [1] [Anonymous], 2007, Tech. Rep
  • [2] Ataer Esra., 2007, CIVR, P341
  • [3] Cao S., 2007, INT C ADV PATT REC, P45
  • [4] RANDOM SAMPLE CONSENSUS - A PARADIGM FOR MODEL-FITTING WITH APPLICATIONS TO IMAGE-ANALYSIS AND AUTOMATED CARTOGRAPHY
    FISCHLER, MA
    BOLLES, RC
    [J]. COMMUNICATIONS OF THE ACM, 1981, 24 (06) : 381 - 395
  • [5] Gatos Basilis, 2009, 2009 10th International Conference on Document Analysis and Recognition (ICDAR), P271, DOI 10.1109/ICDAR.2009.236
  • [6] Hartley R., 2003, Multiple view geometry in computer vision
  • [7] Jawahar C.V., 2004, P WORKSHOP COMPUTER, P73
  • [8] Keyword spotting for cursive document retrieval
    Keaton, P
    Greenspan, H
    Goodman, R
    [J]. WORKSHOP ON DOCUMENT IMAGE ANALYSIS (DIA'97), PROCEEDINGS: IN COOPERATION WITH CVPR '97, 1997, : 74 - 81
  • [9] Kim SH, 2005, LECT NOTES COMPUT SC, V3815, P158
  • [10] Kluzner V, 2009, 10 INT C DOC AN REC, P501