A Coarse-to-Fine Approach for Handwritten Word Spotting in Large Scale Historical Documents Collection

被引:4
作者
Almazan, J. [1 ]
Fernandez, D. [1 ]
Fornes, A. [1 ]
Llados, J. [1 ]
Valveny, E. [1 ]
机构
[1] Univ Autonoma Barcelona, Dept Ciencies Comp, Comp Vis Ctr, E-08193 Barcelona, Spain
来源
13TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR 2012) | 2012年
关键词
word spotting; historical documents; appearance models; word indexation;
D O I
10.1109/ICFHR.2012.151
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we propose an approach for word spotting in handwritten document images. We state the problem from a focused retrieval perspective, i.e. locating instances of a query word in a large scale dataset of digitized manuscripts. We combine two approaches, namely one based on word segmentation and another one segmentation-free. The first approach uses a hashing strategy to coarsely prune word images that are unlikely to be instances of the query word. This process is fast but has a low precision due to the errors introduced in the segmentation step. The regions containing candidate words are sent to the second process based on a state of the art technique from the visual object detection field. This discriminative model represents the appearance of the query word and computes a similarity score. In this way we propose a coarse-to-fine approach achieving a compromise between efficiency and accuracy. The validation of the model is shown using a collection of old handwritten manuscripts. We appreciate a substantial improvement in terms of precision regarding the previous proposed method with a low computational cost increase.
引用
收藏
页码:455 / 460
页数:6
相关论文
共 13 条
[1]  
[Anonymous], 2011, INT C COMP VIS
[2]   Histograms of oriented gradients for human detection [J].
Dalal, N ;
Triggs, B .
2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, :886-893
[3]   A pictorial dictionary for printed Farsi subwords [J].
Ebrahimi, Afshin ;
Kabir, Ehisanollah .
PATTERN RECOGNITION LETTERS, 2008, 29 (05) :656-663
[4]   Object Detection with Discriminatively Trained Part-Based Models [J].
Felzenszwalb, Pedro F. ;
Girshick, Ross B. ;
McAllester, David ;
Ramanan, Deva .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2010, 32 (09) :1627-1645
[5]  
Fernández D, 2011, LECT NOTES COMPUT SC, V6669, P628
[6]  
Gatos Basilis, 2009, 2009 10th International Conference on Document Analysis and Recognition (ICDAR), P271, DOI 10.1109/ICDAR.2009.236
[7]  
Glucksman H., 1967, Proceeding of IEEE Computer Conference, P138141
[8]   Text search for medieval manuscript images [J].
Leydier, Yann ;
Lebourgeois, Frank ;
Emptoz, Hubert .
PATTERN RECOGNITION, 2007, 40 (12) :3552-3567
[9]  
Lüthy F, 2007, PROC INT CONF DOC, P8
[10]   A scale space approach for automatically segmenting words from historical handwritten documents [J].
Manmatha, R ;
Rothfeder, JL .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2005, 27 (08) :1212-1225