A Sliding Window Framework for Word Spotting Based on Word Attributes

被引:15
作者
Ghosh, Suman K. [1 ]
Valveny, Ernest [1 ]
机构
[1] Univ Autonoma Barcelona, Comp Vis Ctr, Dept Ciencies Computacio, E-08193 Barcelona, Spain
来源
PATTERN RECOGNITION AND IMAGE ANALYSIS (IBPRIA 2015) | 2015年 / 9117卷
关键词
Word spotting; Sliding window; Word attributes; RECOGNITION;
D O I
10.1007/978-3-319-19390-8_73
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we propose a segmentation-free approach to word spotting. Word images are first encoded into feature vectors using Fisher Vector. Then, these feature vectors are used together with pyramidal histogram of characters labels (PHOC) to learn SVM-based attribute models. Documents are represented by these PHOC based word attributes. To efficiently compute the word attributes over a sliding window, we propose to use an integral image representation of the document using a simplified version of the attribute model. Finally we re-rank the top word candidates using the more discriminative full version of the word attributes. We show state-of-the-art results for segmentation-free query-by-example word spotting in single-writer and multi-writer standard datasets.
引用
收藏
页码:652 / 661
页数:10
相关论文
共 21 条
[11]  
Leslie C., 2002, PACIFIC S BIOCOMPUTI
[12]   Towards an omnilingual word retrieval system for ancient manuscripts [J].
Leydier, Yann ;
Ouji, Asma ;
LeBourgeois, Frank ;
Emptoz, Hubert .
PATTERN RECOGNITION, 2009, 42 (09) :2089-2105
[13]   Text classification using string kernels [J].
Lodhi, H ;
Saunders, C ;
Shawe-Taylor, J ;
Cristianini, N ;
Watkins, C .
JOURNAL OF MACHINE LEARNING RESEARCH, 2002, 2 (03) :419-444
[14]   The IAM-database: An English sentence database for offline handwriting recognition [J].
U.-V. Marti ;
H. Bunke .
International Journal on Document Analysis and Recognition, 2002, 5 (1) :39-46
[15]   Using a statistical language model to improve the performance of an HMM-based cursive handwriting recognition system [J].
Marti, UV ;
Bunke, H .
INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2001, 15 (01) :65-90
[16]   Improving the Fisher Kernel for Large-Scale Image Classification [J].
Perronnin, Florent ;
Sanchez, Jorge ;
Mensink, Thomas .
COMPUTER VISION-ECCV 2010, PT IV, 2010, 6314 :143-156
[17]   Word spotting for historical documents [J].
Rath, Tony M. ;
Manmatha, R. .
INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2007, 9 (2-4) :139-152
[18]   Bag-of-Features HMMs for Segmentation-free Word Spotting in Handwritten Documents [J].
Rothacker, Leonard ;
Rusinol, Marcal ;
Fink, Gernot A. .
2013 12TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2013, :1305-1309
[19]   Browsing Heterogeneous Document Collections by a Segmentation-free Word Spotting Method [J].
Rusinol, Marcal ;
Aldavert, David ;
Toledo, Ricardo ;
Llados, Josep .
11TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2011), 2011, :63-67
[20]  
Vinciarelli A, 2004, IEEE T PATTERN ANAL, V26, P709, DOI 10.1109/TPAMI.2004.14