A Sliding Window Framework for Word Spotting Based on Word Attributes

被引:14
作者
Ghosh, Suman K. [1 ]
Valveny, Ernest [1 ]
机构
[1] Univ Autonoma Barcelona, Comp Vis Ctr, Dept Ciencies Computacio, E-08193 Barcelona, Spain
来源
PATTERN RECOGNITION AND IMAGE ANALYSIS (IBPRIA 2015) | 2015年 / 9117卷
关键词
Word spotting; Sliding window; Word attributes; RECOGNITION;
D O I
10.1007/978-3-319-19390-8_73
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we propose a segmentation-free approach to word spotting. Word images are first encoded into feature vectors using Fisher Vector. Then, these feature vectors are used together with pyramidal histogram of characters labels (PHOC) to learn SVM-based attribute models. Documents are represented by these PHOC based word attributes. To efficiently compute the word attributes over a sliding window, we propose to use an integral image representation of the document using a simplified version of the attribute model. Finally we re-rank the top word candidates using the more discriminative full version of the word attributes. We show state-of-the-art results for segmentation-free query-by-example word spotting in single-writer and multi-writer standard datasets.
引用
收藏
页码:652 / 661
页数:10
相关论文
共 21 条
[1]   Word Spotting and Recognition with Embedded Attributes [J].
Almazan, Jon ;
Gordo, Albert ;
Fornes, Alicia ;
Valveny, Ernest .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2014, 36 (12) :2552-2566
[2]   Segmentation-free word spotting with exemplar SVMs [J].
Almazan, Jon ;
Gordo, Albert ;
Fornes, Alicia ;
Valveny, Ernest .
PATTERN RECOGNITION, 2014, 47 (12) :3967-3978
[3]  
[Anonymous], INT C FRONT HANDWR R
[4]  
Arandjelovic R, 2012, PROC CVPR IEEE, P2911, DOI 10.1109/CVPR.2012.6248018
[5]   Total recall: Automatic query expansion with a generative feature model for object retrieval [J].
Chum, Ondrej ;
Philbin, James ;
Sivic, Josef ;
Isard, Michael ;
Zisserman, Andrew .
2007 IEEE 11TH INTERNATIONAL CONFERENCE ON COMPUTER VISION, VOLS 1-6, 2007, :496-+
[6]  
Csurka G., 2004, WORKSH STAT LEARN CO, V1, P1, DOI DOI 10.1234/12345678
[7]  
Dalal N., 2005, IEEE C COMP VIS PATT
[8]  
Fischer A., 2010, INT C PATT REC
[9]   A Novel Word Spotting Method Based on Recurrent Neural Networks [J].
Frinken, Volkmar ;
Fischer, Andreas ;
Manmatha, R. ;
Bunke, Horst .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (02) :211-224
[10]  
Kovalchuk A., 2014, INT C FRONT HANDWR R