Segmentation-free word spotting with exemplar SVMs

被引：46

作者：

Almazan, Jon ^{[1
]}

Gordo, Albert ^{[2
]}

Fornes, Alicia ^{[1
]}

Valveny, Ernest ^{[1
]}

机构：

[1] Univ Autonoma Barcelona, Comp Vis Ctr, Dept Ciencies Computacio, Bellaterra 08193, Barcelona, Spain

[2] INRIA Grenoble, Rhone Alpes Res Ctr, F-38330 Montbonnot St Martin, France

来源：

PATTERN RECOGNITION | 2014年 / 47卷 / 12期

关键词：

Word spotting; Segmentation-free; Unsupervised learning; Reranking; Query expansion; Compression; RECOGNITION; MODEL;

D O I：

10.1016/j.patcog.2014.06.005

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper we propose an unsupervised segmentation-free method for word spotting in document images. Documents are represented with a grid of HOG descriptors, and a sliding-window approach is used to locate the document regions that are most similar to the query. We use the Exemplar SVM framework to produce a better representation of the query in an unsupervised way. Then, we use a more discriminative representation based on Fisher Vector to rerank the best regions retrieved, and the most promising ones are used to expand the Exemplar SVM training set and improve the query representation. Finally, the document descriptors are precomputed and compressed with Product Quantization. This offers two advantages: first, a large number of documents can be kept in RAM memory at the same time. Second, the sliding window becomes significantly faster since distances between quantized HOG descriptors can be precomputed. Our results significantly outperform other segmentation-free methods in the literature, both in accuracy and in speed and memory usage. (C) 2014 Elsevier Ltd. All rights reserved.

引用

页码：3967 / 3978

页数：12

共 39 条

[1] Good Practice in Large-Scale Learning for Image Classification [J].

Akata, Zeynep ;

Perronnin, Florent ;

Harchaoui, Zaid ;

Schmid, Cordelia .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2014, 36 (03) :507-520

[2]

Almazan J., EXEMPLAR WORD SPOTTI

[3] Efficient Exemplar Word Spotting [J].

Almazan, Jon ;

Gordo, Albert ;

Fornes, Alicia ;

Valveny, Ernest .

PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2012, 2012,

[4]

[Anonymous], INT C FRONT HANDWR R

[5] Multiple queries for large scale specific object retrieval [J].

Arandjelovic, Relja ;

Zisserman, Andrew .

PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2012, 2012,

[6]

Arandjelovic R, 2012, PROC CVPR IEEE, P2911, DOI 10.1109/CVPR.2012.6248018

[7] The devil is in the details: an evaluation of recent feature encoding methods [J].

Chatfield, Ken ;

Lempitsky, Victor ;

Vedaldi, Andrea ;

Zisserman, Andrew .

PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2011, 2011,

[8] Total recall: Automatic query expansion with a generative feature model for object retrieval [J].

Chum, Ondrej ;

Philbin, James ;

Sivic, Josef ;

Isard, Michael ;

Zisserman, Andrew .

2007 IEEE 11TH INTERNATIONAL CONFERENCE ON COMPUTER VISION, VOLS 1-6, 2007, :496-+

[9]

Chum O, 2011, PROC CVPR IEEE, P889, DOI 10.1109/CVPR.2011.5995601

[10]

Csurka G., 2004, WORKSH STAT LEARN CO, V1, P1, DOI DOI 10.1234/12345678

← 1 2 3 4 →