Segmentation-free word spotting with exemplar SVMs

被引:44
作者
Almazan, Jon [1 ]
Gordo, Albert [2 ]
Fornes, Alicia [1 ]
Valveny, Ernest [1 ]
机构
[1] Univ Autonoma Barcelona, Comp Vis Ctr, Dept Ciencies Computacio, Bellaterra 08193, Barcelona, Spain
[2] INRIA Grenoble, Rhone Alpes Res Ctr, F-38330 Montbonnot St Martin, France
关键词
Word spotting; Segmentation-free; Unsupervised learning; Reranking; Query expansion; Compression; RECOGNITION; MODEL;
D O I
10.1016/j.patcog.2014.06.005
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we propose an unsupervised segmentation-free method for word spotting in document images. Documents are represented with a grid of HOG descriptors, and a sliding-window approach is used to locate the document regions that are most similar to the query. We use the Exemplar SVM framework to produce a better representation of the query in an unsupervised way. Then, we use a more discriminative representation based on Fisher Vector to rerank the best regions retrieved, and the most promising ones are used to expand the Exemplar SVM training set and improve the query representation. Finally, the document descriptors are precomputed and compressed with Product Quantization. This offers two advantages: first, a large number of documents can be kept in RAM memory at the same time. Second, the sliding window becomes significantly faster since distances between quantized HOG descriptors can be precomputed. Our results significantly outperform other segmentation-free methods in the literature, both in accuracy and in speed and memory usage. (C) 2014 Elsevier Ltd. All rights reserved.
引用
收藏
页码:3967 / 3978
页数:12
相关论文
共 39 条
  • [1] Good Practice in Large-Scale Learning for Image Classification
    Akata, Zeynep
    Perronnin, Florent
    Harchaoui, Zaid
    Schmid, Cordelia
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2014, 36 (03) : 507 - 520
  • [2] Almazan J., EXEMPLAR WORD SPOTTI
  • [3] Efficient Exemplar Word Spotting
    Almazan, Jon
    Gordo, Albert
    Fornes, Alicia
    Valveny, Ernest
    [J]. PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2012, 2012,
  • [4] [Anonymous], INT C FRONT HANDWR R
  • [5] Multiple queries for large scale specific object retrieval
    Arandjelovic, Relja
    Zisserman, Andrew
    [J]. PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2012, 2012,
  • [6] Arandjelovic R, 2012, PROC CVPR IEEE, P2911, DOI 10.1109/CVPR.2012.6248018
  • [7] The devil is in the details: an evaluation of recent feature encoding methods
    Chatfield, Ken
    Lempitsky, Victor
    Vedaldi, Andrea
    Zisserman, Andrew
    [J]. PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2011, 2011,
  • [8] Total recall: Automatic query expansion with a generative feature model for object retrieval
    Chum, Ondrej
    Philbin, James
    Sivic, Josef
    Isard, Michael
    Zisserman, Andrew
    [J]. 2007 IEEE 11TH INTERNATIONAL CONFERENCE ON COMPUTER VISION, VOLS 1-6, 2007, : 496 - +
  • [9] Chum O, 2011, PROC CVPR IEEE, P889, DOI 10.1109/CVPR.2011.5995601
  • [10] Csurka G., 2004, WORKSH STAT LEARN CO, V1, P1, DOI DOI 10.1234/12345678