Handwritten word-spotting using hidden Markov models and universal vocabularies

被引:114
作者
Rodriguez-Serrano, Jose A. [1 ]
Perronnin, Florent [2 ]
机构
[1] Univ Autonoma Barcelona, CVC, Bellaterra 08193, Spain
[2] Xerox Res Ctr Europe, F-38240 Meylan, France
关键词
Word-spotting; Hidden Markov model; Score normalization; Universal vocabulary; Handwriting recognition; TEXT LINE; RECOGNITION; DOCUMENTS; SEGMENTATION; FEATURES;
D O I
10.1016/j.patcog.2009.02.005
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Handwritten word-spotting is traditionally viewed as an image matching task between one or multiple query word-images and a set of candidate word-images in a database. This is a typical instance of the query-by-example paradigm. In this article, we introduce a statistical framework for the word-spotting problem which employs hidden Markov models (HMMs) to model keywords and a Gaussian mixture model (GMM) for score normalization. We explore the use of two types of HMMs for the word modeling part: continuous HMMs (C-HMMs) and semi-continuous HMMs (SC-HMMs), i.e. HMMs with a shared set of Gaussians. We show on a challenging multi-writer corpus that the proposed statistical framework is always superior to a traditional matching system which uses dynamic time warping (DTW) for word-image distance computation. A very important finding is that the SC-HMM is superior when labeled training data is scarce-as low as one sample per keyword-thanks to the prior information which can be incorporated in the shared set of Gaussians. (C) 2009 Elsevier Ltd. All rights reserved.
引用
收藏
页码:2106 / 2116
页数:11
相关论文
共 57 条
  • [1] Word matching using single closed contours for indexing handwritten historical documents
    Adamek, Tornasz
    O'Connor, Noel E.
    Smeaton, Alan F.
    [J]. INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2007, 9 (2-4) : 153 - 165
  • [2] A MAXIMIZATION TECHNIQUE OCCURRING IN STATISTICAL ANALYSIS OF PROBABILISTIC FUNCTIONS OF MARKOV CHAINS
    BAUM, LE
    PETRIE, T
    SOULES, G
    WEISS, N
    [J]. ANNALS OF MATHEMATICAL STATISTICS, 1970, 41 (01): : 164 - &
  • [3] TIED MIXTURE CONTINUOUS PARAMETER MODELING FOR SPEECH RECOGNITION
    BELLEGARDA, JR
    NAHAMOO, D
    [J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1990, 38 (12): : 2033 - 2045
  • [4] Brakensiek A, 2003, PROC INT CONF DOC, P294
  • [5] BRAKENSIEK A, 2004, HANDWRITTEN ADDRESS, P103
  • [6] OFF-LINE CURSIVE HANDWRITING RECOGNITION USING HIDDEN MARKOV-MODELS
    BUNKE, H
    ROTH, M
    SCHUKATTALAMAZZINI, EG
    [J]. PATTERN RECOGNITION, 1995, 28 (09) : 1399 - 1413
  • [7] Bunke H, 2003, PROC INT CONF DOC, P448
  • [8] CAO H, 2007, 6 INT C ADV PATT REC
  • [9] CHEN MY, 1994, IEEE T PATTERN ANAL, V16, P481
  • [10] A HIGH-SPEED INGAASP/INP DFB LASER WITH AN AIR BRIDGE CONTACT CONFIGURATION
    CHEN, TR
    CHEN, PC
    GEE, C
    BARCHAIM, N
    [J]. IEEE PHOTONICS TECHNOLOGY LETTERS, 1993, 5 (01) : 1 - 3