Handwritten word-spotting using hidden Markov models and universal vocabularies

被引:115
作者
Rodriguez-Serrano, Jose A. [1 ]
Perronnin, Florent [2 ]
机构
[1] Univ Autonoma Barcelona, CVC, Bellaterra 08193, Spain
[2] Xerox Res Ctr Europe, F-38240 Meylan, France
关键词
Word-spotting; Hidden Markov model; Score normalization; Universal vocabulary; Handwriting recognition; TEXT LINE; RECOGNITION; DOCUMENTS; SEGMENTATION; FEATURES;
D O I
10.1016/j.patcog.2009.02.005
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Handwritten word-spotting is traditionally viewed as an image matching task between one or multiple query word-images and a set of candidate word-images in a database. This is a typical instance of the query-by-example paradigm. In this article, we introduce a statistical framework for the word-spotting problem which employs hidden Markov models (HMMs) to model keywords and a Gaussian mixture model (GMM) for score normalization. We explore the use of two types of HMMs for the word modeling part: continuous HMMs (C-HMMs) and semi-continuous HMMs (SC-HMMs), i.e. HMMs with a shared set of Gaussians. We show on a challenging multi-writer corpus that the proposed statistical framework is always superior to a traditional matching system which uses dynamic time warping (DTW) for word-image distance computation. A very important finding is that the SC-HMM is superior when labeled training data is scarce-as low as one sample per keyword-thanks to the prior information which can be incorporated in the shared set of Gaussians. (C) 2009 Elsevier Ltd. All rights reserved.
引用
收藏
页码:2106 / 2116
页数:11
相关论文
共 57 条
[1]   Word matching using single closed contours for indexing handwritten historical documents [J].
Adamek, Tornasz ;
O'Connor, Noel E. ;
Smeaton, Alan F. .
INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2007, 9 (2-4) :153-165
[2]   A MAXIMIZATION TECHNIQUE OCCURRING IN STATISTICAL ANALYSIS OF PROBABILISTIC FUNCTIONS OF MARKOV CHAINS [J].
BAUM, LE ;
PETRIE, T ;
SOULES, G ;
WEISS, N .
ANNALS OF MATHEMATICAL STATISTICS, 1970, 41 (01) :164-&
[3]   TIED MIXTURE CONTINUOUS PARAMETER MODELING FOR SPEECH RECOGNITION [J].
BELLEGARDA, JR ;
NAHAMOO, D .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1990, 38 (12) :2033-2045
[4]  
Brakensiek A, 2003, PROC INT CONF DOC, P294
[5]  
BRAKENSIEK A, 2004, HANDWRITTEN ADDRESS, P103
[6]   OFF-LINE CURSIVE HANDWRITING RECOGNITION USING HIDDEN MARKOV-MODELS [J].
BUNKE, H ;
ROTH, M ;
SCHUKATTALAMAZZINI, EG .
PATTERN RECOGNITION, 1995, 28 (09) :1399-1413
[7]  
Bunke H, 2003, PROC INT CONF DOC, P448
[8]  
CAO H, 2007, 6 INT C ADV PATT REC
[9]  
CHEN MY, 1994, IEEE T PATTERN ANAL, V16, P481
[10]   A HIGH-SPEED INGAASP/INP DFB LASER WITH AN AIR BRIDGE CONTACT CONFIGURATION [J].
CHEN, TR ;
CHEN, PC ;
GEE, C ;
BARCHAIM, N .
IEEE PHOTONICS TECHNOLOGY LETTERS, 1993, 5 (01) :1-3