Font adaptation of an HMM-based OCR system

被引:0
作者
Ait-Mohand, Kamel [1 ]
Heutte, Laurent [1 ]
Paquet, Thierry [1 ]
Ragot, Nicolas [2 ]
机构
[1] Univ Rouen, LITIS EA 4108, Ave Univ,BP 8, F-76801 St Etienne, France
[2] Univ Francois Rabelais Tours, F-37200 Tours, France
来源
DOCUMENT RECOGNITION AND RETRIEVAL XVII | 2010年 / 7534卷
关键词
OCR; font adaptation; MAP; MLLR; HMM;
D O I
10.1117/12.840321
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We create a polyfont OCR recognizer using HMM (Hidden Markov models) models of character trained on a dataset of various fonts. We compare this system to monofont recognizers showing its decrease of performance when it is used to recognize unseen fonts. In order to fill this gap of performance, we adapt the parameters of the models of the polyfont recognizer to a new dataset of unseen fonts using four different adaptation algorithms. The results of our experiments show that the adapted system is far more accurate than the initial system although it does not reach the accuracy of a monofont recognizer.
引用
收藏
页数:8
相关论文
共 14 条
  • [1] Baird H.S., 1995, Document Image Analysis. Chapter Document Image Defect Models, P315
  • [2] BAIRD HS, 2000, P 1 INT C DOC AN REC, P332
  • [3] A MAXIMIZATION TECHNIQUE OCCURRING IN STATISTICAL ANALYSIS OF PROBABILISTIC FUNCTIONS OF MARKOV CHAINS
    BAUM, LE
    PETRIE, T
    SOULES, G
    WEISS, N
    [J]. ANNALS OF MATHEMATICAL STATISTICS, 1970, 41 (01): : 164 - &
  • [4] E-Hajj R, 2005, PROC INT CONF DOC, P893
  • [5] Maximum a Posteriori Estimation for Multivariate Gaussian Mixture Observations of Markov Chains
    Gauvain, Jean-Luc
    Lee, Chin-Hui
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1994, 2 (02): : 291 - 298
  • [6] MAXIMUM-LIKELIHOOD LINEAR-REGRESSION FOR SPEAKER ADAPTATION OF CONTINUOUS DENSITY HIDDEN MARKOV-MODELS
    LEGGETTER, CJ
    WOODLAND, PC
    [J]. COMPUTER SPEECH AND LANGUAGE, 1995, 9 (02) : 171 - 185
  • [7] A script-independent methodology for optical character recognition
    Makhoul, J
    Schwartz, R
    Lapre, C
    Bazzi, I
    [J]. PATTERN RECOGNITION, 1998, 31 (09) : 1285 - 1294
  • [8] MAROSI I, 2007, P SPIE, V6500
  • [9] A TUTORIAL ON HIDDEN MARKOV-MODELS AND SELECTED APPLICATIONS IN SPEECH RECOGNITION
    RABINER, LR
    [J]. PROCEEDINGS OF THE IEEE, 1989, 77 (02) : 257 - 286
  • [10] Vinciarelli A, 2004, IEEE T PATTERN ANAL, V26, P709, DOI 10.1109/TPAMI.2004.14