Discriminative Bernoulli Mixture Models for Handwritten Digit Recognition

被引:6
作者
Gimenez, Adria [1 ]
Andres-Ferrer, J. [1 ]
Juan, Alfons [1 ]
Serrano, Nicolas [1 ]
机构
[1] Univ Politecn Valencia, DSIC, Valencia 46022, Spain
来源
11TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2011) | 2011年
关键词
Bernoulli mixture; discriminative training; MMI; mixture of multi-class logistic regression; log-linear models;
D O I
10.1109/ICDAR.2011.118
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Bernoulli-based models such as Bernoulli mixtures or Bernoulli HMMs (BHMMs), have been successfully applied to several handwritten text recognition (HTR) tasks which range from character recognition to continuous and isolated handwritten words. All these models belong to the generative model family and, hence, are usually trained by (joint) maximum likelihood estimation (MLE). Despite the good properties of the MLE criterion, there are better training criteria such as maximum mutual information (MMI). The MMI is a widespread criterion that is mainly employed to train discriminative models such as log-linear (or maximum entropy) models. Inspired by the Bernoulli mixture classifier, in this work a log-linear model for binary data is proposed, the so-called mixture of multi-class logistic regression. The proposed model is proved to be equivalent to the Bernoulli mixture classifier. In this way, we give a discriminative training framework for Bernoulli mixture models. The proposed discriminative training framework is applied to a well-known Indian digit recognition task.
引用
收藏
页码:558 / 562
页数:5
相关论文
共 10 条
[1]   Databases for recognition of handwritten Arabic cheques [J].
Al-Ohali, Y ;
Cheriet, M ;
Suen, C .
PATTERN RECOGNITION, 2003, 36 (01) :111-121
[2]   GENERALIZED ITERATIVE SCALING FOR LOG-LINEAR MODELS [J].
DARROCH, JN ;
RATCLIFF, D .
ANNALS OF MATHEMATICAL STATISTICS, 1972, 43 (05) :1470-&
[3]   MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J].
DEMPSTER, AP ;
LAIRD, NM ;
RUBIN, DB .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01) :1-38
[4]  
Duda R. O., 1973, Pattern Classification and Scene Analysis, V3
[5]  
Gimenez Adria, 2010, Proceedings 2010 12th International Conference on Frontiers in Handwriting Recognition (ICFHR 2010), P533, DOI 10.1109/ICFHR.2010.88
[6]   A GIS-like training algorithm for log-linear models with bidden variables [J].
Heigold, Georg ;
Deselaers, Thomas ;
Schlueter, Ralf ;
Ney, Hermann .
2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, :4045-4048
[7]  
Juan Alfons., 2004, ICPR
[8]  
Margner V., 2010, Proceedings 2010 12th International Conference on Frontiers in Handwriting Recognition (ICFHR 2010), P709, DOI 10.1109/ICFHR.2010.115
[9]  
RIEDMILLER M, 1993, 1993 IEEE INTERNATIONAL CONFERENCE ON NEURAL NETWORKS, VOLS 1-3, P586, DOI 10.1109/ICNN.1993.298623
[10]  
ROMERO V, 2007, IBPRIA, V4477, P539