Using generative models for handwritten digit recognition

被引:97
|
作者
Revow, M [1 ]
Williams, CKI [1 ]
Hinton, GE [1 ]
机构
[1] ASTON UNIV, DEPT COMP SCI & APPL MATH, BIRMINGHAM B4 7ET, W MIDLANDS, ENGLAND
关键词
deformable model; elastic net; optical character recognition; generative model; probabilistic model; mixture model;
D O I
10.1109/34.506410
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We describe a method of recognizing handwritten digits by fitting generative models that are built from deformable B-splines with Gaussian ''ink generators'' spaced along the length of the spline. The splines are adjusted using a novel elastic matching procedure based on the Expectation Maximization (EM) algorithm that maximizes the likelihood of the model generating the data. This approach has many advantages. 1) After identifying the model most likely to have generated the data, the system not only produces a classification of the digit but also a rich description of the instantiation parameters which can yield information such as the writing style. 2) During the process of explaining the image, generative models can perform recognition driven segmentation. 3) The method involves a relatively small number or parameters and hence training is relatively easy and fast. 4) Unlike many other recognition schemes, if does not rely on some form of pre-normalization of input images, but can handle arbitrary scalings, translations and a limited degree of image rotation. We have demonstrated our method of fitting models to images does not get trapped in poor local minima. The main disadvantage of the method is it requires much more computation than more standard OCR techniques.
引用
收藏
页码:592 / 606
页数:15
相关论文
共 50 条
  • [31] Automatic recognition of handwritten numerical strings: A recognition and verification strategy
    Oliveira, LS
    Sabourin, R
    Bortolozzi, F
    Suen, CY
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2002, 24 (11) : 1438 - 1454
  • [32] Generative models for functional data using phase and amplitude separation
    Tucker, J. Derek
    Wu, Wei
    Srivastava, Anuj
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2013, 61 : 50 - 66
  • [33] Data augmentation using generative models for track intrusion detection
    Lee, Soohyung
    Kim, Beomseong
    Lee, Heesung
    SCIENCE PROGRESS, 2023, 106 (04)
  • [34] Hindi handwritten character recognition using oriented gradients and Hu-geometric moments
    Yadav, Madhuri
    Purwar, Ravindra Kumar
    JOURNAL OF ELECTRONIC IMAGING, 2018, 27 (05)
  • [35] Recognition of handwritten Lanna Dhamma characters using a set of optimally designed moment features
    Papangkorn Inkeaw
    Phasit Charoenkwan
    Hui-Ling Huang
    Sanparith Marukatat
    Shinn-Ying Ho
    Jeerayut Chaijaruwanich
    International Journal on Document Analysis and Recognition (IJDAR), 2017, 20 : 259 - 274
  • [36] Recognition of handwritten Lanna Dhamma characters using a set of optimally designed moment features
    Inkeaw, Papangkorn
    Charoenkwan, Phasit
    Huang, Hui-Ling
    Marukatat, Sanparith
    Ho, Shinn-Ying
    Chaijaruwanich, Jeerayut
    INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2017, 20 (04) : 259 - 274
  • [37] End-to-End Handwritten Paragraph Text Recognition Using a Vertical Attention Network
    Coquenet, Denis
    Chatelain, Clement
    Paquet, Thierry
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (01) : 508 - 524
  • [38] Recognition of Offline Handwritten Chinese Characters Using the Tesseract Open Source OCR Engine
    Li, Qi
    An, Weihua
    Zhou, Anmi
    Ma, Lehui
    2016 8TH INTERNATIONAL CONFERENCE ON INTELLIGENT HUMAN-MACHINE SYSTEMS AND CYBERNETICS (IHMSC), VOL. 2, 2016, : 452 - 456
  • [39] Offline Arabic Handwritten Text Recognition: A Survey
    Parvez, Mohammad Tanvir
    Mahmoud, Sabri A.
    ACM COMPUTING SURVEYS, 2013, 45 (02)
  • [40] Offline Handwritten Telugu Character Dataset and Recognition
    Negi, Atul
    Rao, Anish M.
    2019 IEEE 16TH INDIA COUNCIL INTERNATIONAL CONFERENCE (IEEE INDICON 2019), 2019,