Using generative models for handwritten digit recognition

被引:97
|
作者
Revow, M [1 ]
Williams, CKI [1 ]
Hinton, GE [1 ]
机构
[1] ASTON UNIV, DEPT COMP SCI & APPL MATH, BIRMINGHAM B4 7ET, W MIDLANDS, ENGLAND
关键词
deformable model; elastic net; optical character recognition; generative model; probabilistic model; mixture model;
D O I
10.1109/34.506410
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We describe a method of recognizing handwritten digits by fitting generative models that are built from deformable B-splines with Gaussian ''ink generators'' spaced along the length of the spline. The splines are adjusted using a novel elastic matching procedure based on the Expectation Maximization (EM) algorithm that maximizes the likelihood of the model generating the data. This approach has many advantages. 1) After identifying the model most likely to have generated the data, the system not only produces a classification of the digit but also a rich description of the instantiation parameters which can yield information such as the writing style. 2) During the process of explaining the image, generative models can perform recognition driven segmentation. 3) The method involves a relatively small number or parameters and hence training is relatively easy and fast. 4) Unlike many other recognition schemes, if does not rely on some form of pre-normalization of input images, but can handle arbitrary scalings, translations and a limited degree of image rotation. We have demonstrated our method of fitting models to images does not get trapped in poor local minima. The main disadvantage of the method is it requires much more computation than more standard OCR techniques.
引用
收藏
页码:592 / 606
页数:15
相关论文
共 50 条
  • [42] Segmentation Techniques for Handwritten script Recognition System
    Mathew, Jibu C.
    Shinde, Ravi C.
    Patil, C. Y.
    2015 INTERNATIONAL CONFERENCED ON CIRCUITS, POWER AND COMPUTING TECHNOLOGIES (ICCPCT-2015), 2015,
  • [43] Advances in online handwritten recognition in the last decades
    Ghosh, Trishita
    Sen, Shibaprasad
    Obaidullah, Sk. Md.
    Santosh, K. C.
    Roy, Kaushik
    Pal, Umapada
    COMPUTER SCIENCE REVIEW, 2022, 46
  • [44] A comprehensive survey on Bangla handwritten numeral recognition
    Singh, Pawan Kumar
    Sarkar, Ram
    Nasipuri, Mita
    INTERNATIONAL JOURNAL OF APPLIED PATTERN RECOGNITION, 2018, 5 (01) : 55 - 71
  • [45] Devanagari Offline Handwritten Numeral and Character Recognition using Multiple Features and Neural Network Classifier
    Dongre, Vikas J.
    Mankar, Vijay H.
    2015 2ND INTERNATIONAL CONFERENCE ON COMPUTING FOR SUSTAINABLE GLOBAL DEVELOPMENT (INDIACOM), 2015, : 425 - 431
  • [46] Handwritten Arabic Character Recognition for Children Writing Using Convolutional Neural Network and Stroke Identification
    Mais Alheraki
    Rawan Al-Matham
    Hend Al-Khalifa
    Human-Centric Intelligent Systems, 2023, 3 (2): : 147 - 159
  • [47] New Methods of Designing Stamping Dies Assemblies by Using Generative Models
    Skarka, Wojciech
    Neumann, Tomasz
    TRANSDISCIPLINARY ENGINEERING: A PARADIGM SHIFT, 2017, 5 : 456 - 463
  • [48] Recovering compressed images for automatic crack segmentation using generative models
    Huang, Yong
    Zhang, Haoyu
    Li, Hui
    Wu, Stephen
    MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2021, 146
  • [49] A scarce dataset for ancient Arabic handwritten text recognition
    Najam, Rayyan
    Faizullah, Safiullah
    DATA IN BRIEF, 2024, 56
  • [50] Offline Handwritten Script Recognition Based on Texture Descriptors
    Roberto e Souza, Marcos
    Bertolini, Diego
    Pedrini, Helio
    Costa, Yandre M. G.
    PROCEEDINGS OF 2019 INTERNATIONAL CONFERENCE ON SYSTEMS, SIGNALS AND IMAGE PROCESSING (IWSSIP 2019), 2019, : 57 - 62