Improved Handwritten Digit Recognition Using Convolutional Neural Networks (CNN)

被引:96
作者
Ahlawat, Savita [1 ]
Choudhary, Amit [2 ]
Nayyar, Anand [3 ]
Singh, Saurabh [4 ]
Yoon, Byungun [4 ]
机构
[1] Maharaja Surajmal Inst Technol, Dept Comp Sci & Engn, New Delhi 110058, India
[2] Maharaja Surajmal Inst, Dept Comp Sci, New Delhi 110058, India
[3] Duy Tan Univ, Grad Sch, Da Nang 550000, Vietnam
[4] Dongguk Univ, Dept Ind & Syst Engn, Seoul 04620, South Korea
关键词
convolutional neural networks; handwritten digit recognition; pre-processing; OCR; FEATURES; OPTIMIZATION; CLASSIFIER; EXTRACTION; ENSEMBLES; SEQUENCE; ONLINE;
D O I
10.3390/s20123344
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Traditional systems of handwriting recognition have relied on handcrafted features and a large amount of prior knowledge. Training an Optical character recognition (OCR) system based on these prerequisites is a challenging task. Research in the handwriting recognition field is focused around deep learning techniques and has achieved breakthrough performance in the last few years. Still, the rapid growth in the amount of handwritten data and the availability of massive processing power demands improvement in recognition accuracy and deserves further investigation. Convolutional neural networks (CNNs) are very effective in perceiving the structure of handwritten characters/words in ways that help in automatic extraction of distinct features and make CNN the most suitable approach for solving handwriting recognition problems. Our aim in the proposed work is to explore the various design options like number of layers, stride size, receptive field, kernel size, padding and dilution for CNN-based handwritten digit recognition. In addition, we aim to evaluate various SGD optimization algorithms in improving the performance of handwritten digit recognition. A network's recognition accuracy increases by incorporating ensemble architecture. Here, our objective is to achieve comparable accuracy by using a pure CNN architecture without ensemble architecture, as ensemble architectures introduce increased computational cost and high testing complexity. Thus, a CNN architecture is proposed in order to achieve accuracy even better than that of ensemble architectures, along with reduced operational complexity and cost. Moreover, we also present an appropriate combination of learning parameters in designing a CNN that leads us to reach a new absolute record in classifying MNIST handwritten digits. We carried out extensive experiments and achieved a recognition accuracy of 99.87% for a MNIST dataset.
引用
收藏
页码:1 / 18
页数:18
相关论文
共 79 条
  • [21] Cost-conscious classifier ensembles
    Demir, C
    Alpaydin, E
    [J]. PATTERN RECOGNITION LETTERS, 2005, 26 (14) : 2206 - 2214
  • [22] Dewan S., 2012, P INT C NEUR INF PRO
  • [23] Dietterich T. G., 1995, Journal of Artificial Intelligence Research, V2, P263
  • [24] Do C.B., 2009, P ICML MONTR QC CAN
  • [25] Duchi J, 2011, J MACH LEARN RES, V12, P2121
  • [26] Seeing it all: Convolutional network layers map the function of the human visual system
    Eickenberg, Michael
    Gramfort, Alexandre
    Varoquaux, Gael
    Thirion, Bertrand
    [J]. NEUROIMAGE, 2017, 152 : 184 - 194
  • [27] Fergus L, 2013, PROC 30 INT C INT C, P1058
  • [29] Multiobjective optimization for recognition of isolated handwritten Indic scripts
    Gupta, Anisha
    Sarkhel, Ritesh
    Das, Nibaran
    Kundu, Mahantapas
    [J]. PATTERN RECOGNITION LETTERS, 2019, 128 : 318 - 325
  • [30] He K., 2016, PROC CVPR IEEE, DOI DOI 10.1109/CVPR.2016.90