Improved Handwritten Digit Recognition Using Convolutional Neural Networks (CNN)

被引:96
作者
Ahlawat, Savita [1 ]
Choudhary, Amit [2 ]
Nayyar, Anand [3 ]
Singh, Saurabh [4 ]
Yoon, Byungun [4 ]
机构
[1] Maharaja Surajmal Inst Technol, Dept Comp Sci & Engn, New Delhi 110058, India
[2] Maharaja Surajmal Inst, Dept Comp Sci, New Delhi 110058, India
[3] Duy Tan Univ, Grad Sch, Da Nang 550000, Vietnam
[4] Dongguk Univ, Dept Ind & Syst Engn, Seoul 04620, South Korea
关键词
convolutional neural networks; handwritten digit recognition; pre-processing; OCR; FEATURES; OPTIMIZATION; CLASSIFIER; EXTRACTION; ENSEMBLES; SEQUENCE; ONLINE;
D O I
10.3390/s20123344
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Traditional systems of handwriting recognition have relied on handcrafted features and a large amount of prior knowledge. Training an Optical character recognition (OCR) system based on these prerequisites is a challenging task. Research in the handwriting recognition field is focused around deep learning techniques and has achieved breakthrough performance in the last few years. Still, the rapid growth in the amount of handwritten data and the availability of massive processing power demands improvement in recognition accuracy and deserves further investigation. Convolutional neural networks (CNNs) are very effective in perceiving the structure of handwritten characters/words in ways that help in automatic extraction of distinct features and make CNN the most suitable approach for solving handwriting recognition problems. Our aim in the proposed work is to explore the various design options like number of layers, stride size, receptive field, kernel size, padding and dilution for CNN-based handwritten digit recognition. In addition, we aim to evaluate various SGD optimization algorithms in improving the performance of handwritten digit recognition. A network's recognition accuracy increases by incorporating ensemble architecture. Here, our objective is to achieve comparable accuracy by using a pure CNN architecture without ensemble architecture, as ensemble architectures introduce increased computational cost and high testing complexity. Thus, a CNN architecture is proposed in order to achieve accuracy even better than that of ensemble architectures, along with reduced operational complexity and cost. Moreover, we also present an appropriate combination of learning parameters in designing a CNN that leads us to reach a new absolute record in classifying MNIST handwritten digits. We carried out extensive experiments and achieved a recognition accuracy of 99.87% for a MNIST dataset.
引用
收藏
页码:1 / 18
页数:18
相关论文
共 79 条
  • [1] Ahlawat S., 2018, RECENT PAT COMPUT SC, V12, P304, DOI [10.2174/2213275911666181120111342, DOI 10.2174/2213275911666181120111342]
  • [2] On building ensembles of stacked denoising auto-encoding classifiers and their further improvement
    Alvear-Sandoval, Ricardo F.
    Figueiras-Vidal, Anibal R.
    [J]. INFORMATION FUSION, 2018, 39 : 41 - 52
  • [3] Machine Learning from Theory to Algorithms: An Overview
    Alzubi, Jafar
    Nayyar, Anand
    Kumar, Akshi
    [J]. SECOND NATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE (NCCI 2018), 2018, 1142
  • [4] [Anonymous], 2014, P 14 INT C FRONT HAN
  • [5] [Anonymous], 2017, IEEE T NEURAL NETWOR
  • [6] [Anonymous], 2011, CORR
  • [7] SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation
    Badrinarayanan, Vijay
    Kendall, Alex
    Cipolla, Roberto
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) : 2481 - 2495
  • [8] Bartlett P., 2008, P NIPS VANC BC CAN 8
  • [9] Handwritten Urdu character recognition using one-dimensional BLSTM classifier
    Bin Ahmed, Saad
    Naz, Saeeda
    Swati, Salahuddin
    Razzak, Muhammad Imran
    [J]. NEURAL COMPUTING & APPLICATIONS, 2019, 31 (04) : 1143 - 1151
  • [10] Investigation on deep learning for off-line handwritten Arabic character recognition
    Boufenar, Chaouki
    Kerboua, Adlen
    Batouche, Mohamed
    [J]. COGNITIVE SYSTEMS RESEARCH, 2018, 50 : 180 - 195