Improved Handwritten Digit Recognition Using Convolutional Neural Networks (CNN)

Cited by: 103
Authors
Ahlawat, Savita [1]
Choudhary, Amit [2]
Nayyar, Anand [3]
Singh, Saurabh [4]
Yoon, Byungun [4]
Affiliations
[1] Maharaja Surajmal Inst Technol, Dept Comp Sci & Engn, New Delhi 110058, India
[2] Maharaja Surajmal Inst, Dept Comp Sci, New Delhi 110058, India
[3] Duy Tan Univ, Grad Sch, Da Nang 550000, Vietnam
[4] Dongguk Univ, Dept Ind & Syst Engn, Seoul 04620, South Korea
Keywords
convolutional neural networks; handwritten digit recognition; pre-processing; OCR; FEATURES; OPTIMIZATION; CLASSIFIER; EXTRACTION; ENSEMBLES; SEQUENCE; ONLINE;
DOI
10.3390/s20123344
Chinese Library Classification number
O65 [Analytical Chemistry];
Discipline classification code
070302 ; 081704 ;
Abstract
Traditional handwriting recognition systems have relied on handcrafted features and a large amount of prior knowledge, and training an optical character recognition (OCR) system on these prerequisites is challenging. Research in handwriting recognition now centers on deep learning techniques, which have achieved breakthrough performance in the last few years. Still, the rapid growth in the amount of handwritten data and the availability of massive processing power demand improved recognition accuracy and deserve further investigation. Convolutional neural networks (CNNs) are very effective at perceiving the structure of handwritten characters and words in ways that help extract distinct features automatically, which makes CNNs the most suitable approach for handwriting recognition problems. Our aim in the proposed work is to explore design options such as the number of layers, stride size, receptive field, kernel size, padding, and dilation for CNN-based handwritten digit recognition. In addition, we aim to evaluate various SGD optimization algorithms for improving the performance of handwritten digit recognition. Incorporating an ensemble architecture increases a network's recognition accuracy, but it also introduces increased computational cost and high testing complexity. Our objective is therefore to achieve comparable accuracy with a pure CNN architecture, without an ensemble. Thus, a CNN architecture is proposed to achieve accuracy even better than that of ensemble architectures, along with reduced operational complexity and cost. Moreover, we present an appropriate combination of learning parameters for designing a CNN that leads us to a new absolute record in classifying MNIST handwritten digits. We carried out extensive experiments and achieved a recognition accuracy of 99.87% on the MNIST dataset.
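The design options named in the abstract (kernel size, stride, padding, dilation, number of layers) interact through simple arithmetic. As a minimal sketch, and not the authors' code, the Python helpers below compute the spatial output size of a single convolution and the receptive field of a stack of convolutions using the standard formulas. The function names and the layer settings in the usage lines are illustrative assumptions; they are not the architecture proposed in the paper. Only the 28-pixel input width reflects MNIST.

```python
def conv_output_size(n, kernel, stride=1, padding=0, dilation=1):
    """Spatial output size of one convolution along a single dimension.

    Uses the standard formula with floor division:
    out = floor((n + 2*padding - dilation*(kernel - 1) - 1) / stride) + 1
    """
    effective_kernel = dilation * (kernel - 1) + 1  # span covered by a dilated kernel
    return (n + 2 * padding - effective_kernel) // stride + 1


def receptive_field(layers):
    """Receptive field of the last layer of a stack of convolutions.

    `layers` is a sequence of (kernel, stride, dilation) tuples, listed
    from the input side. `jump` tracks how many input pixels separate
    two adjacent units of the current layer.
    """
    rf, jump = 1, 1
    for kernel, stride, dilation in layers:
        rf += (kernel - 1) * dilation * jump
        jump *= stride
    return rf


if __name__ == "__main__":
    # Hypothetical settings applied to a 28-pixel-wide MNIST image.
    print(conv_output_size(28, kernel=5))                       # no padding, stride 1
    print(conv_output_size(28, kernel=3, stride=2, padding=1))  # strided, padded
    print(conv_output_size(28, kernel=3, dilation=2))           # dilated kernel
    print(receptive_field([(5, 1, 1), (5, 1, 1)]))              # two stacked 5x5 layers
```

Read this way, a dilated 3x3 kernel with dilation 2 covers the same 5-pixel span as a dense 5x5 kernel while using fewer weights, which is one reason dilation appears alongside kernel size and stride as a design option.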
Pages: 1-18
Number of pages: 18