Handwritten Data Digitization Using an Anchor based Multi-Channel CNN (MCCNN) Trained on a Hybrid Dataset (h-EH)

被引:5
作者
Chiney, Abhinandan [1 ,2 ]
Paduri, Anwesh Reddy [1 ]
Darapaneni, Narayana [1 ,3 ]
Kulkarni, Santosh [1 ,3 ]
Kadam, Manish [1 ,2 ]
Kohli, Ishan [1 ,4 ]
Subramaniyan, Malarvizhi [1 ]
机构
[1] Great Learning, Pune, Maharashtra, India
[2] Sandvik Mat Technol India Pvt Ltd, Pune 411012, Maharashtra, India
[3] Northwestern Univ, Evanston, IL 60208 USA
[4] Avaya India Pvt Ltd, Pune 411013, Maharashtra, India
来源
AI IN COMPUTATIONAL LINGUISTICS | 2021年 / 189卷
关键词
Hand Written Data Digitization; Anchor based Multi-Channel CNN; OCR; Image Processing; NLP; Language Science; CLASSIFICATION; TEXT;
D O I
10.1016/j.procs.2021.05.095
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
To develop a holistic system for handwritten English character recognition for manually filled forms by systematically synthesising a robust handwritten textual character dataset for acceptable representation of handwriting. As part of this study, 572 copies of a form were filled by over 200 different individuals to introduce demographic variation. These forms were then scanned and each handwritten character in the forms was labelled and extracted using standard image processing techniques. The dataset of 84, 712 character images created by this method (HW-dataset) comprised of both alphabetical and numerical characters. Three hybrid datasets (h-EH) were then formed by combining EMNIST datasets and the HW-dataset based on Digits (h-EHd - 329, 668 character images), Alphabets (h-EHa - 163, 085 character images) and a mixture of Digits and Alphabets (h-EHm - 189, 586 character images). An anchor based image extraction technique was used in conjunction with a Multi-Channel CNN (MCCNN) model which was trained on three versions of h-EH, to automate the process of digitization of handwritten forms. The classification accuracies of the MCCNN for h-EHa, h-EHd and h-EHm are 93%, 96% and 93% respectively for test data. Models trained on only the EMNIST dataset perform poorly on test data. An anchor based object detection method used in conjunction with MCCNN trained on h-EH produces excellent results in digitising hand filled forms. Touch free solutions will gain prevalence due to the emergence of threat of fomites in the world. In such a space, manual handling of forms for the purpose of data entry, digitization and information handling will be considered as potential health and safety hazards. The solution presented in the current work uses a combination of models which is trained on a hybrid handwritten data set with high demographic variability. The model developed as part of this study is well suited for enabling touch free handling of documents. (C) 2021 The Authors. Published by Elsevier B.V.
引用
收藏
页码:175 / 182
页数:8
相关论文
共 15 条
  • [1] [Anonymous], HDB BRAIN THEORY NEU
  • [2] Bradski G, 2000, DR DOBBS J, V25, P120
  • [3] High-Performance OCR for Printed English and Fraktur using LSTM Networks
    Breuel, Thomas M.
    Ul-Hasan, Adnan
    Al Azawi, Mayce
    Shafait, Faisal
    [J]. 2013 12TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2013, : 683 - 687
  • [4] Chen L, 2015, PROC INT CONF DOC, P431, DOI 10.1109/ICDAR.2015.7333798
  • [5] Cohen G., 2017, EMNIST EXTENSION MNI
  • [6] Darmatasia, 2017, 2017 5 INT C INF COM, P1
  • [7] Improving text classification with weighted word embeddings via a multi-channel TextCNN model
    Guo, Bao
    Zhang, Chunxia
    Liu, Junmin
    Ma, Xiaoyi
    [J]. NEUROCOMPUTING, 2019, 363 : 366 - 374
  • [8] Hussain J., 2018, 2018 INT C ADV COMP, P1
  • [9] Islam Md Nafee Al, 2019, ARXIV190900823 ARXIV190900823
  • [10] Jindal A, 2014, IEEE INT ADV COMPUT, P1028, DOI 10.1109/IAdCC.2014.6779466