Robust Character Recognition For Optical And Natural Images Using Deep Learning

被引:0
作者
Abdali, Al Maamoon Rasool [1 ]
Ghani, Rana Fareed [1 ]
机构
[1] Univ Technol Baghdad, Minist Higher Educ, Comp Sci, Baghdad, Iraq
来源
2019 17TH IEEE STUDENT CONFERENCE ON RESEARCH AND DEVELOPMENT (SCORED) | 2019年
关键词
EMINST; convolutional neural network; IC-DAR2003; Char74k; OCR;
D O I
10.1109/scored.2019.8896354
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Character recognition is one of the most critical parts of many computer vision system. And many studies have explored character recognition as two subcategories: character recognition in optical images, character recognition natural images while this separation was to achieve high accuracy in each field separated but it needs more hardware resources to operate two models one for each area. In addition to that most researches divided each field into (digits recognition, character recognition) that add extra cost in both time and hardware resources also we found that both areas still have room for accuracy improvement. This paper tackles the problem of the two subclasses by building one robust, accurate classifier using a convolutional neural network that can recognize (characters and digits) accurately in both optical and natural scene images, the proposed model has been trained on a combination of EMNIST and Char74k data sets with a random data augmentation. The proposed model achieved 92% accuracy in EMINST compared to previous works shows that the proposed model has the highest accuracy among all the previous works based on EMNIST data set. We also tested the model on none-seen data sets (ICDAR203) and the obtained results indicate the high generality and the robustness of the classifier.
引用
收藏
页码:152 / 156
页数:5
相关论文
共 50 条
[41]   Quranic Script Optical Text Recognition Using Deep Learning in IoT Systems [J].
Badry, Mahmoud ;
Hassanin, Mohammed ;
Chandio, Asghar ;
Moustafa, Nour .
CMC-COMPUTERS MATERIALS & CONTINUA, 2021, 68 (02) :1847-1858
[42]   Optical character recognition (OCR) in uncontrolled environments using optical correlators [J].
Morin, A ;
Bergeron, A ;
Prévost, D ;
Radloff, E .
OPTICAL PATTERN RECOGNITION X, 1999, 3715 :346-356
[43]   YinYang, a Fast and Robust Adaptive Document Image Binarization for Optical Character Recognition [J].
Bloechle, Jean-Luc ;
Hennebert, Jean ;
Gisler, Christophe .
PROCEEDINGS OF THE 2023 ACM SYMPOSIUM ON DOCUMENT ENGINEERING, DOCENG 2023, 2023,
[44]   Optical Character Recognition System for Seven Segment Display Images of Measuring Instruments [J].
Ghugardare, Rakhi P. ;
Narote, Sandip P. ;
Mukherji, P. ;
Kulkarni, Prathamesh M. .
TENCON 2009 - 2009 IEEE REGION 10 CONFERENCE, VOLS 1-4, 2009, :1306-+
[45]   Blurred License Plate Character Recognition Algorithm Based on Deep Learning [J].
Zhang Caizhen ;
Li Ying ;
Kang binlong ;
Chang yuan .
LASER & OPTOELECTRONICS PROGRESS, 2021, 58 (16)
[46]   Handwritten isolated Bangla compound character recognition: A new benchmark using a novel deep learning approach [J].
Roy, Saikat ;
Das, Nibaran ;
Kundu, Mahantapas ;
Nasipuri, Mita .
PATTERN RECOGNITION LETTERS, 2017, 90 :15-21
[47]   Robust mosquito species identification from diverse body and wing images using deep learning [J].
Nolte, Kristopher ;
Sauer, Felix Gregor ;
Baumbach, Jan ;
Kollmannsberger, Philip ;
Lins, Christian ;
Luehken, Renke .
PARASITES & VECTORS, 2024, 17 (01)
[48]   An Arabic optical character recognition system using recognition-based segmentation [J].
Cheung, A ;
Bennamoun, M ;
Bergmann, NW .
PATTERN RECOGNITION, 2001, 34 (02) :215-233
[49]   To Combine or Not to Combine? The Influence of Combining Training Datasets on the Robustness of Deep Learning Models: An Analysis for Optical Character Recognition of Handwriting [J].
Fischer-Brandies, Leopold ;
Mueller, Lucas ;
Rebholz, Benjamin ;
Buettner, Ricardo .
IEEE ACCESS, 2025, 13 :59039-59056
[50]   Handwritten Character Recognition from Images using CNN-ECOC [J].
Bora, Mayur Bhargab ;
Daimary, Dinthisrang ;
Amitab, Khwairakpam ;
Kandar, Debdatta .
INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND DATA SCIENCE, 2020, 167 :2403-2409