Ensemble deep learning model for optical character recognition

被引:2
|
作者
Shetty, Ashish [1 ]
Sharma, Sanjeev [1 ]
机构
[1] Indian Inst Informat Technol, Pune, India
关键词
Character recognition; OCR; Convolution Neural Network; CNN; Deep learning; The Chars74K dataset; Ensemble model;
D O I
10.1007/s11042-023-16018-0
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In modern deep learning, character recognition in images is a very important field of study due to its has many real life applications. The goal of this paper is to create the state-of-the-art character recognition model using a stacking ensemble of convolution neural networks (CNNs).To develop the proposed ensemble model, we evaluated several CNN models. The models were judged on how well they performed on the Chars74k dataset. The dataset contains 74,103 images divided into 62 classes with labels [A-Z], [a-z], and [0-9]. The accuracy distribution based on the dataset's subgroups (uppercase, lowercase, and digit) is shown in results. The proposed ensemble model achieves state-of-the-art performance with a maximum accuracy of 92.31% on complete dataset, 99.22% on Uppercase alphabets, 98.66% on Lowercase alphabets, 99.77% on Digits, 91.97% on Uppercase+Lowercase alphabets. On the complete and partial datasets, a comparison report between the proposed model and other existing approaches is also displayed. A comparative study of the proposed work and the previous methods is also shown in this paper, in order to demonstrate the effectiveness of the proposed work.
引用
收藏
页码:11411 / 11431
页数:21
相关论文
共 50 条
  • [1] Ensemble deep learning model for optical character recognition
    Ashish Shetty
    Sanjeev Sharma
    Multimedia Tools and Applications, 2024, 83 : 11411 - 11431
  • [2] Optical Character Recognition using Deep Learning: An enhanced Approach
    Amara, Marwa
    Zaghdoud, Radhia
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2022, 22 (05): : 545 - 552
  • [3] Optical Character Recognition for Medical Records Digitization with Deep Learning
    Zaryab, Muhammad Ateeque
    Ng, Chuen Rue
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 3260 - 3263
  • [4] Optical Character Recognition using Deep Recurrent Attention Model
    Shaker, Mahmoud
    ElHelw, Mohamed
    PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON ROBOTICS, CONTROL AND AUTOMATION (ICRCA 2017), 2017, : 56 - 59
  • [5] Optical Character Recognition System for Czech Language Using Hierarchical Deep Learning Networks
    Chaudhuri, Arindam
    Ghosh, Soumya K.
    APPLIED COMPUTATIONAL INTELLIGENCE AND MATHEMATICAL METHODS: COMPUTATIONAL METHODS IN SYSTEMS AND SOFTWARE 2017, VOL. 2, 2018, 662 : 114 - 125
  • [6] Character Recognition using Machine Learning and Deep Learning - A Survey
    Sharma, Reya
    Kaushik, Baijnath
    Gondhi, Naveen
    2020 INTERNATIONAL CONFERENCE ON EMERGING SMART COMPUTING AND INFORMATICS (ESCI), 2020, : 341 - 345
  • [7] Deep Learning Based Sinhala Optical Character Recognition (OCR)
    Anuradha, Isuri
    Liyanage, Chamila
    Wijayawardhana, Harsha
    Weerasinghe, Ruvan
    2020 20TH INTERNATIONAL CONFERENCE ON ADVANCES IN ICT FOR EMERGING REGIONS (ICTER-2020), 2020, : 298 - 299
  • [8] Deep Learning for Optical Character Recognition and Its Application to VAT Invoice Recognition
    Wang, Yu
    Gui, Guan
    Zhao, Nan
    Yin, Yue
    Huang, Hao
    Li, Yunyi
    Wang, Jie
    Yang, Jie
    Zhang, Haijun
    COMMUNICATIONS, SIGNAL PROCESSING, AND SYSTEMS, CSPS 2018, VOL III: SYSTEMS, 2020, 517 : 87 - 95
  • [9] Deep Learning Strategy for Braille Character Recognition
    Kausar, Tasleem
    Manzoor, Sajjad
    Kausar, Adeeba
    Lu, Yun
    Wasif, Muhammad
    Ashraf, M. Adnan
    IEEE ACCESS, 2021, 9 : 169357 - 169371
  • [10] Enhanced Ensemble Technique for Optical Character Recognition
    Habeeb, Imad Qasim
    Al-Zaydi, Zeyad Qasim
    Abdulkhudhur, Hanan Najm
    NEW TRENDS IN INFORMATION AND COMMUNICATIONS TECHNOLOGY APPLICATIONS, NTICT 2018, 2018, 938 : 213 - 225