Enhancement of handwritten text recognition using AI-based hybrid approach

被引:1
作者
Mahadevkar, Supriya [1 ]
Patil, Shruti [2 ]
Kotecha, Ketan [2 ]
机构
[1] Symbiosis Int Deemed Univ, Symbiosis Inst Technol, Pune 412115, India
[2] Symbiosis Int Deemed Univ, Symbiosis Inst Technol, Symbiosis Ctr Appl Artificial Intelligence SCAAI, Pune 412115, India
关键词
Handwritten text recognition; Long short-term memory (LSTM); Connectionist temporal classification (CTC); Machine learning; Deep learning; Etc;
D O I
10.1016/j.mex.2024.102654
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Handwritten text recognition (HTR) within computer vision and image processing stands as a prominent and challenging research domain, holding significant implications for diverse applications. Among these, it finds usefulness in reading bank checks, prescriptions, and deciphering characters on various forms. Optical character recognition (OCR) technology, specifically tailored for handwritten documents, plays a pivotal role in translating characters from a range of file formats, encompassing both word and image documents. Challenges in HTR encompass intricate layout designs, varied handwriting styles, limited datasets, and less accuracy achieved. Recent advancements in Deep Learning and Machine Learning algorithms, coupled with the vast repositories of unprocessed data, have propelled researchers to achieve remarkable progress in HTR. This paper aims to address the challenges in handwritten text recognition by proposing a hybrid approach. The primary objective is to enhance the accuracy of recognizing handwritten text from images. Through the integration of Convolutional Neural Networks (CNN) and Bidirectional Long Short -Term Memory (BiLSTM) with a Connectionist Temporal Classification (CTC) decoder, the results indicate substantial improvement. The proposed hybrid model achieved an impressive 98.50% and 98.80% accuracy on the IAM and RIMES datasets, respectively. This underscores the potential and efficacy of the consecutive use of these advanced neural network architectures in enhancing handwritten text recognition accuracy. center dot The proposed method introduces a hybrid approach for handwritten text recognition, employing CNN and BiLSTM with CTC decoder. center dot Results showcase a remarkable accuracy improvement of 98.50% and 98.80% on IAM and RIMES datasets, emphasizing the potential of this model for enhanced accuracy in recognizing handwritten text from images.
引用
收藏
页数:11
相关论文
共 10 条
  • [1] Coquenet D, 2021, Arxiv, DOI arXiv:2012.03868
  • [2] Hemanth G. R., 2021, ICTACT Journal on Soft Computing, P2457
  • [3] A Review on Machine Learning Styles in Computer Vision-Techniques and Future Directions
    Mahadevkar, Supriya, V
    Khemani, Bharti
    Patil, Shruti
    Kotecha, Ketan
    Vora, Deepali R.
    Abraham, Ajith
    Gabralla, Lubna Abdelkareim
    [J]. IEEE ACCESS, 2022, 10 : 107293 - 107329
  • [4] Enhanced Attention-Based Encoder-Decoder Framework for Text Recognition
    Prabu, S.
    Sundar, K. Joseph Abraham
    [J]. INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2023, 35 (02) : 2071 - 2086
  • [5] Handwritten optical character recognition by hybrid neural network training algorithm
    Sampath, A. K.
    Gomathi, N.
    [J]. IMAGING SCIENCE JOURNAL, 2019, 67 (07) : 359 - 373
  • [6] CNN-RNN based method for license plate recognition
    Shivakumara, Palaiahnakote
    Tang, Dongqi
    Asadzadehkaljahi, Maryam
    Lu, Tong
    Pal, Umapada
    Anisi, Mohammad Hossein
    [J]. CAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY, 2018, 3 (03) : 169 - 175
  • [7] Offline continuous handwriting recognition using sequence to sequence neural networks
    Sueiras, Jorge
    Ruiz, Victoria
    Sanchez, Angel
    Velez, Jose F.
    [J]. NEUROCOMPUTING, 2018, 289 : 119 - 128
  • [8] Tensmeyer C., 2019, ICDAR, P1
  • [9] A New Deep Learning-Based Handwritten Character Recognition System on Mobile Computing Devices
    Weng, Yu
    Xia, Chunlei
    [J]. MOBILE NETWORKS & APPLICATIONS, 2020, 25 (02) : 402 - 411
  • [10] Start, Follow, Read: End-to-End Full-Page Handwriting Recognition
    Wigington, Curtis
    Tensmeyer, Chris
    Davis, Brian
    Barrett, William
    Price, Brian
    Cohen, Scott
    [J]. COMPUTER VISION - ECCV 2018, PT VI, 2018, 11210 : 372 - 388