BWordDeepNet: a novel deep learning architecture for the recognition of online handwritten Bangla words

被引:0
作者
Bhattacharyya, Ankan [1 ]
Chatterjee, Somnath [2 ]
Sen, Shibaprasad [3 ]
Obaidullah, S. K. M. D. [4 ]
Roy, Kaushik [5 ]
机构
[1] Univ Kentucky, Lexington, KY 40506 USA
[2] Future Inst Engn & Management, Kolkata 700150, India
[3] Techno Main Salt Lake, Kolkata 700091, India
[4] Aliah Univ, Kolkata 700156, India
[5] West Bengal State Univ, Kolkata 700126, India
关键词
Online handwriting; BLSTM; LSTM; GRU; BGRU; Constant error flow;
D O I
10.1007/s11042-023-16709-8
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Online handwritten word recognition (OHR) in low-resource languages such as Bangla is still an open problem. Although the need and importance of OHR are increasing nowadays, research works on word-level recognition are few (specifically for Bangla script), and there is a lot of room for improving recognition performance. In the current work, we employed different Recurrent Neural Network (RNN) architectures such as Long Short-Term Memory (LSTM), Bidirectional Long Short-Term Memory (BLSTM), Gated Recurrent Unit (GRU), and Bidirectional Gated Recurrent Unit (BGRU) for the recognition of online handwritten Bangla words written in an unconstrained domain. One of the challenges includes the variable number of strokes used to write words. This study aims to develop a segmentation-free recognition module where the features from constituent strokes of the word sample are fed to the developed RNN architectures. Sequential and dynamic information obtained from the strokes is considered as the features for the current experiment. The customized architecture of BLSTM known as BWordDeepNet (Bangla Word Deep-learning Network) provides the best performance with 98.35% correct recognition accuracy on the dataset having 7992 online handwritten Bangla word samples. Additionally, the model achieves a numerical gain of 8.08% compared to the Bangla word recognition work mentioned in [38] that was performed on the same word dataset containing 5550 word samples. We have also compared the performance of our proposed model with state-of-the-art techniques used for the same purpose.
引用
收藏
页码:45071 / 45093
页数:23
相关论文
共 44 条
  • [1] A METHOD OF RECOGNITION OF ARABIC CURSIVE HANDWRITING
    ALMUALLIM, H
    YAMAGUCHI, S
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1987, 9 (05) : 715 - 722
  • [2] [Anonymous], 2008, P 11 INT C FRONT HAN
  • [3] [Anonymous], 2011, P 2011 JOINT WORKSH
  • [4] Baghshah MSoleymani., 2006, 2 INT C INF COMM TEC, V1, P1878
  • [5] Bai ZL, 2005, PROC INT CONF DOC, P262
  • [6] Beigi HS, 1994, Arabic and other languages with similar writing styles an on-line digit recognizer
  • [7] Bharath A, 2007, PROC INT CONF DOC, P506
  • [8] HMM-Based Lexicon-Driven and Lexicon-Free Word Recognition for Online Handwritten Indic Scripts
    Bharath, A.
    Madhvanath, Sriganesh
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (04) : 670 - 682
  • [9] Stroke-order Normalization for Online Bangla Handwriting Recognition
    Bhattacharya, Nilanj Ana
    Pal, Umapada
    Roy, Partha Pratim
    [J]. 2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1, 2017, : 206 - 211
  • [10] Bhunia AK, 2015, PROC INT CONF DOC, P636, DOI 10.1109/ICDAR.2015.7333839