RNN based online handwritten word recognition in Devanagari and Bengali scripts using horizontal zoning

被引:54
作者
Ghosh, Rajib [1 ]
Vamshi, Chirumavila [1 ]
Kumar, Prabhat [1 ]
机构
[1] Natl Inst Technol Patna, Dept CSE, Patna, Bihar, India
关键词
Online handwriting; Word recognition; Indian scripts; Horizontal zone division; RNN; LSTM; BLSTM;
D O I
10.1016/j.patcog.2019.03.030
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Devanagari and Bengali scripts are two of the most popular scripts in India. Most of the existing word recognition studies in these two scripts have relied upon the widely used Hidden Markov Model (HMM), in spite of its familiar shortcomings. The existing works were evaluated against and performed well in their chosen metrics. But, the existing word recognition systems in these two scripts could not achieve more than 90% recognition accuracy. This article proposes a novel approach for online handwritten cursive and non-cursive word recognition in Devanagari and Bengali scripts based on two recently developed models of Recurrent Neural Network (RNN) Long-Short Term Memory (LSTM) and Bidirectional Long Short Term Memory (BLSTM). The proposed approach divides each word horizontally into three zones upper, middle, and lower, to reduce the variations in basic stroke order within a word. Next, the word portions from middle zone are re-segmented into its basic strokes. Various structural and directional features are then extracted from each basic stroke of the word separately for each zone. These zone wise basic stroke features are then studied using both LSTM and BLSTM versions of RNN. Most of the existing word recognition systems in these two scripts have followed word based class labelling approach, whereas proposed system has followed the basic stroke based class labelling approach. An exhaustive experiment on large datasets has been performed to evaluate the performance of the proposed approach using both RNN and HMM to make a comparative performance analysis. Experimental results show that the proposed RNN based system is superior over HMM achieving 99.50% and 95.24% accuracies in Devanagari and Bengali scripts respectively and outperforms existing HMM based systems in the literature as well. (C) 2019 Elsevier Ltd. All rights reserved.
引用
收藏
页码:203 / 218
页数:16
相关论文
共 28 条
[1]  
[Anonymous], 2006, P 10 INT WORKSH FRON
[2]  
[Anonymous], 2008, P 11 INT C FRON HAND
[3]  
Babu VJ, 2007, PROC INT CONF DOC, P63
[4]  
Baldi P., 2001, Sequence learning. Paradigms, algorithms, and applications (Lecture Notes in Artificial Intelligence Vol.1828), P80
[5]   LEARNING LONG-TERM DEPENDENCIES WITH GRADIENT DESCENT IS DIFFICULT [J].
BENGIO, Y ;
SIMARD, P ;
FRASCONI, P .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1994, 5 (02) :157-166
[6]   HMM-Based Lexicon-Driven and Lexicon-Free Word Recognition for Online Handwritten Indic Scripts [J].
Bharath, A. ;
Madhvanath, Sriganesh .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (04) :670-682
[7]   Representation of online handwriting using multi-component sinusoidal model [J].
Choudhury, Himakshi ;
Prasanna, S. R. Mahadeva .
PATTERN RECOGNITION, 2019, 91 :200-215
[8]  
Fink Gernot A., 2010, Proceedings 2010 12th International Conference on Frontiers in Handwriting Recognition (ICFHR 2010), P393, DOI 10.1109/ICFHR.2010.68
[9]  
Fukada T., 1999, Systems and Computers in Japan, V30, P20, DOI 10.1002/(SICI)1520-684X(199904)30:4<20::AID-SCJ3>3.0.CO
[10]  
2-E