Online Japanese Handwriting Recognizers using Recurrent Neural Networks

被引:5
作者
Hung Tuan Nguyen [1 ]
Cuong Tuan Nguyen [1 ]
Nakagawa, Masaki [1 ]
机构
[1] Tokyo Univ Agr & Technol, Dept Comp & Informat Sci, 2-24-16 Naka Cho, Koganei, Tokyo 1848588, Japan
来源
PROCEEDINGS 2018 16TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR) | 2018年
关键词
Online handwriting recognition; recurrent neural network; preprocessing; transformations; artificial patterns; RECOGNITION; DATABASE;
D O I
10.1109/ICFHR-2018.2018.00082
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents an attempt to recognize online isolated handwritten Japanese characters as well as text lines by Recurrent Neural Networks (RNNs). Although there are some successful studies on online Chinese handwriting recognition using RNNs, it is difficult to achieve high accuracy for Japanese due to many different types of characters such as kanji and kana. Moreover, training RNNs usually requires a large number of samples for each class of character. Hence, we apply five different transformation operations on the original samples to generate artificial samples with various deformations. For online handwritten Japanese text, we use the Kondate database, but it does not cover the whole Japanese character set. Thus, we generate random text lines using the sentences from corpora and isolated characters from the Nakayosi and Kuchibue databases. Besides, we employ a robust process with some preprocessing steps and the state-of-the-art online features to extract invariant features from handwritten patterns because the handwritten samples are merged from the different databases collected on different devices with various resolutions. For the recognition model, we have implemented the different Bidirectional Long Short-Term Memory Networks (BLSTM networks) for isolated character classification (sequence classification task) and handwritten text recognition (transcription task). The best model of the sequence classification task achieves an accuracy of 97.91% on Nakayosi and 97.74% on Kuchibue. The best model of transcription task performs a character recognition rate of 86.31% on Kondate and 83.15% on the generated text lines.
引用
收藏
页码:435 / 440
页数:6
相关论文
共 20 条
[1]   Development of an efficient neural-based segmentation technique for Arabic handwriting recognition [J].
Al Hamad, Husam A. ;
Abu Zitar, Raed .
PATTERN RECOGNITION, 2010, 43 (08) :2773-2798
[2]  
[Anonymous], 2007, INT C NEUR INF PROC
[3]  
[Anonymous], 1972, Comput. Graph. Image Process., DOI DOI 10.1016/S0146-664X(72)80017-0
[4]   SEGMENTATION-FREE ONLINE ARABIC HANDWRITING RECOGNITION [J].
Biadsy, Fadi ;
Saabni, Raid ;
El-Sana, Jihad .
INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2011, 25 (07) :1009-1033
[5]   Training of an on-line handwritten Japanese character recognizer by artificial patterns [J].
Chen, Bin ;
Zhu, Bilan ;
Nakagawa, Masaki .
PATTERN RECOGNITION LETTERS, 2014, 35 :178-185
[6]   Semi-Incremental Recognition of On-Line Handwritten Japanese Text [J].
Cuong-Tuan Nguyen ;
Zhu, Bilan ;
Nakagawa, Masaki .
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2016, E99D (10) :2619-2628
[7]  
Duchi J, 2011, J MACH LEARN RES, V12, P2121
[8]   A database of unconstrained Vietnamese online handwriting and recognition experiments by recurrent neural networks [J].
Hung Tuan Nguyen ;
Cuong Tuan Nguyen ;
Pham The Bao ;
Nakagawa, Masaki .
PATTERN RECOGNITION, 2018, 78 :291-306
[9]   Online handwriting recognition: The NPen++ recognizer [J].
Jaeger S. ;
Manke S. ;
Reichert J. ;
Waibel A. .
International Journal on Document Analysis and Recognition, 2001, Springer Verlag (03) :169-180
[10]   A learning algorithm for structured character pattern representation used in on-line recognition of handwritten Japanese characters [J].
Kitadai, A ;
Nakagawa, M .
EIGHTH INTERNATIONAL WORKSHOP ON FRONTIERS IN HANDWRITING RECOGNITION: PROCEEDINGS, 2002, :163-168