A Study of Data Augmentation for Handwritten Character Recognition Using Deep Learning

被引:9
作者
Hayashi, Taihei [1 ]
Gyohten, Keiji [1 ]
Ohki, Hidehiro [1 ]
Takami, Toshiya [1 ]
机构
[1] Oita Univ, Fac Sci & Technol, Oita, Japan
来源
PROCEEDINGS 2018 16TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR) | 2018年
关键词
character recognition; deep learning; data augmentation; statistical character structure model;
D O I
10.1109/ICFHR-2018.2018.00102
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
While convolutional neural networks have made significant achievements in the field of handwriting recognition in recent years, large amounts of training data are required to obtain satisfactory results. To prepare large amounts of image data for training without increased labor, there is a way of increasing the number of images by applying general image processing methods, so-called data augmentation. However, it is difficult to generate character images like those written by different people and to overcome the problems related to the lack of training data by using conventional data augmentation methods. In this paper, we propose a method of acquiring the probability distribution of the features related to the character structure and generating character images of various handwritings using the probability distribution. The proposed method obtains statistical character structure models composed of probability distributions of strokes by learning from character image data. By generating strokes based on the probability distribution of each stroke and assembling them into a character, it becomes possible to generate character images of various handwriting samples not influenced by the original images. In the comparative experiments of handwritten character recognition with a convolutional neural network, good results could be obtained using not only conventional data augmentation methods but also the proposed method together.
引用
收藏
页码:552 / 557
页数:6
相关论文
共 8 条
[1]   Handwritten Numeral Databases of Indian Scripts and Multistage Recognition of Mixed Numerals [J].
Bhattacharya, Ujjwal ;
Chaudhuri, B. B. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2009, 31 (03) :444-457
[2]  
Dai R.-W., 2007, COMPUTER SCI, V1, P126
[3]   Statistical character structure modeling and its application to handwritten Chinese character recognition [J].
Kim, IJ ;
Kim, JH .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2003, 25 (11) :1422-1436
[4]  
Miyazaki Tomo, 2017, ARXIV170105703 CORR
[5]  
Nychka D., 2003, S BOULDER GUIDE SPAT
[6]   Comprehensive synthetic Arabic database for on/off-line script recognition research [J].
Saabni, Raid M. ;
El-Sana, Jihad A. .
INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2013, 16 (03) :285-294
[7]  
Wan L, 2013, P 30 INT C MACH LEAR, P1058
[8]   Online and offline handwritten Chinese character recognition: A comprehensive study and new benchmark [J].
Zhang, Xu-Yao ;
Bengio, Yoshua ;
Liu, Cheng-Lin .
PATTERN RECOGNITION, 2017, 61 :348-360