Distilling GRU with Data Augmentation for Unconstrained Handwritten Text Recognition

Cited by: 8
Authors
Liu, Manfei [1]
Xie, Zecheng [1]
Huang, YaoXiong [1]
Jin, Lianwen [1]
Zhou, Weiyin [1]
Affiliation
[1] South China Univ Technol, Coll Elect & Informat Engn, Guangzhou, Guangdong, Peoples R China
Source
PROCEEDINGS 2018 16TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR) | 2018
Keywords
unconstrained; text recognition; data augmentation; rnn; ONLINE;
DOI
10.1109/ICFHR-2018.2018.00019
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Handwritten text appears in many styles, such as horizontal, overlapping, vertical, and multi-line text. However, most existing handwriting recognition methods concentrate on only one specific text style. In this paper, we address the new problem of unconstrained handwritten text recognition and propose a distilling gated recurrent unit (GRU), combined with a new data augmentation technique, to model the complex sequential dynamics of unconstrained handwritten text in various styles. The proposed data augmentation method can synthesize realistic handwritten text datasets covering horizontal, vertical, overlap, right-down, screw-rotation, and multi-line situations, which makes our framework robust for general-purpose use. The proposed distilling GRU not only accelerates training through the distilling stage but also maintains the original recognition accuracy. Experiments on our synthesized handwritten test sets show that the proposed multi-layer GRU performs well on the unconstrained handwritten text recognition problem. On the ICDAR2013 handwritten text recognition benchmark dataset, the proposed framework achieves performance comparable to state-of-the-art techniques.
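The layout styles named in the abstract can be illustrated with a minimal sketch. This is not the authors' augmentation method: it assumes each character is given as a list of (x, y) pen points normalized to a unit box, with y growing downward, and it covers only four of the listed styles (screw-rotation and multi-line are omitted). The function name `place_characters` and its parameters are hypothetical.

```python
def place_characters(chars, mode="horizontal", step=1.0):
    """Lay out unit-box characters according to a layout mode (illustrative only).

    chars: list of characters, each a list of (x, y) points in a unit box.
    mode:  "horizontal", "vertical", "overlap", or "right-down".
    step:  distance between the origins of consecutive characters.
    Returns one flat list of (x, y) points for the synthesized text.
    """
    placed = []
    for i, points in enumerate(chars):
        if mode == "horizontal":
            dx, dy = i * step, 0.0        # advance to the right
        elif mode == "vertical":
            dx, dy = 0.0, i * step        # advance downward (y grows downward)
        elif mode == "overlap":
            dx, dy = 0.0, 0.0             # every character written in the same box
        elif mode == "right-down":
            dx, dy = i * step, i * step   # advance along the diagonal
        else:
            raise ValueError(f"unsupported mode: {mode}")
        placed.extend((x + dx, y + dy) for x, y in points)
    return placed


# Two identical toy "characters", each a single diagonal stroke.
toy_chars = [[(0.0, 0.0), (1.0, 1.0)], [(0.0, 0.0), (1.0, 1.0)]]
horizontal_text = place_characters(toy_chars, "horizontal")
overlap_text = place_characters(toy_chars, "overlap")
```

In the horizontal case the second character is shifted one unit to the right, while in the overlap case both characters occupy the same box, which is the kind of input a single-style recognizer would not normally see.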
Pages: 56-61
Page count: 6