Distilling GRU with Data Augmentation for Unconstrained Handwritten Text Recognition

被引:8
作者
Liu, Manfei [1 ]
Xie, Zecheng [1 ]
Huang, YaoXiong [1 ]
Jin, Lianwen [1 ]
Zhou, Weiyin [1 ]
机构
[1] South China Univ Technol, Coll Elect & Informat Engn, Guangzhou, Guangdong, Peoples R China
来源
PROCEEDINGS 2018 16TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR) | 2018年
关键词
unconstrained; text recognition; data augmentation; rnn; ONLINE;
D O I
10.1109/ICFHR-2018.2018.00019
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Handwritten texts with various styles, such as horizontal, overlapping, vertical, and multi-lines texts, are commonly observed in the community. However, most existing handwriting recognition methods only concentrate on one specific kind of text style. In this paper, we focus on the problem of new unconstrained handwritten text recognition and propose distilling gated recurrent unit (GRU) with a new data augmentation technology to model the complex sequential dynamic of unconstrained handwriting text of various styles. The proposed data augmentation method can synthesize realistic handwritten text datasets including horizontal, vertical, overlap, right-down, screw-rotation, and multi-line situation, which render our framework robust for general purposes. The recommended distilling GRU can not only accelerate the training speed through the distilling stage but also maintain the original recognition accuracy. Experiments on our synthesized handwritten test sets show that the proposed multi-layer GRU performs well on the unconstrained handwriting text recognition problem. On the ICDAR2013 handwritten text recognition benchmark dataset, the proposed framework demonstrates comparable performance with state-of-the-art techniques.
引用
收藏
页码:56 / 61
页数:6
相关论文
共 29 条
[1]   Training an End-to-End System for Handwritten Mathematical Expression Recognition by Generated Patterns [J].
Anh Duc Le ;
Nakagawa, Masaki .
2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1, 2017, :1056-1061
[2]  
Berio D., 2016, SEQUENCE GENERATION
[3]  
Chen K, 2017, PROC INT CONF DOC, P1068, DOI [10.1109/ICCTEC.2017.00234, 10.1109/ICDAR.2017.177]
[4]  
Chollet F., 2015, about us
[5]  
Graves A, 2014, PR MACH LEARN RES, V32, P1764
[6]  
Graves A, 2012, STUD COMPUT INTELL, V385, P5
[7]   A Novel Connectionist System for Unconstrained Handwriting Recognition [J].
Graves, Alex ;
Liwicki, Marcus ;
Fernandez, Santiago ;
Bertolami, Roman ;
Bunke, Horst ;
Schmidhuber, Juergen .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2009, 31 (05) :855-868
[8]  
He P., 2016, READING SCENE TEXT D
[9]   ICDAR 2011 Chinese Handwriting Recognition Competition [J].
Liu, Cheng-Lin ;
Yin, Fei ;
Wang, Qiu-Feng ;
Wang, Da-Han .
11TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2011), 2011, :1464-1469
[10]   CASIA Online and Offline Chinese Handwriting Databases [J].
Liu, Cheng-Lin ;
Yin, Fei ;
Wang, Da-Han ;
Wang, Qiu-Feng .
11TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2011), 2011, :37-41