High Performance Offline Handwritten Chinese Character Recognition Using GoogLeNet and Directional Feature Maps

被引:0
作者
Zhong, Zhuoyao [1 ]
Jin, Lianwen [1 ]
Xie, Zecheng [1 ]
机构
[1] South China Univ Technol, Sch Elect & Informat Engn, Guangzhou, Guangdong, Peoples R China
来源
2015 13TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR) | 2015年
关键词
Deep learning; convolutional neural networks; classifier ensemble; handwritten Chinese character recognition;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Just like its great success in solving many computer vision problems, the convolutional neural networks (CNN) provided new end-to-end approach to handwritten Chinese character recognition (HCCR) with very promising results in recent years. However, previous CNNs so far proposed for HCCR were neither deep enough nor slim enough. We show in this paper that, a deeper architecture can benefit HCCR a lot to achieve higher performance, meanwhile can be designed with less parameters. We also show that the traditional feature extraction methods, such as Gabor or gradient feature maps, are still useful for enhancing the performance of CNN. We design a streamlined version of GoogLeNet 1131, which was original proposed for image classification in recent years with very deep architecture, for HCCR (denoted as HCCR-GoogLeNet). The HCCR-GoogLeNet we used is 19 layers deep but involves with only 7.26 million parameters. Experiments were conducted using the ICDAR 2013 offline HCCR competition dataset. It has been shown that with the proper incorporation with traditional directional feature maps, the proposed single and ensemble HCCR-GoogLeNet models achieve new state of the art recognition accuracy of 96.35% and 96.74%, respectively, outperforming previous best result with significant gap.
引用
收藏
页码:846 / 850
页数:5
相关论文
共 22 条
[1]  
[Anonymous], ICDAR 2013
[2]  
[Anonymous], 1989, P C ADV NEUR INF PRO
[3]  
[Anonymous], ICFHR
[4]  
[Anonymous], ICDAR 2011
[5]  
Ciresan D., 2013, MULTICOLUMN DEEP NEU
[6]  
Ciresan D., ICDAR 2011
[7]  
Ciresan D.C., CVPR 2012
[8]   Histograms of oriented gradients for human detection [J].
Dalal, N ;
Triggs, B .
2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, :886-893
[9]   COMPLETE DISCRETE 2-D GABOR TRANSFORMS BY NEURAL NETWORKS FOR IMAGE-ANALYSIS AND COMPRESSION [J].
DAUGMAN, JG .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1988, 36 (07) :1169-1179
[10]   Reducing the dimensionality of data with neural networks [J].
Hinton, G. E. ;
Salakhutdinov, R. R. .
SCIENCE, 2006, 313 (5786) :504-507