Building efficient CNN architecture for offline handwritten Chinese character recognition

Cited by: 33
Authors
Li, Zhiyuan [1, 2, 3]
Teng, Nanjun [1, 2, 3]
Jin, Min [1, 2, 3]
Lu, Huaxiang [1, 2, 3, 4]
Affiliations
[1] Chinese Acad Sci, Inst Semicond, Lab High Speed Circuit & Neural Networks, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Beijing, Peoples R China
[3] Beijing Key Lab Semicond Neural Network Intellige, Beijing, Peoples R China
[4] Chinese Acad Sci, Ctr Excellence Brain Sci & Intelligence Technol, Beijing, Peoples R China
Keywords
Handwritten Chinese character recognition; Convolutional neural networks; Cascaded CNN; ONLINE;
DOI
10.1007/s10032-018-0311-4
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Methods based on deep convolutional neural networks (CNNs) have brought great breakthroughs in image classification and provide an end-to-end solution to the handwritten Chinese character recognition (HCCR) problem by learning discriminative features automatically. Nevertheless, state-of-the-art CNNs incur a huge computational cost and require storing a large number of parameters, especially in the fully connected layers, which makes such networks difficult to deploy on alternative hardware devices with limited computation capacity. To solve the storage problem, we propose a novel technique called weighted average pooling that reduces the parameters in the fully connected layer without loss of accuracy. In addition, we implement a cascaded model in a single CNN by adding a mid output that completes recognition as early as possible, which significantly reduces the average inference time. Experiments are performed on the ICDAR-2013 offline HCCR dataset. Our proposed approach needs only 6.9 ms on average to classify a character image and achieves state-of-the-art accuracy of 97.1% while requiring only 3.3 MB of storage.
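The two ideas named in the abstract can be illustrated with a minimal Python (NumPy) sketch. This is not the authors' implementation: the abstract does not define weighted average pooling or the early-exit rule, so the sketch assumes that pooling collapses each feature-map channel to one value using learned positive per-position weights, and that the mid output is accepted when its top softmax probability reaches a confidence threshold. The names `weighted_average_pool`, `cascaded_predict`, `mid_logits_fn`, `final_logits_fn`, and the threshold value 0.9 are hypothetical.

```python
import numpy as np


def weighted_average_pool(features, weights):
    """Collapse a (C, H, W) feature map to a length-C vector.

    Assumption: `weights` has shape (C, H, W), is strictly positive, and
    stands in for learned per-position weights. With uniform weights this
    reduces to ordinary global average pooling.
    """
    # Normalize so the weights of each channel sum to 1.
    norm = weights / weights.sum(axis=(1, 2), keepdims=True)
    return (features * norm).sum(axis=(1, 2))


def softmax(logits):
    """Numerically stable softmax over a 1-D array of logits."""
    shifted = logits - logits.max()
    exp = np.exp(shifted)
    return exp / exp.sum()


def cascaded_predict(mid_logits_fn, final_logits_fn, image, threshold=0.9):
    """Return (class_index, exit_name) using an assumed early-exit rule.

    `mid_logits_fn` and `final_logits_fn` are hypothetical callables that
    map an image to class logits at the mid output and the final output.
    The final branch runs only when the mid output is not confident.
    """
    probs = softmax(mid_logits_fn(image))
    if probs.max() >= threshold:
        return int(probs.argmax()), "mid"
    probs = softmax(final_logits_fn(image))
    return int(probs.argmax()), "final"


if __name__ == "__main__":
    feats = np.arange(8, dtype=float).reshape(2, 2, 2)
    print(weighted_average_pool(feats, np.ones_like(feats)))

    def confident(_):
        return np.array([5.0, 0.0, 0.0])

    def unsure(_):
        return np.array([0.1, 0.0, 0.0])

    def final(_):
        return np.array([0.0, 0.0, 3.0])

    print(cascaded_predict(confident, final, image=None))
    print(cascaded_predict(unsure, final, image=None))
```

Under these assumptions, a classifier fed the pooled vector needs C × K weights for K classes, rather than the C × H × W × K weights needed on a flattened feature map, and inputs that the mid output classifies confidently skip the later layers entirely.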
Pages: 233-240
Page count: 8