Residual Recurrent Neural Network with Sparse Training for Offline Arabic Handwriting Recognition

被引:6
作者
Yan, Ruijie [1 ]
Peng, Liangrui [1 ]
Bin, GuangXiang [1 ]
Wang, Shengjin [1 ]
Cheng, Yao [2 ]
机构
[1] Tsinghua Univ, Dept Elect Engn, Tsinghua Natl Lab Informat Sci & Technol, Beijing 100084, Peoples R China
[2] China Mobile Hangzhou Informat Technol Co Ltd, Hangzhou, Zhejiang, Peoples R China
来源
2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1 | 2017年
基金
中国国家自然科学基金;
关键词
D O I
10.1109/ICDAR.2017.171
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep Recurrent Neural Networks (RNN) have been suffering from the overfitting problem due to the model redundancy of the network structures. We propose a novel temporal and spatial residual learning method for RNN, followed with sparse training by weight pruning to gain sparsity in network parameters. For a Long Short-Term Memory (LSTM) network, we explore the combination schemes and parameter settings for temporal and spatial residual learning with sparse training. Experiments are carried out on the IFN/ENIT database. For the character error rate on the testing set e while training with sets a, b, c, d, the previously reported best result is 13.42%, and the proposed configuration of temporal residual learning followed with sparse training achieves the state-of-the-art result 12.06%.
引用
收藏
页码:1031 / 1037
页数:7
相关论文
共 20 条
[1]   Recognizing handwritten Arabic words using grapheme segmentation and recurrent neural networks [J].
Abandah, Gheith A. ;
Jamour, Fuad T. ;
Qaralleh, Esam A. .
INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2014, 17 (03) :275-291
[2]  
[Anonymous], 2015, PROC CVPR IEEE, DOI 10.1109/CVPR.2015.7299173
[3]  
[Anonymous], 1997, Neural Computation
[4]  
[Anonymous], 2008, Advances in neural information processing systems, DOI DOI 10.1007/978-1-4471-4072-6_12
[5]   Pruning algorithms of neural networks - a comparative study [J].
Augasta, M. Gethsiyal ;
Kathirvalavakumar, T. .
OPEN COMPUTER SCIENCE, 2013, 3 (03) :105-115
[6]  
Azizi N, 2010, LECT NOTES COMPUT SC, V5997, P235, DOI 10.1007/978-3-642-12127-2_24
[7]   TRAINING WITH NOISE IS EQUIVALENT TO TIKHONOV REGULARIZATION [J].
BISHOP, CM .
NEURAL COMPUTATION, 1995, 7 (01) :108-116
[8]  
Chen J., 2010, 9th IAPR International Workshop on Document Analysis Systems (DAS), P53, DOI 10.1145/1815330.1815337
[9]  
Chen L., 2017, P 1 INT WORKSH AR SC
[10]   A New Design Based-SVM of the CNN Classifier Architecture with Dropout for Offline Arabic Handwritten Recognition [J].
Elleuch, Mohamed ;
Maalej, Rania ;
Kherallah, Monji .
INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE 2016 (ICCS 2016), 2016, 80 :1712-1723