Residual Recurrent Neural Network with Sparse Training for Offline Arabic Handwriting Recognition

被引:6
作者
Yan, Ruijie [1 ]
Peng, Liangrui [1 ]
Bin, GuangXiang [1 ]
Wang, Shengjin [1 ]
Cheng, Yao [2 ]
机构
[1] Tsinghua Univ, Dept Elect Engn, Tsinghua Natl Lab Informat Sci & Technol, Beijing 100084, Peoples R China
[2] China Mobile Hangzhou Informat Technol Co Ltd, Hangzhou, Zhejiang, Peoples R China
来源
2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1 | 2017年
基金
中国国家自然科学基金;
关键词
D O I
10.1109/ICDAR.2017.171
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep Recurrent Neural Networks (RNN) have been suffering from the overfitting problem due to the model redundancy of the network structures. We propose a novel temporal and spatial residual learning method for RNN, followed with sparse training by weight pruning to gain sparsity in network parameters. For a Long Short-Term Memory (LSTM) network, we explore the combination schemes and parameter settings for temporal and spatial residual learning with sparse training. Experiments are carried out on the IFN/ENIT database. For the character error rate on the testing set e while training with sets a, b, c, d, the previously reported best result is 13.42%, and the proposed configuration of temporal residual learning followed with sparse training achieves the state-of-the-art result 12.06%.
引用
收藏
页码:1031 / 1037
页数:7
相关论文
共 20 条
[11]  
Girija S.S., 2016, TENSORFLOW LARGE SCA
[12]  
Graves A., 2006, INT C MACH LEARN
[13]  
Han Song, 2016, ARXIV160704381
[14]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[15]  
Margner Y., 2006, ACT 9 C INT FRANC EC, P259
[16]   Combining Slanted-Frame Classifiers for Improved HMM-Based Arabic Handwriting Recognition [J].
Mohamad, Ramy Al-Hajj ;
Likforman-Sulem, Laurence ;
Mokbel, Chafic .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2009, 31 (07) :1165-1177
[17]  
Pechwitz M., 2002, P CIFED CIT, P127
[18]  
Srivastava N, 2014, J MACH LEARN RES, V15, P1929
[19]  
Wang, 2016, Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, P938
[20]   A comparison of 1D and 2D LSTM architectures for the recognition of handwritten Arabic [J].
Yousefi, Mohammad Reza ;
Soheili, Mohammad Reza ;
Breuel, Thomas M. ;
Stricker, Didier .
DOCUMENT RECOGNITION AND RETRIEVAL XXII, 2015, 9402