Residual Recurrent Neural Network with Sparse Training for Offline Arabic Handwriting Recognition

被引：6

作者：

Yan, Ruijie ^{[1
]}

Peng, Liangrui ^{[1
]}

Bin, GuangXiang ^{[1
]}

Wang, Shengjin ^{[1
]}

Cheng, Yao ^{[2
]}

机构：

[1] Tsinghua Univ, Dept Elect Engn, Tsinghua Natl Lab Informat Sci & Technol, Beijing 100084, Peoples R China

[2] China Mobile Hangzhou Informat Technol Co Ltd, Hangzhou, Zhejiang, Peoples R China

来源：

2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1 | 2017年

基金：

中国国家自然科学基金;

关键词：

D O I：

10.1109/ICDAR.2017.171

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Deep Recurrent Neural Networks (RNN) have been suffering from the overfitting problem due to the model redundancy of the network structures. We propose a novel temporal and spatial residual learning method for RNN, followed with sparse training by weight pruning to gain sparsity in network parameters. For a Long Short-Term Memory (LSTM) network, we explore the combination schemes and parameter settings for temporal and spatial residual learning with sparse training. Experiments are carried out on the IFN/ENIT database. For the character error rate on the testing set e while training with sets a, b, c, d, the previously reported best result is 13.42%, and the proposed configuration of temporal residual learning followed with sparse training achieves the state-of-the-art result 12.06%.

引用

页码：1031 / 1037

页数：7

共 20 条

[1] Recognizing handwritten Arabic words using grapheme segmentation and recurrent neural networks [J].

Abandah, Gheith A. ;

Jamour, Fuad T. ;

Qaralleh, Esam A. .

INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2014, 17 (03) :275-291

[2]

[Anonymous], 2015, PROC CVPR IEEE, DOI 10.1109/CVPR.2015.7299173

[3]

[Anonymous], 1997, Neural Computation

[4]

[Anonymous], 2008, Advances in neural information processing systems, DOI DOI 10.1007/978-1-4471-4072-6_12

[5] Pruning algorithms of neural networks - a comparative study [J].

Augasta, M. Gethsiyal ;

Kathirvalavakumar, T. .

OPEN COMPUTER SCIENCE, 2013, 3 (03) :105-115

[6]

Azizi N, 2010, LECT NOTES COMPUT SC, V5997, P235, DOI 10.1007/978-3-642-12127-2_24

[7] TRAINING WITH NOISE IS EQUIVALENT TO TIKHONOV REGULARIZATION [J].

BISHOP, CM .

NEURAL COMPUTATION, 1995, 7 (01) :108-116

[8]

Chen J., 2010, 9th IAPR International Workshop on Document Analysis Systems (DAS), P53, DOI 10.1145/1815330.1815337

[9]

Chen L., 2017, P 1 INT WORKSH AR SC

[10] A New Design Based-SVM of the CNN Classifier Architecture with Dropout for Offline Arabic Handwritten Recognition [J].

Elleuch, Mohamed ;

Maalej, Rania ;

Kherallah, Monji .

INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE 2016 (ICCS 2016), 2016, 80 :1712-1723

← 1 2 →