Real-Time Document Image Classification using Deep CNN and Extreme Learning Machines

被引:45
作者
Koelsch, Andreas [1 ,2 ]
Afzal, Muhammad Zeshan [1 ,2 ]
Ebbecke, Markus [2 ]
Liwicki, Marcus [1 ,2 ,3 ]
机构
[1] Univ Kaiserslautern, MindGarage, Kaiserslautern, Germany
[2] Insiders Technol GmbH, Kaiserslautern, Germany
[3] Univ Fribourg, Fribourg, Switzerland
来源
2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1 | 2017年
关键词
Document Image Classification; Deep CNN; Convolutional Neural Network; Transfer Learning;
D O I
10.1109/ICDAR.2017.217
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents an approach for real-time training and testing for document image classification. In production environments, it is crucial to perform accurate and (time-)efficient training. Existing deep learning approaches for classifying documents do not meet these requirements, as they require much time for training and fine-tuning the deep architectures. Motivated from Computer Vision, we propose a two-stage approach. The first stage trains a deep network that works as feature extractor and in the second stage, Extreme Learning Machines (ELMs) are used for classification. The proposed approach outperforms all previously reported structural and deep learning based methods with a final accuracy of 83.24% on Tobacco-3482 dataset, leading to a relative error reduction of 25% when compared to a previous Convolutional Neural Network (CNN) based approach (DeepDocClassifier). More importantly, the training time of the ELM is only 1.176 seconds and the overall prediction time for 2, 482 images is 3.066 seconds. As such, this novel approach makes deep learning-based document classification suitable for large-scale real-time applications.
引用
收藏
页码:1318 / 1323
页数:6
相关论文
共 23 条
[1]  
Afzal MZ, 2015, PROC INT CONF DOC, P1111, DOI 10.1109/ICDAR.2015.7333933
[2]  
[Anonymous], ICPR
[3]  
[Anonymous], PROC CVPR IEEE
[4]  
[Anonymous], CLUSTERING BASED ALG
[5]  
[Anonymous], 2007, Supervised machine learning: A review of classification techniques
[6]   Fine-grained document genre classification using first order random graphs [J].
Bagdanov, AD ;
Worring, M .
SIXTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, PROCEEDINGS, 2001, :79-83
[7]  
Byun Yungcheol., 2000, Proceedings of the 2000 ACM Symposium on Applied Computing, V1, P1
[8]  
Chen SY, 2012, INT C PATT RECOG, P653
[9]  
Dengel A., 1995, Proceedings of the Third International Conference on Document Analysis and Recognition, P587, DOI 10.1109/ICDAR.1995.601965
[10]   Using differential evolution for fine tuning naive Bayesian classifiers and its application for text classification [J].
Diab, Diab M. ;
El Hindi, Khalil M. .
APPLIED SOFT COMPUTING, 2017, 54 :183-199