A DEEP LEARNING APPROACH TO DOCUMENT IMAGE QUALITY ASSESSMENT

被引:0
作者
Kang, Le [1 ]
Ye, Peng [1 ]
Li, Yi [2 ,3 ]
Doermann, David [1 ]
机构
[1] Univ Maryland, College Pk, MD 20742 USA
[2] NICTA, Canberra, ACT, Australia
[3] Australian Natl Univ, Canberra, ACT, Australia
来源
2014 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP) | 2014年
关键词
Convolutional neural networks; document; image quality;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
This paper proposes a deep learning approach for document image quality assessment. Given a noise corrupted document image, we estimate its quality score as a prediction of OCR accuracy. First the document image is divided into patches and non-informative patches are sifted out using Otsu's binarization technique. Second, quality scores are obtained for all selected patches using a Convolutional Neural Network (CNN), and the patch scores are averaged over the image to obtain the document score. The proposed CNN contains two layers of convolution, location blind max-min pooling, and Rectified Linear Units in the fully connected layers. Experiments on two document quality datasets show our method achieved the state of the art performance.
引用
收藏
页码:2570 / 2574
页数:5
相关论文
共 19 条
[1]  
[Anonymous], 2014, IEEE C COMP VIS PATT
[2]  
[Anonymous], 2010, P PYTHON SCI COMPUTI
[3]  
[Anonymous], 2010, ADV NEURAL INFORM PR
[4]  
[Anonymous], 1999, INT J DOC ANAL RECOG, DOI DOI 10.1007/S100320050039
[5]  
Blando L. R., 1995, Proceedings of the Third International Conference on Document Analysis and Recognition, P319, DOI 10.1109/ICDAR.1995.599003
[6]  
Ciresan D, 2012, PROC CVPR IEEE, P3642, DOI 10.1109/CVPR.2012.6248110
[7]   ImageNet Classification with Deep Convolutional Neural Networks [J].
Krizhevsky, Alex ;
Sutskever, Ilya ;
Hinton, Geoffrey E. .
COMMUNICATIONS OF THE ACM, 2017, 60 (06) :84-90
[8]   Mobile Video Capture of Multi-page Documents [J].
Kumar, Jayant ;
Bala, Raja ;
Ding, Hengzhou ;
Emmett, Phillip .
2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2013, :35-40
[9]  
Kumar J, 2012, INT C PATT RECOG, P3292
[10]  
Kumar Jayant., 2013, International Workshop on Camera-Based Document Analysis and Recognition (CBDAR), P39