A DEEP LEARNING APPROACH TO DOCUMENT IMAGE QUALITY ASSESSMENT

被引：0

作者：

Kang, Le ^{[1
]}

Ye, Peng ^{[1
]}

Li, Yi ^{[2
,3
]}

Doermann, David ^{[1
]}

机构：

[1] Univ Maryland, College Pk, MD 20742 USA

[2] NICTA, Canberra, ACT, Australia

[3] Australian Natl Univ, Canberra, ACT, Australia

来源：

2014 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP) | 2014年

关键词：

Convolutional neural networks; document; image quality;

D O I：

暂无

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

This paper proposes a deep learning approach for document image quality assessment. Given a noise corrupted document image, we estimate its quality score as a prediction of OCR accuracy. First the document image is divided into patches and non-informative patches are sifted out using Otsu's binarization technique. Second, quality scores are obtained for all selected patches using a Convolutional Neural Network (CNN), and the patch scores are averaged over the image to obtain the document score. The proposed CNN contains two layers of convolution, location blind max-min pooling, and Rectified Linear Units in the fully connected layers. Experiments on two document quality datasets show our method achieved the state of the art performance.

引用

页码：2570 / 2574

页数：5

共 19 条

[1]

[Anonymous], 2014, IEEE C COMP VIS PATT

[2]

[Anonymous], 2010, P PYTHON SCI COMPUTI

[3]

[Anonymous], 2010, ADV NEURAL INFORM PR

[4]

[Anonymous], 1999, INT J DOC ANAL RECOG, DOI DOI 10.1007/S100320050039

[5]

Blando L. R., 1995, Proceedings of the Third International Conference on Document Analysis and Recognition, P319, DOI 10.1109/ICDAR.1995.599003

[6]

Ciresan D, 2012, PROC CVPR IEEE, P3642, DOI 10.1109/CVPR.2012.6248110

[7] ImageNet Classification with Deep Convolutional Neural Networks [J].

Krizhevsky, Alex ;

Sutskever, Ilya ;

Hinton, Geoffrey E. .

COMMUNICATIONS OF THE ACM, 2017, 60 (06) :84-90

[8] Mobile Video Capture of Multi-page Documents [J].

Kumar, Jayant ;

Bala, Raja ;

Ding, Hengzhou ;

Emmett, Phillip .

2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2013, :35-40

[9]

Kumar J, 2012, INT C PATT RECOG, P3292

[10]

Kumar Jayant., 2013, International Workshop on Camera-Based Document Analysis and Recognition (CBDAR), P39

← 1 2 →