Fast Binarization of Unevenly Illuminated Document Images Based on Background Estimation for Optical Character Recognition Purposes

被引:0
作者
Michalak, Hubert [1 ]
Okarma, Krzysztof [1 ]
机构
[1] West Pomeranian Univ Technol, Szczecin, Poland
关键词
binarization; OCR; document image analysis;
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
One of the key operations during the image preprocessing step in Optical Character Recognition (OCR) algorithms is image binarization. Although for uniformly illuminated images, obtained typically by flatbed scanners, the use of a single global threshold may be sufficient for further recognition of individual characters, it cannot be applied directly in case of non-uniform lightened document images. Such problem may occur during capturing photos of documents in unknown lighting conditions making a proper text recognition impossible in some parts of the image. Since the application of popular adaptive thresholding methods, e.g. Niblack, Sauvola and their modifications, based on the analysis of the neighbourhood of each pixel is time consuming, a faster solution might be the division of images into blocks or elimination of non-uniform background. Such an approach can be considered as a balance solution filling the gap between global and local adaptive thresholding. The solution proposed in the paper, useful also for various mobile devices due to limited computational requirements, is based on the approximation of lighting distribution of the background using the reduced resolution images. The proposed method allows to obtain very good OCR results being superior in comparison to typical adaptive binarization algorithms both in terms of the resulting OCR accuracy and computational efficiency.
引用
收藏
页码:627 / 646
页数:20
相关论文
共 37 条
[1]  
Bersen J., 1986, Eighth International Conference on Pattern Recognition. Proceedings (Cat. No.86CH2342-4), P1251
[2]  
Bradley Derek, 2007, Journal of Graphics Tools, V12, P13
[3]   Broken and degraded document images binarization [J].
Chen, Yiping ;
Wang, Liansheng .
NEUROCOMPUTING, 2017, 237 :272-280
[4]   A binarization method with learning-built rules for document images produced by cameras [J].
Chou, Chien-Hsing ;
Lin, Wen-Hsiung ;
Chang, Fu .
PATTERN RECOGNITION, 2010, 43 (04) :1518-1530
[5]  
Erol B., 2008, P 16 ACM INT C MULTI, P399
[6]   Adaptive binarization method for document image analysis [J].
Feng, ML ;
Tan, YP .
2004 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXP (ICME), VOLS 1-3, 2004, :339-342
[7]   A novel variational model for noise robust document image binarization [J].
Feng, Shu .
NEUROCOMPUTING, 2019, 325 :288-302
[8]   Adaptive degraded document image binarization [J].
Gatos, B ;
Pratikakis, I ;
Perantonis, SJ .
PATTERN RECOGNITION, 2006, 39 (03) :317-327
[9]   A NEW METHOD FOR GRAY-LEVEL PICTURE THRESHOLDING USING THE ENTROPY OF THE HISTOGRAM [J].
KAPUR, JN ;
SAHOO, PK ;
WONG, AKC .
COMPUTER VISION GRAPHICS AND IMAGE PROCESSING, 1985, 29 (03) :273-285
[10]   Improved Degraded Document Image Binarization Using Median Filter for Background Estimation [J].
Khitas, Mehdi ;
Ziet, Lahcene ;
Bouguezel, Saad .
ELEKTRONIKA IR ELEKTROTECHNIKA, 2018, 24 (03) :82-87