YinYang, a Fast and Robust Adaptive Document Image Binarization for Optical Character Recognition

被引:3
作者
Bloechle, Jean-Luc [1 ]
Hennebert, Jean [2 ]
Gisler, Christophe [2 ]
机构
[1] Univ Fribourg, CoPeLab Grp, Fac Sci & Med, Fribourg, Switzerland
[2] Univ Appl Sci Western Switzerland, iCoSys Inst, Fribourg, Switzerland
来源
PROCEEDINGS OF THE 2023 ACM SYMPOSIUM ON DOCUMENT ENGINEERING, DOCENG 2023 | 2023年
关键词
binarization; image thresholding; image processing; OCR; COMBINATION;
D O I
10.1145/3573128.3609354
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Optical Character Recognition (OCR) from document photos taken by cell phones is a challenging task. Most OCR methods require prior binarization of the image, which can be difficult to achieve when documents are captured with various mobile devices in unknown lighting conditions. For example, shadows cast by the camera or the camera holder on a hard copy can jeopardize the binarization process and hinder the next OCR step. In the case of highly uneven illumination, binarization methods using global thresholding simply fail, and state-of-the-art adaptive algorithms often deliver unsatisfactory results. In this paper, we present a new binarization algorithm using two complementary local adaptive passes and taking advantage of the color components to improve results over current image binarization methods. The proposed approach gave remarkable results at the DocEng'22 competition on the binarization of photographed documents.
引用
收藏
页数:4
相关论文
共 13 条
[1]   Optimal combination of document binarization techniques using a self-organizing map neural network [J].
Badekas, E. ;
Papamarkos, N. .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2007, 20 (01) :11-24
[2]   A Quality, Size and Time Assessment of the Binarization of Documents Photographed by Smartphones [J].
Bernardino, Rodrigo ;
Lins, Rafael Dueire ;
Barboza, Ricardo da Silva .
JOURNAL OF IMAGING, 2023, 9 (02)
[3]  
Bradley Derek, 2007, Journal of Graphics Tools, V12, P13
[4]  
Dueire Lins Rafael, 2019, 2019 International Conference on Document Analysis and Recognition (ICDAR). Proceedings, P1539, DOI 10.1109/ICDAR.2019.00248
[5]   Discrete CRF based combination framework for document image binarization [J].
Hebert, David ;
Nicolas, Stephane ;
Paquet, Thierry .
2013 12TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2013, :1165-1169
[6]   CUBIC CONVOLUTION INTERPOLATION FOR DIGITAL IMAGE-PROCESSING [J].
KEYS, RG .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1981, 29 (06) :1153-1160
[7]   The Winner Takes It All: Choosing the "best" Binarization Algorithm for Photographed Documents [J].
Lins, Rafael Dueire ;
Bernardino, Rodrigo Barros ;
Barboza, Ricardo ;
Oliveira, Raimundo .
DOCUMENT ANALYSIS SYSTEMS, DAS 2022, 2022, 13237 :48-64
[8]   ICDAR 2021 Competition on Time-Quality Document Image Binarization [J].
Lins, Rafael Dueire ;
Bernardino, Rodrigo Barros ;
Smith, Elisa Barney ;
Kavallieratou, Ergina .
DOCUMENT ANALYSIS AND RECOGNITION, ICDAR 2021, PT IV, 2021, 12824 :708-722
[9]   Binarisation of Photographed Documents Image Quality and Processing Time Assessment [J].
Lins, Rafael Dueire ;
Simske, Steven J. ;
Bernardino, Rodrigo Barros .
PROCEEDINGS OF THE 21ST ACM SYMPOSIUM ON DOCUMENT ENGINEERING (DOCENG '21), 2021,
[10]   AdOtsu: An adaptive and parameterless generalization of Otsu's method for document image binarization [J].
Moghaddam, Reza Farrahi ;
Cheriet, Mohamed .
PATTERN RECOGNITION, 2012, 45 (06) :2419-2431