A Quality, Size and Time Assessment of the Binarization of Documents Photographed by Smartphones

被引:6
作者
Bernardino, Rodrigo [1 ]
Lins, Rafael Dueire [1 ,2 ,3 ]
Barboza, Ricardo da Silva [3 ]
机构
[1] Univ Fed Pernambuco, Ctr Informat, BR-50670901 Recife, PE, Brazil
[2] Univ Fed Rural Pernambuco, Dept Comp, BR-55815060 Recife, PE, Brazil
[3] Univ Estado Amazonas, Escola Super Tecnol, Coordenacao Engn Comp, BR-69410000 Manaus, AM, Brazil
关键词
document binarization; photographed documents; DIB-dataset; smartphone; binarization algorithms; IMAGE BINARIZATION; ALGORITHM; ENTROPY;
D O I
10.3390/jimaging9020041
中图分类号
TB8 [摄影技术];
学科分类号
0804 ;
摘要
Smartphones with an in-built camera are omnipresent today in the life of over eighty percent of the world's population. They are very often used to photograph documents. Document binarization is a key process in many document processing platforms. This paper assesses the quality, file size and time performance of sixty-eight binarization algorithms using five different versions of the input images. The evaluation dataset is composed of deskjet, laser and offset printed documents, photographed using six widely-used mobile devices with the strobe flash off and on, under two different angles and four shots with small variations in the position. Besides that, this paper also pinpoints the algorithms per device that may provide the best visual quality-time, document transcription accuracy-time, and size-time trade-offs. Furthermore, an indication is also given on the "overall winner" that would be the algorithm of choice if one has to use one algorithm for a smartphone-embedded application.
引用
收藏
页数:24
相关论文
共 73 条
[1]  
Akbari Younes, 2019, 2019 International Conference on Document Analysis and Recognition (ICDAR). Proceedings, P973, DOI 10.1109/ICDAR.2019.00160
[2]   An adaptive local binarization method for document images based on a novel thresholding method and dynamic windows [J].
Bataineh, Bilal ;
Abdullah, Siti Norul Huda Sheikh ;
Omar, Khairuddin .
PATTERN RECOGNITION LETTERS, 2011, 32 (14) :1805-1813
[3]   A non-parametric binarization method based on ensemble of clustering algorithms [J].
Bera, Suman Kumar ;
Ghosh, Soulib ;
Bhowmik, Showmik ;
Sarkar, Ram ;
Nasipuri, Mita .
MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (05) :7653-7673
[4]  
Bernsen J., 1986, ICPR 86, P1251
[5]  
Bradley Derek, 2007, Journal of Graphics Tools, V12, P13
[6]   A selectional auto-encoder approach for document image binarization [J].
Calvo-Zaragoza, Jorge ;
Gallego, Antonio-Javier .
PATTERN RECOGNITION, 2019, 86 :37-47
[7]  
da Silva J. M. M., 2006, Applied Computing 2006. 21st Annual ACM Symposium on Applied Computing, P853, DOI 10.1145/1141277.1141471
[8]  
Doermann D, 2003, PROC INT CONF DOC, P606
[9]   OPERATIONS USEFUL FOR SIMILARITY-INVARIANT PATTERN RECOGNITION [J].
DOYLE, W .
JOURNAL OF THE ACM, 1962, 9 (03) :259-&
[10]  
Dueire Lins Rafael, 2019, 2019 International Conference on Document Analysis and Recognition (ICDAR). Proceedings, P1539, DOI 10.1109/ICDAR.2019.00248