Optimization of Image Processing Algorithms for Character Recognition in Cultural Typewritten Documents

被引:3
作者
Dias, Mariana [1 ,2 ]
Lopes, Carla Teixeira [1 ,2 ]
机构
[1] Univ Porto, Fac Engn, Rua Dr Roberto Frias, P-4200465 Porto, Portugal
[2] INESC TEC, Rua Dr Roberto Frias, P-4200465 Porto, Portugal
来源
ACM JOURNAL ON COMPUTING AND CULTURAL HERITAGE | 2023年 / 16卷 / 04期
关键词
Image processing; optical character recognition; parameter tuning; cultural heritage; BINARIZATION;
D O I
10.1145/3606705
中图分类号
C [社会科学总论];
学科分类号
03 ; 0303 ;
摘要
Linked data is used in various fields as a new way of structuring and connecting data. Cultural heritage institutions have been using linked data to improve archival descriptions and facilitate the discovery of information. Most archival records have digital representations of physical artifacts in the form of scanned images that are non-machine-readable. Optical Character Recognition (OCR) recognizes text in images and translates it into machine-encoded text. This article evaluates the impact of image processing methods and parameter tuning in OCR applied to typewritten cultural heritage documents. The approach uses a multi-objective problem formulation to minimize Levenshtein edit distance and maximize the number of words correctly identified with a non-dominated sorting genetic algorithm (NSGA-II) to tune the methods' parameters. Evaluation results show that parameterization by digital representation typology benefits the performance of image pre-processing algorithms in OCR. Furthermore, our findings suggest that employing image pre-processing algorithms in OCR might be more suitable for typologies where the text recognition task without pre-processing does not produce good results. In particular, Adaptive Thresholding, Bilateral Filter, and Opening are the best-performing algorithms for the theater plays' covers, letters, and overall dataset, respectively, and should be applied before OCR to improve its performance.
引用
收藏
页数:25
相关论文
共 47 条
[1]  
EA, 2015, Signal & Image Processing An International Journal, V6, P01, DOI [10.5121/sipij.2015.6401, 10.5121/sipij.2015.6401, DOI 10.5121/SIPIJ.2015.6401]
[2]  
[Anonymous], 1996, Proceedings of the Fourth Workshop on Very Large Corpora
[3]   ICDAR2013 Competition on Historical Book Recognition-HBR2013 [J].
Antonacopoulos, A. ;
Clausner, C. ;
Papadopoulos, C. ;
Pletschacher, S. .
2013 12TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2013, :1459-1463
[4]  
Baierer Konstantin, 2020, Models OCROPUS/ocropy wiki
[5]   Linked Data - The Story So Far [J].
Bizer, Christian ;
Heath, Tom ;
Berners-Lee, Tim .
INTERNATIONAL JOURNAL ON SEMANTIC WEB AND INFORMATION SYSTEMS, 2009, 5 (03) :1-22
[6]   Pymoo: Multi-Objective Optimization in Python']Python [J].
Blank, Julian ;
Deb, Kalyanmoy .
IEEE ACCESS, 2020, 8 :89497-89509
[7]   Linked Data for Libraries, Archives and Museums: How to Clean, Link and Publish Your Metadata [J].
Bromage, David .
Archives and Manuscripts, 2016, 44 (03) :172-174
[8]   Selecting automatically pre-processing methods to improve OCR performances [J].
Bui, Quang Anh ;
Mollard, David ;
Tabbone, Salvatore .
2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1, 2017, :169-174
[9]  
Carrasco R.C., 2014, P 1 INT C DIG ACC TE, P179, DOI [10.1145/2595188.2595221, DOI 10.1145/2595188.2595221]
[10]   The Significance of Reading Order in Document Recognition and its Evaluation [J].
Clausner, C. ;
Pletschacher, S. ;
Antonacopoulos, A. .
2013 12TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2013, :688-692