DETEXTIVE optical character recognition with pattern matching on-the-fly

被引:6
作者
Caluori, Ursina [1 ]
Simon, Klaus [1 ]
机构
[1] EMPA, Swiss Fed Labs Mat Testing & Res, Dubendorf, Switzerland
关键词
OCR; Pattern matching; Historic prints; Black letter fonts; Mass-digitization; Performance test;
D O I
10.1016/j.patcog.2014.08.026
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we present a new OCR-concept designed for the requirements of historic prints in the context of mass-digitizations. The core part is the glyph recognition, based on pattern matching with patterns that are derived from computer font glyphs and are generated on-the-fly. The classification of a sample is organized as a search process for the most similar glyph pattern. This results in consistently good hit rates for arbitrary fonts without any training. In particular, we investigate the performance of our prototype in comparison to popular commercially available OCR-software. (C) 2014 Elsevier Ltd. All rights reserved.
引用
收藏
页码:827 / 836
页数:10
相关论文
共 21 条
[1]  
[Anonymous], OPTICAL CHARACTER RE
[2]  
[Anonymous], ARCHIVING 2013 2 5 A
[3]  
[Anonymous], FEATURE EXTRACTION A
[4]  
[Anonymous], POSTSCRIPT EXAMPLE
[5]  
[Anonymous], DIE POSTSCRIPT PDF B
[6]  
[Anonymous], 21 INT C SOFTW TEL C
[7]  
[Anonymous], ARCHIVING 2013 2 5 A
[8]  
[Anonymous], OPTICAL CHARACTER RE
[9]  
[Anonymous], THESIS LINKOPING U
[10]  
[Anonymous], 2000, Pattern Classification