Re-Typograph Phase I: a Proof-of-Concept for Typeface Parameter Extraction from Historical Documents

Cited by: 2
Authors
Lamiroy, Bart [1]
Bouville, Thomas [2]
Blegean, Julien [3]
Cao, Hongliu [4]
Ghamizi, Salah [4]
Houpin, Romain [3]
Lloyd, Matthias [4]
Affiliations
[1] Univ Lorraine, LORIA UMR 7503, F-54506 Vandoeuvre Les Nancy, France
[2] Ecole Natl Super Art Nancy, Atelier Natl Rech Typograph, F-54013 Nancy, France
[3] Univ Lorraine Telecom Nancy, Nancy, France
[4] Univ Lorraine Mines Nancy, Nancy, France
Source
DOCUMENT RECOGNITION AND RETRIEVAL XXII | 2015 / Vol. 9402
Keywords
FONT; RECOGNITION; ALGORITHM; BOOKS;
DOI
10.1117/12.2075813
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
This paper reports on the first phase of an attempt to create a full retro-engineering pipeline that aims to construct a complete set of coherent typographic parameters defining the typefaces used in a printed homogeneous text. It should be stressed that this process cannot reasonably be expected to be fully automatic and that it is designed to include human interaction. Although font design is governed by a set of quite robust and formal geometric rulesets, it still relies heavily on subjective human interpretation. Furthermore, different parameters applied to the generic rulesets may actually result in typefaces that are quite similar and visually difficult to distinguish, making the retro-engineering an inverse problem that is ill-conditioned once shape distortions (related to the printing and/or scanning process) come into play. This work is the first phase of a long iterative process, in which we will progressively study and assess the state-of-the-art techniques that are most suited to our problem and investigate new directions when they prove not quite adequate. As a first step, this is more of a feasibility proof of concept that will allow us to clearly pinpoint the items that will require more in-depth research over the next iterations.
Pages: 12