Towards style-based dating of historical documents

被引:40
作者
He, Sheng [1 ]
Samara, Petros [2 ]
Burgers, Jan [3 ]
Schomaker, Lambert [1 ]
机构
[1] Univ Groningen, NL-9700 AB Groningen, Netherlands
[2] Inst Nederlandse Geschiedenis, The Hague, Netherlands
[3] Univ Amsterdam, NL-1012 WX Amsterdam, Netherlands
来源
2014 14TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR) | 2014年
关键词
Medieval Paleographic Scale; historical document dating; age estimation; global and local regression; AGE ESTIMATION;
D O I
10.1109/ICFHR.2014.52
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Estimating the date of undated medieval manuscripts by evaluating the script they contain, using document image analysis, is helpful for scholars of various disciplines studying the Middle Ages. However, there are, as yet, no systems to automatically and effectively infer the age of historical scripts using machine learning methods. To build a system to date medieval documents is a challenging problem in several aspects: 1) As yet, no suitable reference dataset of medieval handwriting exists; 2) relatively little is known about the evolution of writing styles in the Middle Ages, and especially in the later Middle Ages. Our Medieval Paleographic Scale (MPS) project aims at solving these problems. We have collected a corpus of charters from the Medieval Dutch language area, dating from the period 1300 to 1550. A global and local regression method is proposed for learning and estimating the year in which these documents were written, using several features which have been successfully used in writer identification. The proposed system can serve as a bask tool for the medievalist or paleographer. The experimental results of the proposed method demonstrate its effectiveness.
引用
收藏
页码:265 / 270
页数:6
相关论文
共 14 条
[1]   Writer identification using directional ink-trace width measurements [J].
Brink, A. A. ;
Smit, J. ;
Bulacu, M. L. ;
Schomaker, L. R. B. .
PATTERN RECOGNITION, 2012, 45 (01) :162-171
[2]   Text-independent writer identification and verification using textural and allographic features [J].
Bulacu, Marius ;
Schomaker, Lambert .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2007, 29 (04) :701-717
[3]  
Chen K., 2013, COMPUTER VISION PATT
[4]   Automatic age estimation based on facial aging patterns [J].
Geng, Xin ;
Zhou, Zhi-Hua ;
Smith-Miles, Kate .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2007, 29 (12) :2234-2240
[5]  
Gumbert J.Peter., 1976, Essays Presented to G. I. Lieftinck. Vol. 4, Miniatures, Scripts, V4, P45
[6]   Image-based human age estimation by manifold learning and locally adjusted robust regression [J].
Guo, Guodong ;
Fu, Yun ;
Dyer, Charles R. ;
Huang, Thomas S. .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2008, 17 (07) :1178-1188
[7]   Delta-n Hinge: rotation-invariant features for writer identification [J].
He, Sheng ;
Schomaker, Lambert .
2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014, :2023-2028
[8]  
Palermo Frank, 2012, Computer Vision - ECCV 2012. Proceedings of the 12th European Conference on Computer Vision, P499, DOI 10.1007/978-3-642-33783-3_36
[9]   Using codebooks of fragmented connected-component contours in forensic and historic writer identification [J].
Schomaker, Lambert ;
Franke, Katrin ;
Bulacu, Marius .
PATTERN RECOGNITION LETTERS, 2007, 28 (06) :719-727
[10]  
Stokes P. A., 2012, DIGITAL HUMANITIES 2, P382