A Multiple-Label Guided Clustering Algorithm for Historical Document Dating and Localization

被引:19
作者
He, Sheng [1 ]
Samara, Petros [2 ]
Burgers, Jan [2 ,3 ,4 ]
Schomaker, Lambert [1 ]
机构
[1] Univ Groningen, Inst Artificial Intelligence & Cognit Engn, NL-9700 AK Groningen, Netherlands
[2] Huygens Inst Hist Netherlands, NL-2509 LT The Hague, Netherlands
[3] Royal Netherlands Acad Arts & Sci, NL-2509 LT The Hague, Netherlands
[4] Univ Amsterdam, Dept Hist, NL-1012 CX Amsterdam, Netherlands
关键词
Historical document dating; historical document localization; histogram of orientations of handwritten stroke; multi-label self-organizing map; WRITER IDENTIFICATION; AGE ESTIMATION; HANDWRITTEN; CLASSIFICATION; HISTOGRAMS;
D O I
10.1109/TIP.2016.2602078
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
It is of essential importance for historians to know the date and place of origin of the documents they study. It would be a huge advancement for historical scholars if it would be possible to automatically estimate the geographical and temporal provenance of a handwritten document by inferring them from the handwriting style of such a document. We propose a multiple-label guided clustering algorithm to discover the correlations between the concrete low-level visual elements in historical documents and abstract labels, such as date and location. First, a novel descriptor, called histogram of orientations of handwritten strokes, is proposed to extract and describe the visual elements, which is built on a scale-invariant polar-feature space. In addition, the multi-label self-organizing map (MLSOM) is proposed to discover the correlations between the low-level visual elements and their labels in a single framework. Our proposed MLSOM can be used to predict the labels directly. Moreover, the MLSOM can also be considered as a pre-structured clustering method to build a codebook, which contains more discriminative information on date and geography. The experimental results on the medieval paleographic scale data set demonstrate that our method achieves state-of-the-art results.
引用
收藏
页码:5252 / 5265
页数:14
相关论文
共 44 条
[1]  
[Anonymous], P 3 INT WORKSH HIST
[2]  
[Anonymous], 2007, 2007 IEEE C COMPUTER, DOI DOI 10.1109/CVPR.2007.383198
[3]  
[Anonymous], 2015, P 3 INT WORKSH HIST, DOI DOI 10.1145/2809544.2809560
[4]   Shape matching and object recognition using shape contexts [J].
Belongie, S ;
Malik, J ;
Puzicha, J .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2002, 24 (04) :509-522
[5]   Writer identification using directional ink-trace width measurements [J].
Brink, A. A. ;
Smit, J. ;
Bulacu, M. L. ;
Schomaker, L. R. B. .
PATTERN RECOGNITION, 2012, 45 (01) :162-171
[6]   Text-independent writer identification and verification using textural and allographic features [J].
Bulacu, Marius ;
Schomaker, Lambert .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2007, 29 (04) :701-717
[7]   A Learning Framework for Age Rank Estimation Based on Face Images With Scattering Transform [J].
Chang, Kuang-Yu ;
Chen, Chu-Song .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2015, 24 (03) :785-798
[8]   Histograms of oriented gradients for human detection [J].
Dalal, N ;
Triggs, B .
2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, :886-893
[9]   What Makes Paris Look like Paris? [J].
Doersch, Carl ;
Singh, Saurabh ;
Gupta, Abhinav ;
Sivic, Josef ;
Efros, Alexei A. .
ACM TRANSACTIONS ON GRAPHICS, 2012, 31 (04)
[10]  
Fei-Fei L, 2005, PROC CVPR IEEE, P524