Keyword spotting in historical handwritten documents based on graph matching

被引:17
作者
Stauffer, Michael [1 ,4 ]
Fischer, Andreas [2 ,3 ]
Riesen, Kaspar [1 ]
机构
[1] Univ Appl Sci & Arts Northwestern Switzerland, Inst Informat Syst, CH-4600 Olten, Switzerland
[2] Univ Fribourg, Dept Informat, CH-1700 Fribourg, Switzerland
[3] Univ Appl Sci & Arts Western Switzerland, Inst Complex Syst, CH-1705 Fribourg, Switzerland
[4] Univ Pretoria, Dept Informat, Pretoria, South Africa
关键词
Handwritten keyword spotting; Graph representation; Bipartite graph matching; Ensemble methods; WORD; RECOGNITION; ALGORITHM; MODELS;
D O I
10.1016/j.patcog.2018.04.001
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the last decades historical handwritten documents have become increasingly available in digital form. Yet, the accessibility to these documents with respect to browsing and searching remained limited as full automatic transcription is often not possible or not sufficiently accurate. This paper proposes a novel reliable approach for template-based keyword spotting in historical handwritten documents. In particular, our framework makes use of different graph representations for segmented word images and a sophisticated matching procedure. Moreover, we extend our method to a spotting ensemble. In an exhaustive experimental evaluation on four widely used benchmark datasets we show that the proposed approach is able to keep up or even outperform several state-of-the-art methods for template- and learning-based keyword spotting. (C) 2018 Elsevier Ltd. All rights reserved.
引用
收藏
页码:240 / 253
页数:14
相关论文
共 67 条
[1]   Word matching using single closed contours for indexing handwritten historical documents [J].
Adamek, Tornasz ;
O'Connor, Noel E. ;
Smeaton, Alan F. .
INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2007, 9 (2-4) :153-165
[2]  
ALDAVERT D, 2013, INT C DOC AN REC, P511, DOI DOI 10.1109/ICDAR.2013.108
[3]   Word Spotting and Recognition with Embedded Attributes [J].
Almazan, Jon ;
Gordo, Albert ;
Fornes, Alicia ;
Valveny, Ernest .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2014, 36 (12) :2552-2566
[4]   Segmentation-free word spotting with exemplar SVMs [J].
Almazan, Jon ;
Gordo, Albert ;
Fornes, Alicia ;
Valveny, Ernest .
PATTERN RECOGNITION, 2014, 47 (12) :3967-3978
[5]  
Amed M. R., 2017, INT GRAPH SOC C
[6]  
[Anonymous], 2004, COMBINING PATTERN CL, DOI DOI 10.1002/0471660264
[7]  
[Anonymous], 2008, Proc. ICFHR
[8]   Special issue on the analysis of historical documents [J].
Antonacopoulos, Apostolos ;
Downton, Andy C. .
INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2007, 9 (2-4) :75-77
[9]   Efficient matching and indexing of graph models in content-based retrieval [J].
Berretti, S ;
Del Bimbo, A ;
Vicario, E .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2001, 23 (10) :1089-1105
[10]  
Bin Zhang, 2003, Proceedings of the SPIE - The International Society for Optical Engineering, V5296, P45, DOI 10.1117/12.523968