A Coarse-to-Fine Word Spotting Approach for Historical Handwritten Documents Based on Graph Embedding and Graph Edit Distance

被引:20
|
作者
Wang, Peng [1 ,2 ]
Eglin, Veronique [1 ]
Garcia, Christophe [1 ]
Largeron, Christine [2 ]
Llados, Josep [3 ]
Fornes, Alicia [3 ]
机构
[1] Inst Natl Sci Appl, CNRS, UMR5205, LIRIS, F-69621 Villeurbanne, France
[2] Univ St Etienne, CNRS, UMR5516, LAHC, F-42023 St Etienne, France
[3] Univ Autonoma Barcelona, Comp Vis Ctr, Bellaterra 08196, Spain
来源
2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR) | 2014年
关键词
word spotting; coarse-to-fine mechamism; graph-based representation; graph embedding; graph edit distance; REPRESENTATION;
D O I
10.1109/ICPR.2014.530
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Effective information retrieval on handwritten document images has always been a challenging task, especially historical ones. In the paper, we propose a coarse-to-fine handwritten word spotting approach based on graph representation. The presented model comprises both the topological and morphological signatures of the handwriting. Skeleton-based graphs with the Shape Context labelled vertexes are established for connected components. Each word image is represented as a sequence of graphs. Aiming at developing a practical and efficient word spotting approach for large-scale historical handwritten documents, a fast and coarse comparison is first applied to prune the regions that are not similar to the query based on the graph embedding methodology. Afterwards, the query and regions of interest are compared by graph edit distance based on the Dynamic Time Warping alignment. The proposed approach is evaluated on a public dataset containing 50 pages of historical marriage license records. The results show that the proposed approach achieves a compromise between efficiency and accuracy.
引用
收藏
页码:3074 / 3079
页数:6
相关论文
共 26 条
  • [1] A Coarse-to-Fine Approach for Handwritten Word Spotting in Large Scale Historical Documents Collection
    Almazan, J.
    Fernandez, D.
    Fornes, A.
    Llados, J.
    Valveny, E.
    13TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR 2012), 2012, : 455 - 460
  • [2] Graph Based Keyword Spotting in Handwritten Historical Slavic Documents
    Riesen, Kaspar
    Brodic, Darko
    ERCIM NEWS, 2013, (95): : 37 - 38
  • [3] Keyword spotting in historical handwritten documents based on graph matching
    Stauffer, Michael
    Fischer, Andreas
    Riesen, Kaspar
    PATTERN RECOGNITION, 2018, 81 : 240 - 253
  • [4] Graph-Based Keyword Spotting in Historical Handwritten Documents
    Stauffer, Michael
    Fischer, Andreas
    Riesen, Kaspar
    STRUCTURAL, SYNTACTIC, AND STATISTICAL PATTERN RECOGNITION, S+SSPR 2016, 2016, 10029 : 564 - 573
  • [5] Filters for graph-based keyword spotting in historical handwritten documents
    Stauffer, Michael
    Fischer, Andreas
    Riesen, Kaspar
    PATTERN RECOGNITION LETTERS, 2020, 134 : 125 - 134
  • [6] Ensembles for Graph-based Keyword Spotting in Historical Handwritten Documents
    Stauffer, Michael
    Fischer, Andreas
    Riesen, Kaspar
    2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1, 2017, : 714 - 720
  • [7] Graph-Based Keyword Spotting in Historical Documents Using Context-Aware Hausdorff Edit Distance
    Stauffer, Michael
    Fischer, Andreas
    Riesen, Kaspar
    2018 13TH IAPR INTERNATIONAL WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS (DAS), 2018, : 49 - 54
  • [8] Speeding-Up Graph-Based Keyword Spotting in Historical Handwritten Documents
    Stauffer, Michael
    Fischer, Andreas
    Riesen, Kaspar
    GRAPH-BASED REPRESENTATIONS IN PATTERN RECOGNITION (GBRPR 2017), 2017, 10310 : 83 - 93
  • [9] Graph-based keyword spotting in historical manuscripts using Hausdorff edit distance
    Ameri, Mohammad Reza
    Stauffer, Michael
    Riesen, Kaspar
    Bui, Tien D.
    Fischer, Andreas
    PATTERN RECOGNITION LETTERS, 2019, 121 : 61 - 67
  • [10] Cross-Evaluation of Graph-Based Keyword Spotting in Handwritten Historical Documents
    Stauffer, Michael
    Maergner, Paul
    Fischer, Andreas
    Riesen, Kaspar
    GRAPH-BASED REPRESENTATIONS IN PATTERN RECOGNITION, GBRPR 2019, 2019, 11510 : 45 - 55