Historical Document Binarization Combining Semantic Labeling and Graph Cuts

Cited by: 3
Authors
Ayyalasomayajula, Kalyan Ram [1]
Brun, Anders [1]
Affiliations
[1] Uppsala Univ, Dept Informat Technol, Ctr Image Anal, Uppsala, Sweden
Source
IMAGE ANALYSIS, SCIA 2017, PT I | 2017 / Vol. 10269
Funding
Swedish Research Council;
Keywords
Binarization; Semantic labeling; Deep learning; Graph cut; Zero shot learning; LAPLACIAN ENERGY;
DOI
10.1007/978-3-319-59126-1_32
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
Most data mining applications on collections of historical documents require binarization of the digitized images as a preprocessing step. Historical documents are often subject to degradations such as parchment aging, smudges, and bleed-through from the other side of the page. The text is sometimes printed, but more often handwritten. Mathematical modeling of the appearance of the text, the background, and all kinds of degradations is challenging. In the current work we tackle binarization as a pixel classification problem. We first apply semantic segmentation using fully convolutional neural networks. To improve the sharpness of the result, we then apply a graph cut algorithm. The labels from the semantic segmentation are used as approximate estimates of the text and background, with the probability map of the background used for pruning the edges in the graph cut. The results obtained show significant improvement over the state-of-the-art approach.
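The second stage described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a background probability map `p_bg` has already been produced by some segmentation network (not shown), uses negative log-probabilities as unary costs, and prunes a smoothness edge whenever the background probabilities of two neighbouring pixels differ by at least `prune_gap`. The pruning rule, the function name `binarize`, and all parameter values are assumptions made for illustration; the abstract does not specify them. The max-flow solver is a plain Edmonds-Karp routine, chosen only to keep the example dependency-free.

```python
import math
from collections import deque, defaultdict


def binarize(p_bg, smooth=0.5, prune_gap=0.5, eps=1e-6):
    """Graph-cut refinement of a background probability map.

    p_bg: 2D list of background probabilities in [0, 1] (assumed input).
    Returns a 2D list with 1 = text and 0 = background.
    """
    h, w = len(p_bg), len(p_bg[0])
    S, T = "s", "t"
    cap = defaultdict(lambda: defaultdict(float))  # residual capacities

    def add_edge(u, v, c):
        cap[u][v] += c
        cap[v][u] += 0.0  # make sure the reverse residual edge exists

    for y in range(h):
        for x in range(w):
            p = min(max(p_bg[y][x], eps), 1 - eps)
            add_edge(S, (y, x), -math.log(p))      # paid if pixel becomes background
            add_edge((y, x), T, -math.log(1 - p))  # paid if pixel becomes text
            for ny, nx in ((y + 1, x), (y, x + 1)):
                if ny < h and nx < w:
                    # Assumed pruning rule: no smoothness edge across a large
                    # jump in background probability (a likely stroke boundary).
                    if abs(p_bg[ny][nx] - p_bg[y][x]) < prune_gap:
                        add_edge((y, x), (ny, nx), smooth)
                        add_edge((ny, nx), (y, x), smooth)

    def reachable():
        """BFS over edges with positive residual capacity, starting at S."""
        parent = {S: None}
        queue = deque([S])
        while queue:
            u = queue.popleft()
            for v, c in cap[u].items():
                if c > 1e-12 and v not in parent:
                    parent[v] = u
                    queue.append(v)
        return parent

    # Edmonds-Karp: augment along shortest paths until T is unreachable.
    parent = reachable()
    while T in parent:
        flow, v = float("inf"), T
        while parent[v] is not None:
            flow = min(flow, cap[parent[v]][v])
            v = parent[v]
        v = T
        while parent[v] is not None:
            u = parent[v]
            cap[u][v] -= flow
            cap[v][u] += flow
            v = u
        parent = reachable()

    # Pixels still connected to the source after the cut are labelled text.
    return [[1 if (y, x) in parent else 0 for x in range(w)] for y in range(h)]
```

In this sketch, pixels that remain on the source side of the minimum cut are labelled text. The smoothness edges suppress isolated low-confidence pixels, while pruning keeps sharp transitions in the probability map from being smoothed away. A pure-Python solver like this one is only practical for very small images; a dedicated max-flow library would be needed for full document scans.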
Pages: 386-396
Number of pages: 11