A spatially adaptive statistical method for the binarization of historical manuscripts and degraded document images

被引:39
作者
Hedjam, Rachid [1 ]
Moghaddam, Reza Farrahi [1 ]
Cheriet, Mohamed [1 ]
机构
[1] Ecole Technol Super, Synchromedia Lab Multimedia Commun Telepresence, Montreal, PQ H3C 1K3, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Historical and degraded documents; Document images binarization; Adaptive local document image classification; CLASSIFICATION;
D O I
10.1016/j.patcog.2011.02.021
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we present an adaptive method for the binarization of historical manuscripts and degraded document images. The proposed approach is based on maximum likelihood (ML) classification and uses a priori information and the spatial relationship on the image domain. In contrast with many conventional methods that use a decision based on thresholding, the proposed method performs a soft decision based on a probabilistic model. The main idea is that, from an initialization map (under-binarization) containing only the darkest part of the text, the method is able to recover the main text in the document image, including low-intensity and weak strokes. To do so, fast and robust local estimation of text and background features is obtained using grid-based modeling and inpainting techniques; then, the ML classification is performed to classify pixels into black and white classes. The advantage of the proposed method is that it preserves weak connections and provides smooth and continuous strokes, thanks to its correlation-based nature. Performance is evaluated both subjectively and objectively against standard databases. The proposed method outperforms the state-of-the-art methods presented in the DIBCO'09 binarization contest, although those other methods provide performance close to it. (C) 2011 Elsevier Ltd. All rights reserved.
引用
收藏
页码:2184 / 2196
页数:13
相关论文
共 44 条
[1]  
[Anonymous], 2008, IEEE SIGNAL PROC JUL, V25
[2]  
[Anonymous], 2000, Pattern Classification
[3]   Special issue on the analysis of historical documents [J].
Antonacopoulos, Apostolos ;
Downton, Andy C. .
INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2007, 9 (2-4) :75-77
[4]  
Bernsen J., 1986, 8 INT C PATT REC
[5]   Image inpainting [J].
Bertalmio, M ;
Sapiro, G ;
Caselles, V ;
Ballester, C .
SIGGRAPH 2000 CONFERENCE PROCEEDINGS, 2000, :417-424
[6]  
Chen Q., 2008, PATTERN RECOGNITION, V41
[7]   A recursive thresholding technique for image segmentation [J].
Cheriet, M ;
Said, JN ;
Suen, CY .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 1998, 7 (06) :918-921
[8]  
David B., 2006, NATURE, V439, P358
[9]  
Deriche R., 1996, 2697 INRIA
[10]  
Drira F., 2007, THESIS LIRIS