Restoring ink bleed-through degraded document images using a recursive unsupervised classification technique

被引:0
作者
Fadoua, D [1 ]
Le Bourgeois, F [1 ]
Emptoz, H [1 ]
机构
[1] Inst Natl Sci Appl, LIRIS, F-69621 Villeurbanne, France
来源
DOCUMENT ANALYSIS SYSTEMS VII, PROCEEDINGS | 2006年 / 3872卷
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a new method to restore a particular type of degradation related to ancient document images. This degradation, referred to as "bleed-through", is due to the paper porosity, the chemical quality of the ink, or the conditions of digitalization. It appears as marks degrading the readability of the document image. Our purpose consists then in removing these marks to improve readability. The proposed method is based on a recursive unsupervised segmentation approach applied on the decorrelated data space by the principal component analysis. It generates a binary tree that only the leaves images satisfying a certain condition on their logarithmic histogram are processed. Some experiments, done on real ancient document images provided by the archives of "Chatillon-Chalaronne" illustrate the effectiveness of the suggested method.
引用
收藏
页码:38 / 49
页数:12
相关论文
共 13 条
[1]  
BAIRD HS, 2000, IAPR 2000 WORKSH DOC
[2]  
CHRIS D, 2004, P INT C MACH LEARN
[3]  
Dubois E, 2001, PICS 2001: IMAGE PROCESSING, IMAGE QUALITY, IMAGE CAPTURE, SYSTEMS CONFERENCE, PROCEEDINGS, P177
[4]  
GATOS B, 2004, 6 INT WORKSH DAS2004, P102
[5]  
Hartigan J. A., 1979, Applied Statistics, V28, P100, DOI 10.2307/2346830
[6]   Separating text and background in degraded document images - A comparison of global thresholding techniques for multi-stage thresholding [J].
Leedham, G ;
Varma, S ;
Patankar, A ;
Govindaraju, V .
EIGHTH INTERNATIONAL WORKSHOP ON FRONTIERS IN HANDWRITING RECOGNITION: PROCEEDINGS, 2002, :244-249
[7]  
Leydier Y, 2004, LECT NOTES COMPUT SC, V3163, P252
[8]   Cancellation of show-through in duplex scanning [J].
Sharma, G .
2000 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL II, PROCEEDINGS, 2000, :609-612
[9]  
SMIGIEL E, 2004, 6 INT WORKSH, P125
[10]  
Tan CL, 2002, IEEE T PATTERN ANAL, V24, P1399, DOI 10.1109/TPAMI.2002.1039211