A multi-plane approach for text segmentation of complex document images

被引:26
作者
Chen, Yen-Lin [2 ]
Wu, Bing-Fei [1 ]
机构
[1] Natl Chiao Tung Univ, Dept Elect & Control Engn, Hsinchu 30010, Taiwan
[2] Asia Univ, Dept Comp Sci & Informat Engn, Taichung 41354, Taiwan
关键词
Document image processing; Text extraction; Image segmentation; Multilevel thresholding; Region segmentation; Complex document images; SKEW DETECTION; BINARIZATION; EXTRACTION; ROBUST; ALGORITHM; MOUNTAIN; SYSTEM;
D O I
10.1016/j.patcog.2008.10.032
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This study presents a new method, namely the multiplane segmentation approach, for segmenting and extracting textual objects from various real-life complex document images. The proposed multi-plane segmentation approach first decomposes the document image into distinct object planes to extract and separate homogeneous objects including textual regions of interest, non-text objects such as graphics and pictures, and background textures. This process consists of two stages-localized histogram multilevel thresholding and multi-plane region matching and assembling. Then a text extraction procedure is applied Oil the resultant planes to detect and extract textual objects with different characteristics in the respective planes. The proposed approach processes document images regionally and adaptively according to their respective local features. Hence detailed characteristics of the extracted textual objects, Particularly small characters with thin strokes, as well as gradational illuminations of characters, can be well-preserved. Moreover, this way also allows background objects with uneven, gradational, and sharp variations in contrast, illumination, and texture to be handled easily and well. Experimental results on real-life complex document images demonstrate that the proposed approach is effective in extracting textual objects with Various illuminations, sizes, and font styles from various types of complex document images. (C) 2008 Elsevier Ltd. All rights reserved.
引用
收藏
页码:1419 / 1444
页数:26
相关论文
共 45 条
[1]   A ROBUST SYSTEM FOR THRESHOLDING AND SKEW DETECTION IN MIXED TEXT/GRAPHICS DOCUMENTS [J].
Amin, Adnan ;
Wu, Sue .
INTERNATIONAL JOURNAL OF IMAGE AND GRAPHICS, 2005, 5 (02) :247-265
[2]  
BURKE H, 1997, HDB CHARACTER RECOGN
[3]   A recursive thresholding technique for image segmentation [J].
Cheriet, M ;
Said, JN ;
Suen, CY .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 1998, 7 (06) :918-921
[4]  
CHIU SL, 1995, P 6 INT FUZZ SYST AS, P1
[5]   Estimation of skew angles for scanned documents based on piecewise covering by parallelograms [J].
Chou, Chien-Hsing ;
Chu, Shih-Yu ;
Chang, Fu .
PATTERN RECOGNITION, 2007, 40 (02) :443-455
[6]   Iterative multimodel subimage binarization for handwritten character segmentation [J].
Dawoud, A ;
Kamel, MS .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2004, 13 (09) :1223-1230
[7]   The indexing and retrieval of document images: A survey [J].
Doermann, D .
COMPUTER VISION AND IMAGE UNDERSTANDING, 1998, 70 (03) :287-298
[8]  
Fisher J. L., 1990, Proceedings. 10th International Conference on Pattern Recognition (Cat. No.90CH2898-5), P567, DOI 10.1109/ICPR.1990.118166
[9]   A ROBUST ALGORITHM FOR TEXT STRING SEPARATION FROM MIXED TEXT GRAPHICS IMAGES [J].
FLETCHER, LA ;
KASTURI, R .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1988, 10 (06) :910-918
[10]   Skew detection and text line position determination in digitized documents [J].
Gatos, B ;
Papamarkos, N ;
Chamzas, C .
PATTERN RECOGNITION, 1997, 30 (09) :1505-1519