Colour text segmentation in web images based on human perception

被引:24
作者
Karatzas, D.
Antonacopoulos, A. [1 ]
机构
[1] Univ Salford, PRImA Res Lab, Sch Comp Sci & Engn, Salford M5 4WT, Lancs, England
[2] Univ Southampton, Sch Elect & Comp Sci, Southampton SO17 1BJ, Hants, England
关键词
web document image analysis; colour document analysis; character segmentation; text segmentation; colour images;
D O I
10.1016/j.imavis.2006.05.003
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
There is a significant need to extract and analyse the text in images on Web documents, for effective indexing, semantic analysis and even presentation by non-visual means (e.g., audio). This paper argues that the challenging segmentation stage for such images benefits from a human perspective of colour perception in preference to RGB colour space analysis. The proposed approach enables the segmentation of text in complex situations such as in the presence of varying colour and texture (characters and background). More precisely, characters are segmented as distinct regions with separate chromaticity and/or lightness by performing a layer decomposition of the image. The method described here is a result of the authors' systematic approach to approximate the human colour perception characteristics for the identification of character regions. In this instance, the image is decomposed by performing histogram analysis of Hue and Lightness in the HLS colour space and merging using information on human discrimination of wavelength and luminance. (c) 2006 Elsevier B.V. All rights reserved.
引用
收藏
页码:564 / 577
页数:14
相关论文
共 22 条
[1]  
[Anonymous], P ACM INT C DIG LIB
[2]   Page segmentation using the description of the background [J].
Antonacopoulos, A .
COMPUTER VISION AND IMAGE UNDERSTANDING, 1998, 70 (03) :350-369
[3]  
Antonacopoulos A, 2001, P SOC PHOTO-OPT INS, V4311, P198
[4]  
ANTONACOPOULOS A, 2003, WEB DOCUMENT ANAL CH
[5]  
ANTONACOPOULOS A, 1999, VISUAL REPRESENTATIO
[6]  
ANTONACOPOULOS A, 2000, P 4 IAPR WORKSH DOC, P515
[7]  
BEDFORD RE, 1958, J OPT SOC AM, V48
[8]  
Brown M.K., 2001, P 1 INT WORKSH WEB D, P59
[9]   Recognising text in real scenes [J].
Clark P. ;
Mirmehdi M. .
International Journal on Document Analysis and Recognition, 2002, 4 (4) :243-257
[10]   Automatic text location in images and video frames [J].
Jain, AK ;
Yu, B .
PATTERN RECOGNITION, 1998, 31 (12) :2055-2076