Text Detection in Natural Scene Images by Stroke Gabor Words

被引:37
作者
Yi, Chucai [1 ]
Tian, Yingli [2 ]
机构
[1] CUNY, Grad Ctr, Dept Comp Sci, New York, NY 10021 USA
[2] City Univ New York, City Coll & Grad Ctr, Dept Elect Engn, New York, NY USA
来源
11TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2011) | 2011年
关键词
Gabor Filter; Stroke Component; Suitability Measurement; Stroke Gabor Words; SGW Characteristic Distributions;
D O I
10.1109/ICDAR.2011.44
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose a novel algorithm, based on stroke components and descriptive Gabor filters, to detect text regions in natural scene images. Text characters and strings are constructed by stroke components as basic units. Gabor filters are used to describe and analyze the stroke components in text characters or strings. We define a suitability measurement to analyze the confidence of Gabor filters in describing stroke component and the suitability of Gabor filters on an image window. From the training set, we compute a set of Gabor filters that can describe principle stroke components of text by their parameters. Then a K - means algorithm is applied to cluster the descriptive Gabor filters. The clustering centers are defined as Stroke Gabor Words (SGWs) to provide a universal description of stroke components. By suitability evaluation on positive and negative training samples respectively, each SGW generates a pair of characteristic distributions of suitability measurements. On a testing natural scene image, heuristic layout analysis is applied first to extract candidate image windows. Then we compute the principle SGWs for each image window to describe its principle stroke components. Characteristic distributions generated by principle SGWs are used to classify text or non-text windows. Experimental results on benchmark datasets demonstrate that our algorithm can handle complex backgrounds and variant text patterns (font, color, scale, etc.).
引用
收藏
页码:177 / 181
页数:5
相关论文
共 21 条
[1]  
[Anonymous], 2003, ICDAR
[2]  
Banerjee J, 2009, PROC CVPR IEEE, P517, DOI 10.1109/CVPRW.2009.5206601
[3]  
Bhagavathy S., 2003, ICIP
[4]   Automatic detection and recognition of signs from natural scenes [J].
Chen, XL ;
Yang, J ;
Zhang, J ;
Waibel, A .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2004, 13 (01) :87-99
[5]  
Chen XR, 2004, PROC CVPR IEEE, P366
[6]   TWO-DIMENSIONAL SPECTRAL-ANALYSIS OF CORTICAL RECEPTIVE-FIELD PROFILES [J].
DAUGMAN, JG .
VISION RESEARCH, 1980, 20 (10) :847-856
[7]  
Epshtein B, 2010, PROC CVPR IEEE, P2963, DOI 10.1109/CVPR.2010.5540041
[8]  
Hu SY, 2005, INT CONF ACOUST SPEE, P365
[9]  
Jain A. K., 1992, Machine Vision and Applications, V5, P169, DOI 10.1007/BF02626996
[10]   Text extraction and document image segmentation using matched wavelets and MRF model [J].
Kumar, Sunil ;
Gupta, Rajat ;
Khanna, Nitin ;
Chaudhury, Santanu ;
Joshi, Shiv Dutt .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2007, 16 (08) :2117-2128