TEXT LOCATION IN SCENE IMAGES USING VISUAL ATTENTION MODEL

被引:1
作者
Sun, Qiao-Yu [1 ,2 ]
Lu, Yue [1 ]
机构
[1] E China Normal Univ, Dept Comp Sci & Technol, Shanghai 200241, Peoples R China
[2] Huaihai Inst Technol, Dept Elect Engn, Lianyungang 222005, Jiangsu, Peoples R China
关键词
Text location; visual attention; histogram of edge direction; connected component analysis; edge map; VIDEO; EXTRACTION;
D O I
10.1142/S0218001412550087
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Locating text region from an image of nature scene is significantly helpful for better understanding the semantic meaning of the image, which plays an important role in many applications such as image retrieval, image categorization, social media processing, etc. Traditional approach relies on the low level image features to progressively locate the candidate text regions. However, these approaches often suffer for the cases of the clutter background since the adopted low level image features are fairly simple which may not reliably distinguish text region from the clutter background. Motivated by the recent popular research on attention model, salience detection is revisited in this paper. Based on the case of text detection on nature scene image, saliency map is further analyzed and is adjusted accordingly. Using the adjusted saliency map, the candidate text regions detected by the common low level features are further verified. Moreover, efficient low level text feature, Histogram of Edge-direction (HOE), is adopted in this paper, which statistically describes the edge direction information of the region of interest on the image. Encouraging experimental results have been obtained on the nature scene images with the text of various languages.
引用
收藏
页数:22
相关论文
共 33 条
[1]   A two-stage scheme for text detection in video images [J].
Anthimopoulos, Marios ;
Gatos, Basilis ;
Pratikakis, Ioannis .
IMAGE AND VISION COMPUTING, 2010, 28 (09) :1413-1426
[2]  
Byung Tae Chun, 1999, FUZZ-IEEE'99. 1999 IEEE International Fuzzy Systems. Conference Proceedings (Cat. No.99CH36315), P1112, DOI 10.1109/FUZZY.1999.793110
[3]  
Clark P., 2000, BMV2000. Proceedings of the 11th British Machine Vision Conference, P675
[4]  
Epshtein B, 2010, PROC CVPR IEEE, P2963, DOI 10.1109/CVPR.2010.5540041
[5]  
Hanif Shehzad Muhammad, 2009, 2009 10th International Conference on Document Analysis and Recognition (ICDAR), P1, DOI 10.1109/ICDAR.2009.172
[6]   Automatic detection and verification of text regions in news video frames [J].
Hu, JM ;
Xi, J ;
Wu, LD .
INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2002, 16 (02) :257-271
[7]   A model of saliency-based visual attention for rapid scene analysis [J].
Itti, L ;
Koch, C ;
Niebur, E .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1998, 20 (11) :1254-1259
[8]  
Jain A. K., 1992, Machine Vision and Applications, V5, P169, DOI 10.1007/BF02626996
[9]   Automatic text location in images and video frames [J].
Jain, AK ;
Yu, B .
PATTERN RECOGNITION, 1998, 31 (12) :2055-2076
[10]  
Jing Zhang, 2010, Proceedings of the 2010 20th International Conference on Pattern Recognition (ICPR 2010), P3979, DOI 10.1109/ICPR.2010.968