T-HOG: An effective gradient-based descriptor for single line text regions

被引:61
作者
Minetto, Rodrigo [1 ]
Thome, Nicolas [2 ]
Cord, Matthieu [2 ]
Leite, Neucimar J. [3 ]
Stolfi, Jorge [3 ]
机构
[1] Univ Tecnol Fed Parana, DAINF, Curitiba, Parana, Brazil
[2] Univ Paris 06, LIP6, Paris, France
[3] Univ Estadual Campinas, Inst Comp, Campinas, SP, Brazil
关键词
Text detection; Text classification; Histogram of oriented gradients for text; Text descriptor; IMAGES;
D O I
10.1016/j.patcog.2012.10.009
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We discuss the use of histogram of oriented gradients (HOG) descriptors as an effective tool for text description and recognition. Specifically, we propose a HOG-based texture descriptor (T-HOG) that uses a partition of the image into overlapping horizontal cells with gradual boundaries, to characterize single-line texts in outdoor scenes. The input of our algorithm is a rectangular image presumed to contain a single line of text in Roman-like characters. The output is a relatively short descriptor that provides an effective input to an SVM classifier. Extensive experiments show that the T-HOG is more accurate than Dalai and Triggs's original HOG-based classifier, for any descriptor size. In addition, we show that the T-HOG is an effective tool for text/non-text discrimination and can be used in various text detection applications. In particular, combining T-HOG with a permissive bottom-up text detector is shown to outperform state-of-the-art text detection systems in two major publicly available databases. (C) 2012 Elsevier Ltd. All rights reserved.
引用
收藏
页码:1078 / 1090
页数:13
相关论文
共 30 条
[1]   A two-stage scheme for text detection in video images [J].
Anthimopoulos, Marios ;
Gatos, Basilis ;
Pratikakis, Ioannis .
IMAGE AND VISION COMPUTING, 2010, 28 (09) :1413-1426
[2]   Text detection and recognition in images and video frames [J].
Chen, DT ;
Odobez, JM ;
Bourlard, H .
PATTERN RECOGNITION, 2004, 37 (03) :595-608
[3]  
Chen XR, 2004, PROC CVPR IEEE, P366
[4]   SUPPORT-VECTOR NETWORKS [J].
CORTES, C ;
VAPNIK, V .
MACHINE LEARNING, 1995, 20 (03) :273-297
[5]   Histograms of oriented gradients for human detection [J].
Dalal, N ;
Triggs, B .
2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, :886-893
[6]  
de Recherche A.N., 2009, ITOWNS PROJECT
[7]  
Epshtein B., 2010, IEEE C COMP VIS PATT, P886
[8]  
Fairchild M.D., 2005, WILEY IST SERIES IMA
[9]   Operator context scanning to support high segmentation rates for real time license plate recognition [J].
Giannoukos, Ioannis ;
Anagnostopoulos, Christos-Nikolaos ;
Loumos, Vassili ;
Kayafas, Eleftherios .
PATTERN RECOGNITION, 2010, 43 (11) :3866-3878
[10]  
Hanif Shehzad Muhammad, 2009, 2009 10th International Conference on Document Analysis and Recognition (ICDAR), P1, DOI 10.1109/ICDAR.2009.172