T-HOG: An effective gradient-based descriptor for single line text regions

被引：61

作者：

Minetto, Rodrigo ^{[1
]}

Thome, Nicolas ^{[2
]}

Cord, Matthieu ^{[2
]}

Leite, Neucimar J. ^{[3
]}

Stolfi, Jorge ^{[3
]}

机构：

[1] Univ Tecnol Fed Parana, DAINF, Curitiba, Parana, Brazil

[2] Univ Paris 06, LIP6, Paris, France

[3] Univ Estadual Campinas, Inst Comp, Campinas, SP, Brazil

来源：

PATTERN RECOGNITION | 2013年 / 46卷 / 03期

关键词：

Text detection; Text classification; Histogram of oriented gradients for text; Text descriptor; IMAGES;

D O I：

10.1016/j.patcog.2012.10.009

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We discuss the use of histogram of oriented gradients (HOG) descriptors as an effective tool for text description and recognition. Specifically, we propose a HOG-based texture descriptor (T-HOG) that uses a partition of the image into overlapping horizontal cells with gradual boundaries, to characterize single-line texts in outdoor scenes. The input of our algorithm is a rectangular image presumed to contain a single line of text in Roman-like characters. The output is a relatively short descriptor that provides an effective input to an SVM classifier. Extensive experiments show that the T-HOG is more accurate than Dalai and Triggs's original HOG-based classifier, for any descriptor size. In addition, we show that the T-HOG is an effective tool for text/non-text discrimination and can be used in various text detection applications. In particular, combining T-HOG with a permissive bottom-up text detector is shown to outperform state-of-the-art text detection systems in two major publicly available databases. (C) 2012 Elsevier Ltd. All rights reserved.

引用

页码：1078 / 1090

页数：13

共 30 条

[1] A two-stage scheme for text detection in video images [J].

Anthimopoulos, Marios ;

Gatos, Basilis ;

Pratikakis, Ioannis .

IMAGE AND VISION COMPUTING, 2010, 28 (09) :1413-1426

[2] Text detection and recognition in images and video frames [J].

Chen, DT ;

Odobez, JM ;

Bourlard, H .

PATTERN RECOGNITION, 2004, 37 (03) :595-608

[3]

Chen XR, 2004, PROC CVPR IEEE, P366

[4] SUPPORT-VECTOR NETWORKS [J].

CORTES, C ;

VAPNIK, V .

MACHINE LEARNING, 1995, 20 (03) :273-297

[5] Histograms of oriented gradients for human detection [J].

Dalal, N ;

Triggs, B .

2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, :886-893

[6]

de Recherche A.N., 2009, ITOWNS PROJECT

[7]

Epshtein B., 2010, IEEE C COMP VIS PATT, P886

[8]

Fairchild M.D., 2005, WILEY IST SERIES IMA

[9] Operator context scanning to support high segmentation rates for real time license plate recognition [J].

Giannoukos, Ioannis ;

Anagnostopoulos, Christos-Nikolaos ;

Loumos, Vassili ;

Kayafas, Eleftherios .

PATTERN RECOGNITION, 2010, 43 (11) :3866-3878

[10]

Hanif Shehzad Muhammad, 2009, 2009 10th International Conference on Document Analysis and Recognition (ICDAR), P1, DOI 10.1109/ICDAR.2009.172

← 1 2 3 →