Scene Text Recognition using Part-based Tree-structured Character Detection

被引:87
作者
Shi, Cunzhao [1 ]
Wang, Chunheng [1 ]
Xiao, Baihua [1 ]
Zhang, Yang [1 ]
Gao, Song [1 ]
Zhang, Zhong [1 ]
机构
[1] CASIA, State Key Lab Management & Control Complex Syst, Beijing, Peoples R China
来源
2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2013年
关键词
D O I
10.1109/CVPR.2013.381
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Scene text recognition has inspired great interests from the computer vision community in recent years. In this paper, we propose a novel scene text recognition method using part-based tree-structured character detection. Different from conventional multi-scale sliding window character detection strategy, which does not make use of the character-specific structure information, we use part-based tree-structure to model each type of character so as to detect and recognize the characters at the same time. While for word recognition, we build a Conditional Random Field model on the potential character locations to incorporate the detection scores, spatial constraints and linguistic knowledge into one framework. The final word recognition result is obtained by minimizing the cost function defined on the random field. Experimental results on a range of challenging public datasets (ICDAR 2003, ICDAR 2011, SVT) demonstrate that the proposed method outperforms state-of-the-art methods significantly both for character detection and word recognition.
引用
收藏
页码:2961 / 2968
页数:8
相关论文
共 20 条
[1]  
Chen XR, 2004, PROC CVPR IEEE, P366
[2]   Histograms of oriented gradients for human detection [J].
Dalal, N ;
Triggs, B .
2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, :886-893
[3]  
de Campos T., 2009, VISAP
[4]   Pictorial structures for object recognition [J].
Felzenszwalb, PF ;
Huttenlocher, DP .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2005, 61 (01) :55-79
[5]  
Judd T, 2009, IEEE I CONF COMP VIS, P2106, DOI 10.1109/ICCV.2009.5459462
[6]   Convergent tree-reweighted message passing for energy minimization [J].
Kolmogorov, Vladimir .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2006, 28 (10) :1568-1583
[7]  
Lucas SM, 2003, PROC INT CONF DOC, P682
[8]  
Mishra A., 2012, CVPR
[9]   Scene Text Recognition using Higher Order Language Priors [J].
Mishra, Anand ;
Alahari, Karteek ;
Jawahar, C. V. .
PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2012, 2012,
[10]  
Neumann L, 2012, PROC CVPR IEEE, P3538, DOI 10.1109/CVPR.2012.6248097