Scene Text Recognition Using Structure-Guided Character Detection and Linguistic Knowledge

被引:29
作者
Shi, Cun-Zhao [1 ]
Wang, Chun-Heng [1 ]
Xiao, Bai-Hua [1 ]
Gao, Song [1 ]
Hu, Jin-Long [1 ]
机构
[1] CASIA, State Key Lab Management & Control Complex Syst, Beijing 100190, Peoples R China
关键词
Character recognition; cropped word recognition; part-based tree-structured models (TSMs); posterior probability; scene text recognition; word spotting; READING TEXT; SEGMENTATION; IMAGES;
D O I
10.1109/TCSVT.2014.2302522
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Scene text recognition has inspired great interests from the computer vision community in recent years. In this paper, we propose a novel scene text-recognition method integrating structure-guided character detection and linguistic knowledge. We use part-based tree structure to model each category of characters so as to detect and recognize characters simultaneously. Since the character models make use of both the local appearance and global structure informations, the detection results are more reliable. For word recognition, we combine the detection scores and language model into the posterior probability of character sequence from the Bayesian decision view. The final word-recognition result is obtained by maximizing the character sequence posterior probability via Viterbi algorithm. Experimental results on a range of challenging public data sets (ICDAR 2003, ICDAR 2011, SVT) demonstrate that the proposed method achieves state-of-the-art performance both for character detection and word recognition.
引用
收藏
页码:1235 / 1250
页数:16
相关论文
共 53 条
[1]  
[Anonymous], 2014, ABBYY FINEREADER 9 0
[2]  
[Anonymous], 2005, PROC CVPR IEEE
[3]  
[Anonymous], P CVPR
[4]  
[Anonymous], 2000, Pattern Classification, DOI DOI 10.1007/978-3-319-57027-3_4
[5]  
[Anonymous], UMCS2012021
[6]  
[Anonymous], 1985, INTRO DIGITAL IMAGE
[7]   Shape matching and object recognition using shape contexts [J].
Belongie, S ;
Malik, J ;
Puzicha, J .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2002, 24 (04) :509-522
[8]  
Berg AC, 2005, PROC CVPR IEEE, P26
[9]  
Chen DT, 2002, INT C PATT RECOG, P227, DOI 10.1109/ICPR.2002.1047438
[10]   Automatic detection and recognition of signs from natural scenes [J].
Chen, XL ;
Yang, J ;
Zhang, J ;
Waibel, A .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2004, 13 (01) :87-99