Text extraction from scene images by character appearance and structure modeling

被引:80
作者
Yi, Chucai
Tian, Yingli [1 ]
机构
[1] CUNY, City Coll New York, New York, NY 10031 USA
关键词
Text detection; Scene image; Character appearance; Structure modeling; Structure difference; Structure component co-occurrence; Character identification; CLASSIFICATION;
D O I
10.1016/j.cviu.2012.11.002
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose a novel algorithm to detect text information from natural scene images. Scene text classification and detection are still open research topics. Our proposed algorithm is able to model both character appearance and structure to generate representative and discriminative text descriptors. The contributions of this paper include three aspects: (1) a new character appearance model by a structure correlation algorithm which extracts discriminative appearance features from detected interest points of character samples; (2) a new text descriptor based on structons and correlatons, which model character structure by structure differences among character samples and structure component co-occurrence; and (3) a new text region localization method by combining color decomposition, character contour refinement, and string line alignment to localize character candidates and refine detected text regions. We perform three groups of experiments to evaluate the effectiveness of our proposed algorithm, including text classification, text detection, and character identification. The evaluation results on benchmark datasets demonstrate that our algorithm achieves the state-of-the-art performance on scene text classification and detection, and significantly outperforms the existing algorithms for character identification. (c) 2012 Elsevier Inc. All rights reserved.
引用
收藏
页码:182 / 194
页数:13
相关论文
共 37 条
[1]  
[Anonymous], INT J DOCUMENT ANAL
[2]  
[Anonymous], 2004, INT J COMPUTER VISIO
[3]  
[Anonymous], INT J DOC ANAL RECOG
[4]  
Chen H., 2011, 2011 18th IEEE International Conference on Image Processing (ICIP 2011), P2609, DOI 10.1109/ICIP.2011.6116200
[5]   Automatic detection and recognition of signs from natural scenes [J].
Chen, XL ;
Yang, J ;
Zhang, J ;
Waibel, A .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2004, 13 (01) :87-99
[6]  
Chen XR, 2004, PROC CVPR IEEE, P366
[7]   Histograms of oriented gradients for human detection [J].
Dalal, N ;
Triggs, B .
2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, :886-893
[8]  
Davis J.V., 2007, P 24 INT C MACH LEAR, P209, DOI DOI 10.1145/1273496.1273523
[9]  
de Campos T.E., P INT C COMP VIS THE
[10]  
Dinh VC, 2007, LECT NOTES COMPUT SC, V4843, P200