A text reading algorithm for natural images

被引:40
作者
Gonzalez, Alvaro [1 ]
Miguel Bergasa, Luis [1 ]
机构
[1] Univ Alcala de Henares, Dept Elect, Alcala De Henares 28871, Madrid, Spain
关键词
Text detection; Text recognition; Character recognition; Character segmentation; Natural images; Scene text detection; CLASSIFICATION; SEGMENTATION;
D O I
10.1016/j.imavis.2013.01.003
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Reading text in natural images has focused again the attention of many researchers during the last few years due to the increasing availability of cheap image-capturing devices in low-cost products like mobile phones. Therefore, as text can be found on any environment, the applicability of text-reading systems is really extensive. For this purpose, we present in this paper a robust method to read text in natural images. It is composed of two main separated stages. Firstly, text is located in the image using a set of simple and fast-to-compute features highly discriminative between character and non-character objects. They are based on geometric and gradient properties. The second part of the system carries out the recognition of the previously detected text. It uses gradient features to recognize single characters and Dynamic Programming (DP) to correct misspelled words. Experimental results obtained with different challenging datasets show that the proposed system exceeds state-of-the-art performance, both in terms of localization and recognition. (c) 2013 Elsevier B.V. All rights reserved.
引用
收藏
页码:255 / 274
页数:20
相关论文
共 38 条
  • [1] Alcantarilla P.F., 2011, THESIS U ALCALA ALCA
  • [2] [Anonymous], 2005, PROC CVPR IEEE
  • [3] [Anonymous], C COMP VIS PATT REC
  • [4] [Anonymous], 2008, VLFeat: An open and portable library of computer vision algorithms
  • [5] Speeded-Up Robust Features (SURF)
    Bay, Herbert
    Ess, Andreas
    Tuytelaars, Tinne
    Van Gool, Luc
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2008, 110 (03) : 346 - 359
  • [6] Shape matching and object recognition using shape contexts
    Belongie, S
    Malik, J
    Puzicha, J
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2002, 24 (04) : 509 - 522
  • [7] Berg AC, 2005, PROC CVPR IEEE, P26
  • [8] Chen H., 2011, ICIP
  • [9] Chen XR, 2004, PROC CVPR IEEE, P366
  • [10] Text Detection and Character Recognition in Scene Images with Unsupervised Feature Learning
    Coates, Adam
    Carpenter, Blake
    Case, Carl
    Satheesh, Sanjeev
    Suresh, Bipin
    Wang, Tao
    Wu, David J.
    Ng, Andrew Y.
    [J]. 11TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2011), 2011, : 440 - 445