Scene text extraction in natural scene images using hierarchical feature combining and verification

Cited by: 54
Authors
Kim, KC [1]
Byun, HR [1]
Song, YJ [1]
Choi, YW [1]
Chi, SY [1]
Kim, KK [1]
Chung, YK [1]
Affiliation
[1] Yonsei Univ, Dept Comp Sci, Seoul 120749, South Korea
Source
PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 2 | 2004
DOI
10.1109/ICPR.2004.1334350
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We propose a method that extracts text regions from natural scene images using low-level image features and verifies the extracted regions with a high-level text stroke feature; the two levels of features are combined hierarchically. The low-level features are color continuity, gray-level variation, and color variance. Color continuity is used because most characters in a text region share the same color. Gray-level variation is used because text strokes differ from the background in their gray-level values. Color variance is used because text strokes also differ from the background in color, and this value is more sensitive than gray-level variation. As the high-level feature, text strokes are examined using multi-resolution wavelet transforms on local image areas, and the resulting feature vector is input to a support vector machine (SVM) for verification. We tested the proposed method on various kinds of natural scene images and confirmed that extraction rates are high even for complex images.
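The abstract's remark that color variance is more sensitive than gray-level variation can be illustrated with a minimal sketch. Everything here is an assumption made for illustration and is not taken from the paper: the function name `block_features`, the fixed block size, the use of the channel mean as the gray level, the standard deviation as the "gray-level variation", and the sum of per-channel variances as the "color variance".

```python
import numpy as np

def block_features(image, block=8):
    """Per-block gray-level variation and color variance of an RGB image.

    image: H x W x 3 array. Rows and columns that do not fill a whole
    block are cropped away. All definitions below are illustrative
    assumptions, not the measures used in the paper.
    """
    img = np.asarray(image, dtype=np.float64)
    h = img.shape[0] // block * block
    w = img.shape[1] // block * block
    img = img[:h, :w]
    # Shape (rows, block, cols, block, 3): one tile per (row, col) pair.
    tiles = img.reshape(h // block, block, w // block, block, 3)
    # Assumed gray level: plain mean of the three channels.
    gray = tiles.mean(axis=4)
    # Assumed gray-level variation: standard deviation inside each block.
    gray_variation = gray.std(axis=(1, 3))
    # Assumed color variance: sum of the per-channel variances in each block.
    color_variance = tiles.var(axis=(1, 3)).sum(axis=2)
    return gray_variation, color_variance

# Alternating pure-red and pure-green columns have identical gray levels
# under the channel-mean assumption, yet clearly different colors.
stripes = np.zeros((8, 16, 3))
stripes[:, ::2, 0] = 255   # red columns
stripes[:, 1::2, 1] = 255  # green columns
gray_variation, color_variance = block_features(stripes, block=8)
```

In this toy input the gray-level variation of every block is zero while the color variance is positive, which is the kind of case where a color-based measure would respond and a gray-level measure would not.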
Pages: 679-682
Number of pages: 4
Related papers
7 in total
  • [1] CORTES C, 1995, MACH LEARN, V20, P273, DOI 10.1023/A:1022627411411
  • [2] Automatic text location in images and video frames
    Jain, AK
    Yu, B
    [J]. PATTERN RECOGNITION, 1998, 31 (12) : 2055 - 2076
  • [3] LI C, 2001, P 6 INT C DOC AN REC, P1069, DOI 10.1109/ICDAR.2001.953950
  • [4] Automatic text detection and tracking in digital video
    Li, HP
    Doermann, D
    Kia, O
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2000, 9 (01) : 147 - 156
  • [5] OHYA J, 1995, IEEE T PATTERN ANAL, V16, P67
  • [6] LOCATING TEXT IN COMPLEX COLOR IMAGES
    ZHONG, Y
    KARU, K
    JAIN, AK
    [J]. PATTERN RECOGNITION, 1995, 28 (10) : 1523 - 1535
  • [7] Automatic caption localization in compressed video
    Zhong, Y
    Zhang, HJ
    Jain, AK
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2000, 22 (04) : 385 - 392