Binarization of Color Character Strings in Scene Images Using K-means Clustering and Support Vector Machines

Cited by: 21
Authors
Wakahara, Toru [1]
Kita, Kohei [1]
Affiliations
[1] Hosei Univ, Fac Comp & Informat Sci, Koganei, Tokyo 1818584, Japan
Source
11TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2011) | 2011
Keywords
binarization of multicolored character strings; K-means clustering; support vector machines
DOI
10.1109/ICDAR.2011.63
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper addresses the problem of binarizing multicolored character strings in scene images subject to heavy image degradations and complex backgrounds. The proposed method consists of four steps. The first step generates tentatively binarized images via every dichotomization of K clusters obtained by K-means clustering of the constituent pixels of a given image in the HSI color space. The total number of tentatively binarized images equals 2^K - 2. The second step divides each binarized image into a sequence of "single-character-like" images using an average aspect ratio of a character. The third step uses support vector machines (SVM) to determine whether each "single-character-like" image represents a character or a non-character. We feed the SVM with the mesh feature to output the degree of "character-likeness." The fourth step selects the single binarized image with the maximum average "character-likeness" as the optimal binarization result. Experiments using a total of 1000 character strings extracted from the ICDAR 2003 robust word recognition dataset show that the proposed method achieves a correct binarization rate of 80.8%.
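The count of tentatively binarized images in the first step can be checked with a small sketch. The abstract does not give an implementation, so everything below is an assumption made for illustration: the function name `tentative_binarizations` is hypothetical, and the cluster label map is taken as given rather than computed by K-means in HSI space. Each non-empty proper subset of the K clusters is treated as foreground and the rest as background, which yields 2^K - 2 candidates.

```python
import numpy as np

def tentative_binarizations(labels: np.ndarray, k: int):
    """Yield (foreground_clusters, mask) for every dichotomization of k clusters.

    Illustrative sketch only: `labels` is assumed to hold one cluster
    index in 0..k-1 per pixel, e.g. from a prior K-means run (not shown).
    """
    # Bitmasks 1 .. 2^k - 2 skip the two trivial cases:
    # no cluster as foreground (0) and all clusters as foreground (2^k - 1).
    for bits in range(1, 2 ** k - 1):
        foreground = [c for c in range(k) if (bits >> c) & 1]
        mask = np.isin(labels, foreground)  # True where the pixel is foreground
        yield foreground, mask

# Hypothetical 2x3 label map with K = 3 clusters.
labels = np.array([[0, 1, 2],
                   [2, 1, 0]])
candidates = list(tentative_binarizations(labels, k=3))
print(len(candidates))  # 2**3 - 2 = 6
```

Every foreground set and its complement both appear in the enumeration, so both polarities (light text on dark and dark text on light) are covered. This is consistent with the 2^K - 2 total stated in the abstract.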
Pages: 274-278
Page count: 5