Text segmentation using superpixel clustering

Cited by: 9
Authors
Zhu, Yuanping [1]
Zhang, Kuang [1]
Affiliation
[1] Tianjin Normal Univ, Dept Comp Sci, 393 Binshuixi Rd, Tianjin, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
text detection; document image processing; image representation; image enhancement; image resolution; image segmentation; pattern clustering; natural scenes; iterative methods; image texture; text segmentation; text image analysis; text image recognition; superpixel-based image representation; local disturbances; text image superpixels; adaptive linear iterative clustering-based text superpixel generation; adaptive superpixel size; adaptive superpixel compactness; boundary adherence; homogeneous superpixels; modified density-based spatial clustering; stroke superpixel verification; KAIST scene text dataset; ICDAR2003 natural scene text image dataset; Street View Text dataset; RECOGNITION; IMAGES; VIDEO;
DOI
10.1049/iet-ipr.2016.0914
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Text segmentation is important for text image analysis and recognition; however, it is challenging because of noise and complex backgrounds in natural scenes. Superpixel-based image representation can enhance robustness to noise and local disturbances, but conventional superpixel algorithms struggle to obtain complete stroke regions and accurate boundaries in text images. In this study, a text segmentation method based on superpixel clustering is proposed. First, to generate accurate superpixels for text images, an adaptive simple linear iterative clustering (SLIC)-based text superpixel generation algorithm is proposed, in which the superpixel size and compactness are calculated adaptively to enhance boundary adherence. Second, to improve how completely the superpixels cover strokes, superpixel clustering merges homogeneous superpixels into larger regions for both strokes and the background; for this step, a modified density-based spatial clustering of applications with noise (DBSCAN) algorithm is proposed. Finally, stroke superpixel verification assigns each region to a stroke or to the background, and the text segmentation result is obtained. The proposed method shows promising robustness to noise and complex background textures. Experimental results on the Korea Advanced Institute of Science and Technology (KAIST) scene text dataset, the International Conference on Document Analysis and Recognition (ICDAR) 2003 natural scene text image dataset and the Street View Text dataset verify that this method is effective and significantly outperforms existing methods.
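The clustering step described in the abstract can be illustrated with a minimal sketch. It is not the authors' modified algorithm: it is plain DBSCAN (Ester et al., 1996) applied to a handful of made-up superpixel feature vectors. The feature choice (a mean colour triple per superpixel), the names `features`, `dbscan` and `region_query`, and the `eps` and `min_pts` values are all assumptions made only for illustration.

```python
from math import dist

NOISE = -1


def region_query(points, i, eps):
    """Indices of all points within eps of points[i], including i itself."""
    return [j for j, q in enumerate(points) if dist(points[i], q) <= eps]


def dbscan(points, eps, min_pts):
    """Plain DBSCAN; returns one cluster label per point, NOISE for outliers."""
    labels = [None] * len(points)
    cluster = -1
    for i in range(len(points)):
        if labels[i] is not None:
            continue  # already assigned to a cluster or marked as noise
        neighbors = region_query(points, i, eps)
        if len(neighbors) < min_pts:
            labels[i] = NOISE  # may later be claimed as a border point
            continue
        cluster += 1  # i is a core point: start a new cluster
        labels[i] = cluster
        queue = [j for j in neighbors if j != i]
        while queue:
            j = queue.pop()
            if labels[j] == NOISE:
                labels[j] = cluster  # border point joins the cluster
            if labels[j] is not None:
                continue
            labels[j] = cluster
            j_neighbors = region_query(points, j, eps)
            if len(j_neighbors) >= min_pts:
                queue.extend(j_neighbors)  # j is core: keep expanding
    return labels


# Hypothetical mean-colour features, one triple per superpixel:
# three dark (stroke-like), four bright (background-like), one outlier.
features = [
    (20, 0, 0), (22, 1, 0), (21, 0, 1),
    (90, 0, 0), (91, 1, 1), (89, 0, 1), (90, 1, 0),
    (55, 30, 30),
]
labels = dbscan(features, eps=3.0, min_pts=2)
print(labels)  # [0, 0, 0, 1, 1, 1, 1, -1]
```

In this toy input the dark and bright superpixels merge into two regions and the isolated one is left as noise. A later verification step, as the abstract describes, would still be needed to decide which merged region is stroke and which is background; that step is not sketched here.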
Pages: 455-464
Number of pages: 10