Text segmentation by integrating hybrid strategy and non-text filtering

被引:0
作者
Minhua Li
Meng Bai
Yingjun Lv
机构
[1] Shandong University of Science and Technology,Department of Electrical Engineering and Information Technology
来源
Multimedia Tools and Applications | 2022年 / 81卷
关键词
Text segmentation; Intensity; Stroke width; Integration; Non-text pixel filtering;
D O I
暂无
中图分类号
学科分类号
摘要
The text embedded in images provides important information for image understanding. Text segmentation is an essential step for text recognition. It is often difficult to segment text from images at low resolution or with complex background. In this paper, a novel text segmentation framework is proposed to solve the problem. The proposed framework adopts a hybrid strategy integrating two different text segmentation methods to produce text candidates. One segmentation method is designed based on the intensity uniformity of text regions, while the other is developed by integrating the features of intensity and stroke width of text. To separate text pixels from the text candidates, a new non-text pixel filtering method is proposed. In the filtering method, an effective classifier is designed based on the number of breaking elements and the k-means clustering algorithm. The performance of the proposed segmentation framework is tested by the pixel-based and recognition-based evaluation methods. Experimental results show that the F-score of the proposed framework on the video caption dataset and born-digital dataset of ICDAR2013 are 95.29% and 89.09% respectively, while the correctly recognized character rate and word rate on the German TV public dataset are 91.00% and 72.33%. The experimental results indicate that the proposed text segmentation framework has excellent performance and high robustness in text segmentation and recognition.
引用
收藏
页码:44505 / 44522
页数:17
相关论文
共 56 条
  • [1] Bataineh B(2011)An adaptive local binarization method for document images based on a novel thresholding method and dynamic windows Pattern Recogn Lett 32 1805-1813
  • [2] Abdullah S(2001)Active contours without edges IEEE Trans Image Process 10 266-277
  • [3] Omar K(2010)Conditional random field for text segmentation from images with complex background Pattern Recogn Lett 31 2295-2308
  • [4] Chan T(2012)A local region-based Chan-Vese model for image segmentation Pattern Recogn 45 2769-2779
  • [5] Vese L(2016)Robust text detection via multi-degree of sharpening and blurring Sig Process 124 259-265
  • [6] Li M(2007)Color text extraction with selective metric-based clustering Comput Vis Image Underst 107 97-107
  • [7] Bai M(2004)Robust wide-baseline stereo from maximally stable extremal regions Image Vis Comput 22 761-767
  • [8] Wang C(2012)AdOtsu: An adaptive and parameterless generalization of Otsu’s method for document image binarization Pattern Recogn 45 2419-2431
  • [9] Liu S(2013)Automatic inpainting scheme for video text detection and removal IEEE Trans Image Process 22 4460-4472
  • [10] Peng Y(1978)Thresholding selection method from gray-level histogram IEEE Trans Syst Man Cybern 9 62-66