Text detection and localization in natural scene images based on text awareness score

Cited by: 21
Authors
Soni, Rituraj [1]
Kumar, Bijendra [1]
Chand, Satish [2]
Affiliations
[1] NSIT, Dept Comp Engn, New Delhi, India
[2] JNU, Sch Comp & Syst Sci, New Delhi, India
Keywords
Text detection and localization; TAS; Fast edge preservation smoothing MSER; Bayesian method; Naive Bayes; Extraction; Segmentation
DOI
10.1007/s10489-018-1338-4
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Text detection and localization play an essential role in extracting textual information from natural scene images, with applications in robot navigation, license plate detection, and wearable devices. In this work, we present a text detection and localization approach based on a novel text awareness model. The model incorporates an improved fast edge preserving and smoothing Maximally Stable Extremal Region (FEPS-MSER) algorithm, which uses the fast guided filter to separate interconnected characters efficiently by removing the mixed pixels around the edges of blurred images. The fast guided filter takes less execution time than other edge-smoothing filters. A combination of five independent, class-determining facets (stroke width deviation, 8-histogram of edge gradients, color variation, occupation ratio, and occupy rate convex area) is proposed to differentiate text from non-text components. The probability that a component is text is given by the Text Awareness Score (TAS), which is calculated by fusing these facets in Naive Bayes using the observation likelihood and the prior probability of text and non-text components. The Naive Bayes classifier allows the text awareness score to be determined accurately and quickly, and the score is then used with the graph cut algorithm to classify components as text or non-text. The text components are grouped using the mean-shift clustering algorithm, a non-parametric technique that does not require the number of clusters to be known in advance. The proposed method achieves improved precision, recall, and f-measure on the ICDAR benchmark datasets for natural scene images.
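As a rough illustration of the Naive Bayes fusion described in the abstract, the following minimal Python sketch combines per-facet likelihoods with class priors into a posterior probability of "text". It is not the authors' implementation: the function names, the reduction of each facet to a single scalar, the Gaussian class-conditional model, and all parameter values are assumptions made here for illustration only.

```python
import math


def gaussian_log_pdf(x, mean, std):
    """Log density of a 1-D Gaussian (assumed likelihood model, not from the paper)."""
    return -0.5 * math.log(2.0 * math.pi * std ** 2) - (x - mean) ** 2 / (2.0 * std ** 2)


def text_awareness_score(features, text_params, nontext_params, prior_text=0.5):
    """Posterior P(text | features) under the Naive Bayes independence assumption.

    features:       dict mapping facet name -> scalar value (hypothetical encoding)
    text_params:    dict mapping facet name -> (mean, std) for the text class
    nontext_params: dict mapping facet name -> (mean, std) for the non-text class
    prior_text:     prior probability of the text class
    """
    # Start from the log priors, then add one log-likelihood term per facet.
    log_text = math.log(prior_text)
    log_nontext = math.log(1.0 - prior_text)
    for name, value in features.items():
        log_text += gaussian_log_pdf(value, *text_params[name])
        log_nontext += gaussian_log_pdf(value, *nontext_params[name])

    # Normalize in a numerically stable way (log-sum-exp over the two classes).
    m = max(log_text, log_nontext)
    e_text = math.exp(log_text - m)
    e_nontext = math.exp(log_nontext - m)
    return e_text / (e_text + e_nontext)


if __name__ == "__main__":
    # Made-up parameters for two of the five facets named in the abstract.
    text_params = {"stroke_width_deviation": (0.1, 0.05), "occupation_ratio": (0.5, 0.15)}
    nontext_params = {"stroke_width_deviation": (0.6, 0.2), "occupation_ratio": (0.3, 0.2)}
    component = {"stroke_width_deviation": 0.12, "occupation_ratio": 0.55}
    print(text_awareness_score(component, text_params, nontext_params, prior_text=0.4))
```

In the method summarized above, the resulting score feeds a graph cut step for the final text versus non-text labeling; that step is not sketched here.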
Pages: 1376-1405
Number of pages: 30