Text detection and localization in natural scene images based on text awareness score

被引:22
作者
Soni, Rituraj [1 ]
Kumar, Bijendra [1 ]
Chand, Satish [2 ]
机构
[1] NSIT, Dept Comp Engn, New Delhi, India
[2] JNU, Sch Comp & Syst Sci, New Delhi, India
关键词
Text detection and localization; TAS; Fast edge preservation smoothing MSER; Bayesian method; NAIVE BAYES; EXTRACTION; SEGMENTATION;
D O I
10.1007/s10489-018-1338-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Text detection & localization plays an essential role in finding the textual information from natural scene images that can be used in robot navigation, license plate detection, and wearable applications. In this work, we present text detection and localization approach based upon a novel text awareness model that encompasses an improved fast edge preserving and smoothing Maximum Stable Extremal Region (FEPS-MSER) algorithm which uses the fast guided filter to separate the interconnected characters efficiently by removing the mixed pixels around the edges of blurred images. The fast guided filter takes less execution time as compared to other edge-smoothing filters. The combination of five independent and class determining facets namely stroke width deviation, 8-histogram of edge gradients, color variation, occupation ratio, and occupy rate convex area is proposed to differentiate between text and non-text components. The probability of a component to be text is based on Text Awareness Score (TAS) that is calculated by fusing these facets in Naive Bayes using the observation possibility and prior probability of text & non-text components. Naive Bayes classifier helps in accurate and fast determination of the text awareness score and thus helps in the classification of text & non-text components with the help of graph cut algorithm. The text components have been grouped by using the mean-shift clustering algorithm which is a non-parametric technique and does not require the initial knowledge of clusters. The proposed method achieves improved results concerning precision, recall, and f-measure on the ICDAR benchmark datasets for natural scene images.
引用
收藏
页码:1376 / 1405
页数:30
相关论文
共 82 条
[21]   Feature subset selection using naive Bayes for text classification [J].
Feng, Guozhong ;
Guo, Jianhua ;
Jing, Bing-Yi ;
Sun, Tieli .
PATTERN RECOGNITION LETTERS, 2015, 65 :109-115
[22]  
Feng YY, 2016, INT C PATT RECOG, P645, DOI 10.1109/ICPR.2016.7899707
[23]  
Feng YY, 2015, 2015 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (ROBIO), P881, DOI 10.1109/ROBIO.2015.7418882
[24]  
FUKUNAGA K, 1940, IEEE TRANS INF THEOR, V21, P32
[25]   A Robust Approach for Scene Text Localization Using Rule-Based Confidence Map and Grouping [J].
Ghanei, Shaho ;
Faez, Karim .
INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2017, 31 (03)
[26]   A fast hierarchical method for multi-script and arbitrary oriented scene text extraction [J].
Gomez, Lluis ;
Karatzas, Dimosthenis .
INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2016, 19 (04) :335-349
[27]   Text Detection and Recognition on Traffic Panels From Street-Level Imagery Using Visual Appearance [J].
Gonzalez, Alvaro ;
Bergasa, Luis M. ;
Javier Yebes, J. .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2014, 15 (01) :228-238
[28]  
González A, 2012, INT C PATT RECOG, P617
[29]  
Guan LB, 2017, 2017 2ND INTERNATIONAL CONFERENCE ON IMAGE, VISION AND COMPUTING (ICIVC 2017), P26, DOI 10.1109/ICIVC.2017.7984452
[30]  
Guo M., 2016, CHIN AC C PRINT PACK, P305