Text Localization in Natural Images Through Effective Re-Identification of the MSER

Cited by: 0
Authors
Mahmood, Hanaa F. [1]
Li, Baihua [1]
Edirisinghe, Eran [1]
Affiliations
[1] Loughborough Univ, Comp Sci Dept, Loughborough, Leics, England
Source
PROCEEDINGS OF THE 1ST INTERNATIONAL CONFERENCE ON INTERNET OF THINGS AND MACHINE LEARNING (IML'17) | 2017
Keywords
text detection; scene images; ICDAR; feature selection
DOI
10.1145/3109761.3109803
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Text detection and recognition in images have numerous applications in document analysis and information retrieval. This paper proposes an accurate and robust method for detecting text in natural scene images. Text-region candidates are detected using maximally stable extremal regions (MSER), and a machine-learning-based method is then applied to refine and validate the initial detections. The effectiveness of features based on aspect ratio and on GLCM, LBP, and HOG descriptors is investigated. Text-region classifiers (MLP, SVM, and RF) are trained on selections of these features and on their combinations. The publicly available multilingual ICDAR 2003 and ICDAR 2011 datasets are used to evaluate the method. The proposed method achieved excellent performance on both datasets, with significant improvements in precision, recall, and F-measure. The results show that a suitable feature combination and selection approach can significantly increase the accuracy of the algorithms.
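The abstract reports results as precision, recall, and F-measure. As a rough illustration of how these three quantities relate, the sketch below computes them from raw detection counts. It is a count-based simplification, not the ICDAR evaluation protocol and not the authors' code; the function name and the example counts are hypothetical.

```python
def precision_recall_f(true_positives, false_positives, false_negatives):
    """Return (precision, recall, F-measure) from raw detection counts.

    Assumption: a detection counts as either fully correct or fully wrong.
    Benchmark protocols may instead score partial overlaps between boxes.
    """
    detected = true_positives + false_positives   # all regions the detector reported
    actual = true_positives + false_negatives     # all ground-truth text regions
    precision = true_positives / detected if detected else 0.0
    recall = true_positives / actual if actual else 0.0
    denom = precision + recall
    # F-measure is the harmonic mean of precision and recall.
    f_measure = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f_measure


# Hypothetical counts, chosen only to exercise the function.
p, r, f = precision_recall_f(true_positives=80, false_positives=20, false_negatives=40)
print(f"precision={p:.3f} recall={r:.3f} F={f:.3f}")
```

Because the F-measure is a harmonic mean, it stays low unless both precision and recall are high, which is why a candidate-refinement stage that removes false positives without discarding true text regions can raise it.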
Pages: 9