A Novel Text Sample Selection Model for Scene Text Detection via Bootstrap Learning

被引:1
作者
Kong, Jun [1 ,2 ]
Sun, Jinhua [1 ]
Jiang, Min [1 ]
Hou, Jian [1 ]
机构
[1] Jiangnan Univ, Jiangsu Prov Engn Lab Pattern Recognit & Computat, Wuxi 214122, Peoples R China
[2] Xinjiang Univ, Coll Elect Engn, Urumqi 830047, Peoples R China
来源
KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS | 2019年 / 13卷 / 02期
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
Text detection; bootstrap learning; image segmentation; text sample selection model; ENERGY MINIMIZATION; READING TEXT;
D O I
10.3837/tiis.2019.02.016
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Text detection has been a popular research topic in the field of computer vision. It is difficult for prevalent text detection algorithms to avoid the dependence on datasets. To overcome this problem, we proposed a novel unsupervised text detection algorithm inspired by bootstrap learning. Firstly, the text candidate in a novel form of superpixel is proposed to improve the text recall rate by image segmentation. Secondly, we propose a unique text sample selection model (TSSM) to extract text samples from the current image and eliminate database dependency. Specifically, to improve the precision of samples, we combine maximally stable extremal regions (MSERs) and the saliency map to generate sample reference maps with a double threshold scheme. Finally, a multiple kernel boosting method is developed to generate a strong text classifier by combining multiple single kernel SVMs based on the samples selected from TSSM. Experimental results on standard datasets demonstrate that our text detection method is robust to complex backgrounds and multilingual text and shows stable performance on different standard datasets.
引用
收藏
页码:771 / 789
页数:19
相关论文
共 43 条
[1]  
[Anonymous], 2011, P BMVC
[2]  
[Anonymous], P INT WORKSH ROB REA
[3]  
[Anonymous], P 21 INT C MACH LEAR
[4]   Text/non-text image classification in the wild with convolutional neural networks [J].
Bai, Xiang ;
Shi, Baoguang ;
Zhang, Chengquan ;
Cai, Xuan ;
Qi, Li .
PATTERN RECOGNITION, 2017, 66 :437-446
[5]  
Berry J, 2012, 13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, P1754
[6]   Fast approximate energy minimization via graph cuts [J].
Boykov, Y ;
Veksler, O ;
Zabih, R .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2001, 23 (11) :1222-1239
[7]   An experimental comparison of min-cut/max-flow algorithms for energy minimization in vision [J].
Boykov, Y ;
Kolmogorov, V .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2004, 26 (09) :1124-1137
[8]  
Chen H., 2011, 2011 18th IEEE International Conference on Image Processing (ICIP 2011), P2609, DOI 10.1109/ICIP.2011.6116200
[9]  
Chen XR, 2004, PROC CVPR IEEE, P366
[10]   Canny Text Detector: Fast and Robust Scene Text Localization Algorithm [J].
Cho, Hojin ;
Sung, Myungchul ;
Jun, Bongjin .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :3566-3573