Text detection in natural scene images based on color prior guided MSER

被引:21
作者
Zhang, Xiangnan [1 ]
Gao, Xinbo [1 ]
Tian, Chunna [1 ]
机构
[1] Xidian Univ, Sch Elect Engn, Xian 710071, Shaanxi, Peoples R China
基金
中国国家自然科学基金;
关键词
Text detection; Text candidate extraction; Maximally stable extremal region; Stroke width transform; Text verification; Deep learning; LOCALIZATION; RECOGNITION; SEGMENTATION; VISION;
D O I
10.1016/j.neucom.2018.03.070
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we focus on text detection in natural scene images which is conducive to content-based wild image analysis and understanding. This task is still an open problem and usually includes two key issues: text candidate extraction and verification. For text candidate extraction, we introduce a color prior to guide the character candidate extraction by Maximally Stable Extremal Region (MSER). The principle of color prior acquirement is to obtain stroke-like textures with modified Stroke Width Transform (SWT), which is based on segmented edges. For text verification, the ideology of deep learning is adopted to distinguish text/non-text candidates. To improve classification accuracy, the results of specific task CNNs are fused. The proposed framework is evaluated on the ICDAR 2013 Robust Reading Competition database. It achieves F-score at 85.87%, which are superior over several state-of-the-art text detection methods. (C) 2018 Elsevier B.V. All rights reserved.
引用
收藏
页码:61 / 71
页数:11
相关论文
共 51 条
[1]  
[Anonymous], 2017, CORR
[2]  
[Anonymous], 2014, IEEE T IMAGE PROCESS, DOI DOI 10.1109/TIP.2014.2353813
[3]   Vision-based target geo-location using a fixed-wing miniature air vehicle [J].
Barber, D. Blake ;
Redding, Joshua D. ;
McLain, Timothy W. ;
Beard, Randal W. ;
Taylor, Clark N. .
JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2006, 47 (04) :361-382
[4]   PhotoOCR: Reading Text in Uncontrolled Conditions [J].
Bissacco, Alessandro ;
Cummins, Mark ;
Netzer, Yuval ;
Neven, Hartmut .
2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, :785-792
[6]  
Chen H., 2011, 2011 18th IEEE International Conference on Image Processing (ICIP 2011), P2609, DOI 10.1109/ICIP.2011.6116200
[7]   Vision for mobile robot navigation: A survey [J].
DeSouza, GN ;
Kak, AC .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2002, 24 (02) :237-267
[8]   Fast Edge Detection Using Structured Forests [J].
Dollar, Piotr ;
Zitnick, C. Lawrence .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2015, 37 (08) :1558-1570
[9]   Fast Feature Pyramids for Object Detection [J].
Dollar, Piotr ;
Appel, Ron ;
Belongie, Serge ;
Perona, Pietro .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2014, 36 (08) :1532-1545
[10]  
Epshtein B, 2010, PROC CVPR IEEE, P2963, DOI 10.1109/CVPR.2010.5540041