Text detection in natural scene images based on color prior guided MSER

被引:21
作者
Zhang, Xiangnan [1 ]
Gao, Xinbo [1 ]
Tian, Chunna [1 ]
机构
[1] Xidian Univ, Sch Elect Engn, Xian 710071, Shaanxi, Peoples R China
基金
中国国家自然科学基金;
关键词
Text detection; Text candidate extraction; Maximally stable extremal region; Stroke width transform; Text verification; Deep learning; LOCALIZATION; RECOGNITION; SEGMENTATION; VISION;
D O I
10.1016/j.neucom.2018.03.070
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we focus on text detection in natural scene images which is conducive to content-based wild image analysis and understanding. This task is still an open problem and usually includes two key issues: text candidate extraction and verification. For text candidate extraction, we introduce a color prior to guide the character candidate extraction by Maximally Stable Extremal Region (MSER). The principle of color prior acquirement is to obtain stroke-like textures with modified Stroke Width Transform (SWT), which is based on segmented edges. For text verification, the ideology of deep learning is adopted to distinguish text/non-text candidates. To improve classification accuracy, the results of specific task CNNs are fused. The proposed framework is evaluated on the ICDAR 2013 Robust Reading Competition database. It achieves F-score at 85.87%, which are superior over several state-of-the-art text detection methods. (C) 2018 Elsevier B.V. All rights reserved.
引用
收藏
页码:61 / 71
页数:11
相关论文
共 51 条
[21]   ICDAR 2013 Robust Reading Competition [J].
Karatzas, Dimosthenis ;
Shafait, Faisal ;
Uchida, Seiichi ;
Iwamura, Masakazu ;
Gomez i Bigorda, Lluis ;
Robles Mestre, Sergi ;
Mas, Joan ;
Fernandez Mota, David ;
Almazan Almazan, Jon ;
Pere de las Heras, Lluis .
2013 12TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2013, :1484-1493
[22]   AdaBoost for Text Detection in Natural Scene [J].
Lee, Jung-Jin ;
Lee, Pyoung-Hean ;
Lee, Seong-Whan ;
Yuille, Alan ;
Koch, Christof .
11TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2011), 2011, :429-434
[23]  
Liao MH, 2017, AAAI CONF ARTIF INTE, P4161
[24]   Lexicon-driven segmentation and recognition of handwritten character strings for Japanese address reading [J].
Liu, CL ;
Koga, M ;
Fujisawa, H .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2002, 24 (11) :1425-1437
[25]   Scene text extraction based on edges and support vector regression [J].
Lu, Shijian ;
Chen, Tao ;
Tian, Shangxuan ;
Lim, Joo-Hwee ;
Tan, Chew-Lim .
INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2015, 18 (02) :125-135
[26]  
Matas J., 2002, Electronic Proceedings of the 13th British Machine Vision Conference, P384
[27]   Real-Time Lexicon-Free Scene Text Localization and Recognition [J].
Neumann, Lukas ;
Matas, Jiri .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2016, 38 (09) :1872-1885
[28]  
Neumann L, 2011, LECT NOTES COMPUT SC, V6494, P770, DOI 10.1007/978-3-642-19318-7_60
[29]  
Shi B., 2017, CoRR
[30]  
Simonyan K., 2015, ICLR