Robust Text Detection in Natural Scene Images

Citations: 454
Authors
Yin, Xu-Cheng [1, 2]
Yin, Xuwang [1 ]
Huang, Kaizhu [3 ]
Hao, Hong-Wei [4 ]
Affiliations
[1] Univ Sci & Technol Beijing, Dept Comp Sci & Technol, Sch Comp & Commun Engn, Beijing 100083, Peoples R China
[2] Univ Sci & Technol Beijing, Beijing Key Lab Mat Sci Knowledge Engn, Sch Comp & Commun Engn, Beijing 100083, Peoples R China
[3] Xian Jiaotong Liverpool Univ, Dept Elect & Elect Engn, Suzhou 215123, Peoples R China
[4] Chinese Acad Sci, Inst Automat, Beijing 100190, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Scene text detection; maximally stable extremal regions; single-link clustering; distance metric learning; READING TEXT; LOCALIZATION; SEGMENTATION; EXTRACTION;
DOI
10.1109/TPAMI.2013.182
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Text detection in natural scene images is an important prerequisite for many content-based image analysis tasks. In this paper, we propose an accurate and robust method for detecting texts in natural scene images. A fast and effective pruning algorithm is designed to extract Maximally Stable Extremal Regions (MSERs) as character candidates using the strategy of minimizing regularized variations. Character candidates are grouped into text candidates by the single-link clustering algorithm, where distance weights and clustering threshold are learned automatically by a novel self-training distance metric learning algorithm. The posterior probabilities of text candidates corresponding to non-text are estimated with a character classifier; text candidates with high non-text probabilities are eliminated and texts are identified with a text classifier. The proposed system is evaluated on the ICDAR 2011 Robust Reading Competition database; the f-measure is over 76%, much better than the state-of-the-art performance of 71%. Experiments on multilingual, street view, multi-orientation and even born-digital databases also demonstrate the effectiveness of the proposed method. Finally, an online demo of our proposed scene text detection system has been set up at http://prir.ustb.edu.cn/TexStar/scene-text-detection/.
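The grouping step described in the abstract can be illustrated with a minimal sketch of single-link clustering under a weighted distance and a fixed threshold. This is not the authors' implementation: the feature tuples, the weighted absolute-difference distance, and the names `weighted_distance` and `single_link_clusters` are assumptions made for illustration, and the weights and threshold are supplied by hand here rather than learned by the self-training metric learning procedure the paper proposes.

```python
from itertools import combinations


def weighted_distance(a, b, weights):
    """Weighted sum of absolute feature differences (an assumed distance form)."""
    return sum(w * abs(x - y) for w, x, y in zip(weights, a, b))


def single_link_clusters(points, weights, threshold):
    """Group points whose single-link distance is at most `threshold`.

    Cutting a single-link dendrogram at a threshold is equivalent to taking
    connected components of the graph that links every pair of points whose
    distance is at most that threshold, so a union-find structure suffices.
    """
    parent = list(range(len(points)))

    def find(i):
        # Follow parent links to the root, halving the path along the way.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i, j in combinations(range(len(points)), 2):
        if weighted_distance(points[i], points[j], weights) <= threshold:
            ri, rj = find(i), find(j)
            if ri != rj:
                parent[rj] = ri

    clusters = {}
    for i in range(len(points)):
        clusters.setdefault(find(i), []).append(i)
    return sorted(clusters.values())


# Hypothetical character candidates described by (x-centre, height).
candidates = [(0.0, 10.0), (1.0, 10.0), (2.0, 11.0), (20.0, 10.0), (21.0, 10.5)]
groups = single_link_clusters(candidates, weights=(1.0, 0.5), threshold=2.0)
print(groups)  # [[0, 1, 2], [3, 4]]
```

Candidates 0 and 2 are farther apart than the threshold, yet they end up in the same group because each is close to candidate 1; this chaining behaviour is what lets single-link clustering follow a line of adjacent characters.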
Pages: 970-983
Number of pages: 14