A two-stage scheme for text detection in video images

被引:60
作者
Anthimopoulos, Marios [1 ]
Gatos, Basilis [1 ]
Pratikakis, Ioannis [1 ]
机构
[1] Natl Ctr Sci Res Demokritos, Inst Informat & Telecommun, Computat Intelligence Lab, Athens 15310, Greece
关键词
Text detection; Video OCR; Content-based indexing; SVM; EXTRACTION; SEGMENTATION; LOCALIZATION;
D O I
10.1016/j.imavis.2010.03.004
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes a two-stage system for text detection in video images. In the first stage, text lines are detected based on the edge map of the image leading in a high recall rate with low computational time expenses. In the second stage, the result is refined using a sliding window and an SVM classifier trained on features obtained by a new Local Binary Pattern-based operator (eLBP) that describes the local edge distribution. The whole algorithm is used in a multiresolution fashion enabling detection of characters for a broad size range. Experimental results, based on a new evaluation methodology, show the promising overall performance of the system on a challenging corpus, and prove the superior discriminating ability of the proposed feature set against the best features reported in the literature. (C) 2010 Elsevier B.V. All rights reserved.
引用
收藏
页码:1413 / 1426
页数:14
相关论文
共 37 条
[1]  
ANTHIMOPOULOS M, 2007, INT C COMP VIS THEOR, P161
[2]   A Hybrid System for Text Detection in Video Frames [J].
Anthimopoulos, Marios ;
Gatos, Basilis ;
Pratikakis, Ioannis .
PROCEEDINGS OF THE 8TH IAPR INTERNATIONAL WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS, 2008, :286-292
[3]   Image coding using wavelet transform [J].
Antonini, Marc ;
Barlaud, Michel ;
Mathieu, Pierre ;
Daubechies, Ingrid .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 1992, 1 (02) :205-220
[4]  
Cai M, 2002, IEEE IMAGE PROC, P117
[6]   A localization/verification scheme for finding text in images and video frames based on contrast independent features and machine learning methods [J].
Chen, DT ;
Odobez, JM ;
Thiran, JP .
SIGNAL PROCESSING-IMAGE COMMUNICATION, 2004, 19 (03) :205-217
[7]   Extraction of special effects caption text events from digital video [J].
David Crandall ;
Sameer Antani ;
Rangachar Kasturi .
International Journal on Document Analysis and Recognition, 2003, 5 (2) :138-157
[8]  
Doermann D, 2003, PROC INT CONF DOC, P606
[9]  
Gargi U., 1999, Proceedings of the Fifth International Conference on Document Analysis and Recognition. ICDAR '99 (Cat. No.PR00318), P29, DOI 10.1109/ICDAR.1999.791717
[10]  
Gonzalez R. C., 1992, DIGITAL IMAGE PROCES, V2nd