TextFinder: An automatic system to detect and recognize text in images

被引:220
作者
Wu, V [1 ]
Manmatha, R [1 ]
Riseman, EM [1 ]
机构
[1] Univ Massachusetts, Dept Comp Sci, Amherst, MA 01003 USA
基金
美国国家科学基金会;
关键词
text reading; character recognition; multimedia indexing; text detection; texture segmentation; filters; hierarchical processing; binarization; connected-components;
D O I
10.1109/34.809116
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A robust system is proposed to automatically detect and extract text in images from different sources, including video, newspapers, advertisements, stock certificates, photographs, and checks. Text is first detected using multiscale texture segmentation and spatial cohesion constraints, then cleaned up and extracted using a histogram-based binarization algorithm. An automatic performance evaluation scheme is also proposed.
引用
收藏
页码:1224 / 1229
页数:6
相关论文
共 19 条
[1]   OMNIDOCUMENT TECHNOLOGIES [J].
BOKSER, M .
PROCEEDINGS OF THE IEEE, 1992, 80 (07) :1066-1078
[2]   Multiscale segmentation of unstructured document pages using soft decision integration [J].
Etemad, K ;
Doermann, D ;
Chellappa, R .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1997, 19 (01) :92-96
[3]   A ROBUST ALGORITHM FOR TEXT STRING SEPARATION FROM MIXED TEXT GRAPHICS IMAGES [J].
FLETCHER, LA ;
KASTURI, R .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1988, 10 (06) :910-918
[4]   Automatic text location in images and video frames [J].
Jain, AK ;
Yu, B .
PATTERN RECOGNITION, 1998, 31 (12) :2055-2076
[5]  
KAMEL M, 1993, CVGIP-GRAPH MODEL IM, V55, P203, DOI 10.1006/cgip.1993.1015
[6]   PREATTENTIVE TEXTURE-DISCRIMINATION WITH EARLY VISION MECHANISMS [J].
MALIK, J ;
PERONA, P .
JOURNAL OF THE OPTICAL SOCIETY OF AMERICA A-OPTICS IMAGE SCIENCE AND VISION, 1990, 7 (05) :923-932
[7]   HISTORICAL REVIEW OF OCR RESEARCH-AND-DEVELOPMENT [J].
MORI, S ;
SUEN, CY ;
YAMAMOTO, K .
PROCEEDINGS OF THE IEEE, 1992, 80 (07) :1029-1058
[8]   A PROTOTYPE DOCUMENT IMAGE-ANALYSIS SYSTEM FOR TECHNICAL JOURNALS [J].
NAGY, G ;
SETH, S ;
VISWANATHAN, M .
COMPUTER, 1992, 25 (07) :10-22
[9]  
NEVATIA R, 1977, IEEE T SYST MAN CYB, V7, P820
[10]   POSTAL ADDRESS BLOCK LOCATION IN REAL-TIME [J].
PALUMBO, PW ;
SRIHARI, SN ;
SOH, J ;
SRIDHAR, R ;
DEMJANENKO, V .
COMPUTER, 1992, 25 (07) :34-42