Document image retrieval through word shape coding

被引:48
作者
Lu, Shijian [1 ]
Li, Linlin [2 ]
Tan, Chew Lim [2 ]
机构
[1] Agcy Sci Technol & Res, Inst Infocomm Res, Singapore 119613, Singapore
[2] Natl Univ Singapore, Sch Comp, Dept Comp Sci, Singapore 117543, Singapore
关键词
document image retrieval; document image analysis; word shape coding;
D O I
10.1109/TPAMI.2008.89
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a document retrieval technique that is capable of searching document images without optical character recognition (OCR). The proposed technique retrieves document images by a new word shape coding scheme, which captures the document content through annotating each word image by a word shape code. In particular, we annotate word images by using a set of topological shape features including character ascenders/descenders, character holes, and character water reservoirs. With the annotated word shape codes, document images can be retrieved by either query keywords or a query document image. Experimental results show that the proposed document image retrieval technique is fast, efficient, and tolerant to various types of document degradation.
引用
收藏
页码:1913 / 1918
页数:6
相关论文
共 18 条
[1]  
BREUEL TM, 2005, P INT WORKSH DOC AN, P275
[2]  
CHEN FR, 1995, P SOC PHOTO-OPT INS, V2422, P256, DOI 10.1117/12.205828
[3]  
Khoubyari S., 1993, Proceedings. Second Annual Symposium on Document Analysis and Information Retrieval, P217
[4]   Content-based multimedia information retrieval: State of the art and challenges [J].
Lew, Michael S. ;
Sebe, Nicu ;
Djeraba, Chabane ;
Jain, Ramesh .
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2006, 2 (01) :1-19
[5]   Retrieval of machine-printed Latin documents through Word Shape Coding [J].
Lu, Shijian ;
Tan, Chew Lim .
PATTERN RECOGNITION, 2008, 41 (05) :1799-1809
[6]   Script and language identification in noisy and degraded document images [J].
Lu Shijian ;
Tan, Chew Lim .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2008, 30 (01) :14-24
[7]   Information retrieval in document image databases [J].
Lu, Y ;
Tan, CL .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2004, 16 (11) :1398-1410
[8]  
NAKAYAMA T, 1996, P INT C COMP LING CO, P818
[9]  
NAKAYAMA T, 1994, P 4 C APPL NAT LANG, P22
[10]   THRESHOLD SELECTION METHOD FROM GRAY-LEVEL HISTOGRAMS [J].
OTSU, N .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1979, 9 (01) :62-66