Towards Mobile Document Image Retrieval for Digital Library

Cited by: 17
Authors
Duan, Ling-Yu [1]
Ji, Rongrong [1, 2]
Chen, Zhang [1]
Huang, Tiejun [1]
Gao, Wen [1]
Affiliations
[1] Peking Univ, Sch Elect Engn & Comp Sci, Inst Digital Media, Beijing 100871, Peoples R China
[2] Xiamen Univ, Sch Informat Sci & Technol, Xiamen, Peoples R China
Keywords
Digital library; Hamming space; inner-distance; JBIG2; compression; K-D tree; line drawing retrieval; mobile visual search; shape context; SHAPE; RECOGNITION; SIMILARITY
DOI
10.1109/TMM.2013.2293063
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
With the proliferation of mobile devices, recent years have witnessed an emerging potential to integrate mobile visual search techniques into digital libraries. This mobile application scenario poses significant and unique challenges for document image search. Photographs taken with mobile devices make it difficult to extract discriminative features from the landmark regions of documents, such as line drawings and text layouts. In addition, both search scalability and query delivery latency remain challenging issues in mobile document search. The former relies on an effective yet memory-light indexing structure to accomplish fast online search, while the latter imposes a bit-budget constraint on query images sent over the wireless link. In this paper, we propose a novel mobile document image retrieval framework, consisting of a robust Local Inner-distance Shape Context (LISC) descriptor of line drawings, a Hamming distance KD-Tree for scalable and memory-light document indexing, and a JBIG2 based query compression scheme, together with a Retinex based enhancement and an OTSU based binarization, to reduce the latency of delivering the query while maintaining query quality in terms of search performance. We have extensively validated the key techniques in this framework by quantitative comparison to alternative approaches.
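As an illustration of one preprocessing step named in the abstract, the following is a minimal sketch of Otsu-style binarization on a flat list of 8-bit grayscale values. It shows only the textbook thresholding idea (pick the threshold that maximizes between-class variance). It is not the paper's implementation, and the function names `otsu_threshold` and `binarize` are hypothetical, chosen here for illustration.

```python
def otsu_threshold(pixels):
    """Return the threshold t in 0..255 that maximizes between-class variance.

    Assumption: `pixels` is a non-empty flat list of integers in 0..255.
    Values <= t form one class, values > t form the other.
    """
    # Build the 256-bin intensity histogram.
    hist = [0] * 256
    for p in pixels:
        hist[p] += 1

    total = len(pixels)
    sum_all = sum(i * h for i, h in enumerate(hist))

    best_t, best_var = 0, -1.0
    w0 = 0      # number of pixels in the low class
    sum0 = 0    # intensity sum of the low class
    for t in range(256):
        w0 += hist[t]
        sum0 += t * hist[t]
        if w0 == 0:
            continue          # low class still empty
        w1 = total - w0
        if w1 == 0:
            break             # high class empty, nothing more to split
        mu0 = sum0 / w0
        mu1 = (sum_all - sum0) / w1
        # Between-class variance, up to a constant factor of 1 / total**2.
        var = w0 * w1 * (mu0 - mu1) ** 2
        if var > best_var:
            best_var, best_t = var, t
    return best_t


def binarize(pixels, t):
    """Map each pixel to 1 if it is brighter than t, else 0."""
    return [1 if p > t else 0 for p in pixels]


if __name__ == "__main__":
    # Toy "image": half dark pixels, half bright pixels.
    image = [10] * 50 + [200] * 50
    t = otsu_threshold(image)
    print(t, sum(binarize(image, t)))
```

A bilevel image of this kind is the type of input that a bilevel codec such as JBIG2 operates on, which is why binarization precedes query compression in the pipeline the abstract describes.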
Pages: 346-359
Page count: 14