Document Indexing Framework for Retrieval of Degraded Document Images

Cited by: 0
Authors
Garg, Ritu [1]
Hassan, Ehtesham [2]
Chaudhury, Santanu [1]
Affiliations
[1] Indian Inst Technol, Dept Elect Engn, New Delhi, India
[2] TCS, Innovat Labs, Delhi, India
Source
2015 13TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR) | 2015
Keywords
NEAREST-NEIGHBOR;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
With the availability of large collections of document images in Indian languages, image-based retrieval has gained popularity. The performance of such systems is affected by the presence of degraded and noisy images. Moreover, optical character recognition (OCR) systems for Indian scripts are not yet robust, leading to noisy OCR'ed text. An information retrieval system designed using inputs from both modalities (image features and OCR-based recognition data) will lead to better retrieval performance than one that uses either modality alone. In this paper we present an indexing methodology that uses multiple kernel learning to combine features from different modalities by joint optimization of search time and accuracy. The proposed methodology is evaluated on document images in Bangla and Devanagari scripts.
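The idea of combining the two modalities through kernels can be illustrated with a minimal sketch. It is not the authors' method: the abstract does not give the kernels, the learning procedure, or the hashing scheme. The sketch assumes each document is a pair of feature vectors (image features, OCR-derived features), assumes a Gaussian kernel per modality, and uses a hypothetical hand-set mixing coefficient `weight` where multiple kernel learning would learn the weights. All function names are illustrative.

```python
import math

def rbf_kernel(x, y, gamma=1.0):
    """Gaussian (RBF) kernel between two equal-length feature vectors."""
    sq_dist = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-gamma * sq_dist)

def combined_kernel(doc_a, doc_b, weight=0.5):
    """Convex combination of an image-feature kernel and an OCR-feature kernel.

    Each document is a pair (image_features, ocr_features).
    `weight` is a hypothetical mixing coefficient in [0, 1]; in multiple
    kernel learning it would be learned, not fixed by hand as it is here.
    """
    if not 0.0 <= weight <= 1.0:
        raise ValueError("weight must lie in [0, 1]")
    (img_a, ocr_a), (img_b, ocr_b) = doc_a, doc_b
    return (weight * rbf_kernel(img_a, img_b)
            + (1.0 - weight) * rbf_kernel(ocr_a, ocr_b))

def kernel_distance(doc_a, doc_b, weight=0.5):
    """Distance induced by the combined kernel:
    sqrt(k(a, a) - 2 k(a, b) + k(b, b)), clamped at zero for rounding error.
    """
    k_aa = combined_kernel(doc_a, doc_a, weight)
    k_bb = combined_kernel(doc_b, doc_b, weight)
    k_ab = combined_kernel(doc_a, doc_b, weight)
    return math.sqrt(max(k_aa - 2.0 * k_ab + k_bb, 0.0))

# Two toy documents with identical image features but different OCR features.
doc_1 = ([0.0, 1.0], [1.0, 0.0])
doc_2 = ([0.0, 1.0], [0.0, 1.0])
print(kernel_distance(doc_1, doc_2, weight=0.5))
```

A distance of this kind is what a distance-based index would consume. With `weight=1.0` only the image features matter, so the two toy documents coincide; lowering `weight` lets the OCR features separate them.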
Pages: 1261-1265
Page count: 5