Structural image retrieval using automatic image annotation and region based inverted file

被引:10
作者
Zhang, Dengsheng [1 ]
Islam, Md. Monirul [2 ]
Lu, Guojun [1 ]
机构
[1] Monash Univ, Fac Informat Technol, Churchill, Vic 3842, Australia
[2] Bangladesh Univ Engn & Technol, Dept Comp Sci & Engn, Dhaka 1000, Bangladesh
关键词
Machine learning; Image indexing and searching; Image annotation; Vector quantization; Inverted file; Multi-instance learning; Bag-of-features; Region annotation; CLASSIFICATION; SEGMENTATION; FEATURES; SVMS;
D O I
10.1016/j.jvcir.2013.07.004
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Image retrieval has lagged far behind text retrieval despite more than two decades of intensive research effort. Most of the research on image retrieval in the last two decades are on content based image retrieval or image retrieval based on low level features. Recent research in this area focuses on semantic image retrieval using automatic image annotation. Most semantic image retrieval techniques in literature, however, treat an image as a bag of features/words while ignore the structural or spatial information in the image. In this paper, we propose a structural image retrieval method based on automatic image annotation and region based inverted file. In the proposed system, regions in an image are treated the same way as keywords in a structural text document, semantic concepts are learnt from image data to label image regions as keywords and weight is assigned to each keyword according to spatial position and relationship. As the result, images are indexed and retrieved in the same way as structural document retrieval. Specifically, images are broken down to regions which are represented using colour, texture and shape features. Region features are then quantized to create visual dictionaries which are similar to monolingual dictionaries like English or Chinese dictionaries. In the next step, a semantic dictionary similar to a bilingual dictionary like the English-Chinese dictionary is learnt to mapping image regions to semantic concepts. Finally, images are then indexed and retrieved using a novel region based inverted file data structure. Results show the proposed method has significant advantage over the widely used Bayesian annotation models. (C) 2013 Elsevier Inc. All rights reserved.
引用
收藏
页码:1087 / 1098
页数:12
相关论文
共 59 条
[21]  
Islam M.M., 2008, P IEEE INT C MULT EX
[22]  
Jeon J., 2003, Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, P119, DOI DOI 10.1145/860435.860459
[23]  
Jin Y., 2005, P ACM MM 05
[24]   A fusion neural network classifier for image classification [J].
Kang, Sanggil ;
Park, Sungjoon .
PATTERN RECOGNITION LETTERS, 2009, 30 (09) :789-793
[25]  
Kim S, 2004, LECT NOTES COMPUT SC, V3115, P393
[26]   An image retrieval system by impression words and specific object names - IRIS [J].
Kuroda, K ;
Hagiwara, M .
NEUROCOMPUTING, 2002, 43 :259-276
[27]   Content-based multimedia information retrieval: State of the art and challenges [J].
Lew, Michael S. ;
Sebe, Nicu ;
Djeraba, Chabane ;
Jain, Ramesh .
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2006, 2 (01) :1-19
[28]   Real-time computerized annotation of pictures [J].
Li, Jia ;
Wang, James Z. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2008, 30 (06) :985-1002
[29]   Region-based image retrieval with high-level semantics using decision tree learning [J].
Liu, Ying ;
Zhang, Dengsheng ;
Lu, Guojun .
PATTERN RECOGNITION, 2008, 41 (08) :2554-2570
[30]   A survey of content-based image retrieval with high-level semantics [J].
Liu, Ying ;
Zhang, Dengsheng ;
Lu, Guojun ;
Ma, Wei-Ying .
PATTERN RECOGNITION, 2007, 40 (01) :262-282