Structural image retrieval using automatic image annotation and region based inverted file

被引:10
作者
Zhang, Dengsheng [1 ]
Islam, Md. Monirul [2 ]
Lu, Guojun [1 ]
机构
[1] Monash Univ, Fac Informat Technol, Churchill, Vic 3842, Australia
[2] Bangladesh Univ Engn & Technol, Dept Comp Sci & Engn, Dhaka 1000, Bangladesh
关键词
Machine learning; Image indexing and searching; Image annotation; Vector quantization; Inverted file; Multi-instance learning; Bag-of-features; Region annotation; CLASSIFICATION; SEGMENTATION; FEATURES; SVMS;
D O I
10.1016/j.jvcir.2013.07.004
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Image retrieval has lagged far behind text retrieval despite more than two decades of intensive research effort. Most of the research on image retrieval in the last two decades are on content based image retrieval or image retrieval based on low level features. Recent research in this area focuses on semantic image retrieval using automatic image annotation. Most semantic image retrieval techniques in literature, however, treat an image as a bag of features/words while ignore the structural or spatial information in the image. In this paper, we propose a structural image retrieval method based on automatic image annotation and region based inverted file. In the proposed system, regions in an image are treated the same way as keywords in a structural text document, semantic concepts are learnt from image data to label image regions as keywords and weight is assigned to each keyword according to spatial position and relationship. As the result, images are indexed and retrieved in the same way as structural document retrieval. Specifically, images are broken down to regions which are represented using colour, texture and shape features. Region features are then quantized to create visual dictionaries which are similar to monolingual dictionaries like English or Chinese dictionaries. In the next step, a semantic dictionary similar to a bilingual dictionary like the English-Chinese dictionary is learnt to mapping image regions to semantic concepts. Finally, images are then indexed and retrieved using a novel region based inverted file data structure. Results show the proposed method has significant advantage over the widely used Bayesian annotation models. (C) 2013 Elsevier Inc. All rights reserved.
引用
收藏
页码:1087 / 1098
页数:12
相关论文
共 59 条
[1]  
[Anonymous], 2007, Principles of Data Mining: Undergraduate Topics in Computer Science
[2]   Fast approximate energy minimization via graph cuts [J].
Boykov, Y ;
Veksler, O ;
Zabih, R .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2001, 23 (11) :1222-1239
[3]   Ontology-based image classification using neural networks [J].
Breen, C ;
Khan, L ;
Kumar, A ;
Lei, W .
INTERNET MULTIMEDIA MANAGEMENT SYSTEMS III, 2002, 4862 :198-208
[4]  
Cao L., 2007, ICCV
[5]   Supervised learning of semantic classes for image annotation and retrieval [J].
Carneiro, Gustavo ;
Chan, Antoni B. ;
Moreno, Pedro J. ;
Vasconcelos, Nuno .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2007, 29 (03) :394-410
[6]  
Chang WJ, 2006, LECT NOTES COMPUT SC, V3832, P647, DOI 10.1145/1180639.1180774
[7]   Support vector machines for histogram-based image classification [J].
Chapelle, O ;
Haffner, P ;
Vapnik, VN .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1999, 10 (05) :1055-1064
[8]  
Cusano C., 2004, P INT IM 4, V5304
[9]   Image retrieval: Ideas, influences, and trends of the new age [J].
Datta, Ritendra ;
Joshi, Dhiraj ;
Li, Jia ;
Wang, James Z. .
ACM COMPUTING SURVEYS, 2008, 40 (02)
[10]   Use of neural networks for automatic classification from high-resolution images [J].
Del Frate, Fabio ;
Pacifici, Fabio ;
Schiavon, Giovanni ;
Solimini, Chiara .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2007, 45 (04) :800-809