Content-based image retrieval with the normalized information distance

被引:19
作者
Gondra, Iker
Heisterkamp, Douglas R.
机构
[1] St Francis Xavier Univ, Dept Math Stat & Comp Sci, Antigonish, NS B2G 2W5, Canada
[2] Oklahoma State Univ, Dept Comp Sci, Stillwater, OK 74078 USA
关键词
content-based image retrieval; normalized information distance; Kolmogorov complexity; compression; raw pixel data; visual content; similarity measure;
D O I
10.1016/j.cviu.2007.11.001
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The main idea of content-based image retrieval (CBIR) is to search on an image's visual content directly. Typically, features (e.g., color, shape, texture) are extracted from each image and organized into a feature vector. Retrieval is performed by image example where a query image is given as input by the user and an appropriate metric is used to find the best matches in the corresponding feature space. We attempt to bypass the feature selection step (and the metric in the corresponding feature space) by following what we believe is the logical continuation of the CBIR idea of searching visual content directly. It is based on the observation that, since ultimately, the entire Visual content of an image is encoded into its raw data (i.e., the raw pixel values), in theory, it should be possible to determine image similarity based on the raw data alone. The main advantage of this approach is its simplicity in that explicit selection, extraction, and weighting of features is not needed. This work is an investigation into an image dissimilarity measure following from the theoretical foundation of the recently proposed normalized information distance (NID) [M. Li, X. Chen, X. Li, B. Ma, P. Vitanyi, The similarity metric, in: Proceedings of the 14th ACM-SIAM Symposium on Discrete Algorithms, 2003, pp. 863-872]. Approximations of the Kolmogorov complexity of an image are created by using different compression methods. Using those approximations, the NID between images is calculated and used as a metric for CBIR. The compression-based approximations to Kolmogorov complexity are shown to be valid by proving that they create statistically significant dissimilarity measures by testing them against a null hypothesis of random retrieval. Furthermore, when compared against several feature-based methods, the NID approach performed surprisingly well. (C) 2007 Elsevier Inc. All rights reserved.
引用
收藏
页码:219 / 228
页数:10
相关论文
共 61 条
[1]  
[Anonymous], 1993, Ten Lectures of Wavelets
[2]  
[Anonymous], 2003, Statistical pattern recognition
[3]   Chain letters & evolutionary histories [J].
Bennett, CH ;
Li, M ;
Ma, B .
SCIENTIFIC AMERICAN, 2003, 288 (06) :76-81
[4]   Information distance [J].
Bennett, CH ;
Gacs, P ;
Li, M ;
Vitanyi, FMB ;
Zurek, WH .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1998, 44 (04) :1407-1423
[5]   Blobworld: Image segmentation using expectation-maximization and its application to image querying [J].
Carson, C ;
Belongie, S ;
Greenspan, H ;
Malik, J .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2002, 24 (08) :1026-1038
[6]  
CHEN MY, 2005, P ACM INT C MULT, P902
[7]  
Chen YQ, 2001, IEEE IMAGE PROC, P34, DOI 10.1109/ICIP.2001.958946
[8]   A region-based fuzzy feature matching approach to content-based image retrieval [J].
Chen, YX ;
Wang, JZ .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2002, 24 (09) :1252-1267
[9]   An optimized interaction strategy for Bayesian relevance feedback [J].
Cox, IJ ;
Miller, ML ;
Minka, TP ;
Yianilos, PN .
1998 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, PROCEEDINGS, 1998, :553-558
[10]  
Duda RO, 2006, PATTERN CLASSIFICATI