Content-based image retrieval with the normalized information distance

被引：19

作者：

Gondra, Iker

Heisterkamp, Douglas R.

机构：

[1] St Francis Xavier Univ, Dept Math Stat & Comp Sci, Antigonish, NS B2G 2W5, Canada

[2] Oklahoma State Univ, Dept Comp Sci, Stillwater, OK 74078 USA

来源：

COMPUTER VISION AND IMAGE UNDERSTANDING | 2008年 / 111卷 / 02期

关键词：

content-based image retrieval; normalized information distance; Kolmogorov complexity; compression; raw pixel data; visual content; similarity measure;

D O I：

10.1016/j.cviu.2007.11.001

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The main idea of content-based image retrieval (CBIR) is to search on an image's visual content directly. Typically, features (e.g., color, shape, texture) are extracted from each image and organized into a feature vector. Retrieval is performed by image example where a query image is given as input by the user and an appropriate metric is used to find the best matches in the corresponding feature space. We attempt to bypass the feature selection step (and the metric in the corresponding feature space) by following what we believe is the logical continuation of the CBIR idea of searching visual content directly. It is based on the observation that, since ultimately, the entire Visual content of an image is encoded into its raw data (i.e., the raw pixel values), in theory, it should be possible to determine image similarity based on the raw data alone. The main advantage of this approach is its simplicity in that explicit selection, extraction, and weighting of features is not needed. This work is an investigation into an image dissimilarity measure following from the theoretical foundation of the recently proposed normalized information distance (NID) [M. Li, X. Chen, X. Li, B. Ma, P. Vitanyi, The similarity metric, in: Proceedings of the 14th ACM-SIAM Symposium on Discrete Algorithms, 2003, pp. 863-872]. Approximations of the Kolmogorov complexity of an image are created by using different compression methods. Using those approximations, the NID between images is calculated and used as a metric for CBIR. The compression-based approximations to Kolmogorov complexity are shown to be valid by proving that they create statistically significant dissimilarity measures by testing them against a null hypothesis of random retrieval. Furthermore, when compared against several feature-based methods, the NID approach performed surprisingly well. (C) 2007 Elsevier Inc. All rights reserved.

引用

页码：219 / 228

页数：10

共 61 条

[41]

OBERHUMER M, UCL COMPRESSION LIB

[42] A flexible content-based image retrieval system with combined scene description keyword [J].

Ono, A ;

Amano, M ;

Hakaridani, M ;

Satou, T ;

Sakauchi, M .

PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON MULTIMEDIA COMPUTING AND SYSTEMS, 1996, :201-208

[43] Probabilistic feature relevance learning for content-based image retrieval [J].

Peng, J ;

Bhanu, B ;

Qing, S .

COMPUTER VISION AND IMAGE UNDERSTANDING, 1999, 75 (1-2) :150-164

[44]

PENNEBAKER WB, 1993, JPEG STILL IMAGE DAT

[45]

PICARD R, MIT MEDIA LAB VISION

[46] MODELING BY SHORTEST DATA DESCRIPTION [J].

RISSANEN, J .

AUTOMATICA, 1978, 14 (05) :465-471

[47]

SCLAROFF S, 1997, 97005 BOST U CS DEP

[48]

SMEATON AF, 1999, P ACM SIGIR C RES DE, P174

[49] Content-based image retrieval at the end of the early years [J].

Smeulders, AWM ;

Worring, M ;

Santini, S ;

Gupta, A ;

Jain, R .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2000, 22 (12) :1349-1380

[50]

Smith J. R., 1996, Proceedings ACM Multimedia 96, P87, DOI 10.1145/244130.244151

← 1 2 3 4 5 6 7 →