Learning a maximum margin subspace for image retrieval

Cited by: 151

Authors:

He, Xiaofei ^{[1]}

Cai, Deng ^{[2]}

Han, Jiawei ^{[2]}

Affiliations:

[1] Yahoo Inc, Burbank, CA 91504 USA

[2] Univ Illinois, Dept Comp Sci, Urbana, IL 61801 USA

Source:

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING | 2008 / Vol. 20 / No. 02

Keywords:

multimedia information systems; image retrieval; relevance feedback; dimensionality reduction;

DOI:

10.1109/TKDE.2007.190692

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

One of the fundamental problems in Content-Based Image Retrieval (CBIR) has been the gap between low-level visual features and high-level semantic concepts. To narrow down this gap, relevance feedback is introduced into image retrieval. With the user-provided information, a classifier can be learned to distinguish between positive and negative examples. However, in real-world applications, the number of user feedbacks is usually too small compared to the dimensionality of the image space. In order to cope with the high dimensionality, we propose a novel semisupervised method for dimensionality reduction called Maximum Margin Projection (MMP). MMP aims at maximizing the margin between positive and negative examples at each local neighborhood. Different from traditional dimensionality reduction algorithms such as Principal Component Analysis (PCA) and Linear Discriminant Analysis (LDA), which effectively see only the global euclidean structure, MMP is designed for discovering the local manifold structure. Therefore, MMP is likely to be more suitable for image retrieval, where nearest neighbor search is usually involved. After projecting the images into a lower dimensional subspace, the relevant images get closer to the query image; thus, the retrieval performance can be enhanced. The experimental results on Corel image database demonstrate the effectiveness of our proposed algorithm.
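The idea of a margin between positive and negative examples within each local neighborhood can be illustrated with a small sketch. It is not the MMP algorithm from the paper: it only scores a given projection direction, it ignores the unlabeled samples a semisupervised method would use, and the names `knn`, `project`, and `local_margin` as well as the toy data are assumptions made for this illustration. A direction scores high when projecting onto it keeps each image's differently labeled neighbors far away and its same-label neighbors close.

```python
from math import dist


def knn(points, i, k):
    """Indices of the k nearest neighbors of points[i] (Euclidean), excluding i."""
    others = [j for j in range(len(points)) if j != i]
    return sorted(others, key=lambda j: dist(points[i], points[j]))[:k]


def project(point, direction):
    """1-D projection of a point onto a direction vector (dot product)."""
    return sum(p * d for p, d in zip(point, direction))


def local_margin(points, labels, direction, k=2):
    """Hypothetical local margin score of a projection direction.

    For every point, add the squared projected distance to each of its
    k nearest neighbors with a different label, and subtract the squared
    projected distance to each neighbor with the same label.
    """
    score = 0.0
    for i in range(len(points)):
        for j in knn(points, i, k):
            gap = (project(points[i], direction) - project(points[j], direction)) ** 2
            score += gap if labels[j] != labels[i] else -gap
    return score


# Toy feedback set: relevant images at x = 0, irrelevant images at x = 1.
points = [(0.0, 0.0), (0.0, 1.0), (1.0, 0.0), (1.0, 1.0)]
labels = [True, True, False, False]

along_x = local_margin(points, labels, (1.0, 0.0))  # separates the two classes
along_y = local_margin(points, labels, (0.0, 1.0))  # mixes the two classes
print(along_x, along_y)
```

On this toy set the x-axis scores higher than the y-axis, so a nearest neighbor search after projecting onto x would return same-label images first. A method that learns the subspace would search for the best directions instead of scoring hand-picked ones.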

Pages: 189-201

Number of pages: 13

