Collaborative image retrieval via regularized metric learning

被引:35
作者
Si, Luo [1 ]
Jin, Rong
Hoi, Steven C. H.
Lyu, Michael R.
机构
[1] Carnegie Mellon Univ, Sch Comp Sci, Language Technol Inst, Pittsburgh, PA 15213 USA
[2] Michigan State Univ, Dept Comp Sci & Engn, E Lansing, MI 48824 USA
[3] Chinese Univ Hong Kong, Dept Comp Sci & Engn, Shatin, Hong Kong, Peoples R China
关键词
content-based image retrieval; relevance feedback; log-based relevance feedback; relevance feedback log; users; semantic gap; metric learning; regularization; semidefinite programming;
D O I
10.1007/s00530-006-0033-1
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In content-based image retrieval (CBIR), relevant images are identified based on their similarities to query images. Most CBIR algorithms are hindered by the semantic gap between the low-level image features used for computing image similarity and the high-level semantic concepts conveyed in images. One way to reduce the semantic gap is to utilize the log data of users' feedback that has been collected by CBIR systems in history, which is also called "collaborative image retrieval." In this paper, we present a novel metric learning approach, named "regularized metric learning," for collaborative image retrieval, which learns a distance metric by exploring the correlation between low-level image features and the log data of users' relevance judgments. Compared to the previous research, a regularization mechanism is used in our algorithm to effectively prevent overfitting. Meanwhile, we formulate the proposed learning algorithm into a semidefinite programming problem, which can be solved very efficiently by existing software packages and is scalable to the size of log data. An extensive set of experiments has been conducted to show that the new algorithm can substantially improve the retrieval accuracy of a baseline CBIR system using Euclidean distance metric, even with a modest amount of log data. The experiment also indicates that the new algorithm is more effective and more efficient than two alternative algorithms, which exploit log data for image retrieval.
引用
收藏
页码:34 / 44
页数:11
相关论文
共 36 条
[1]  
[Anonymous], 2003, ADV NEURAL INFORM PR
[2]  
[Anonymous], ADV NEURAL INFORM PR
[3]  
[Anonymous], 1999, 7 ACM INT C MULT, DOI DOI 10.1145/319878.319896
[4]  
ASHWIN TV, 2001, P IEEE C AC SPEECH S
[5]  
Belkin M., 2004, TR200406 U CHIC
[6]  
Blei D., 2003, P 26 ANN INT ACM SIG, P127, DOI DOI 10.1145/860435.860460
[7]   A tutorial on Support Vector Machines for pattern recognition [J].
Burges, CJC .
DATA MINING AND KNOWLEDGE DISCOVERY, 1998, 2 (02) :121-167
[8]   An optimized interaction strategy for Bayesian relevance feedback [J].
Cox, IJ ;
Miller, ML ;
Minka, TP ;
Yianilos, PN .
1998 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, PROCEEDINGS, 1998, :553-558
[9]  
Duygulu P, 2002, LECT NOTES COMPUT SC, V2353, P97
[10]  
Gill P. E., 1981, PRACTICAL OPTIMIZATI