VisualRank: Applying PageRank to large-scale image search

被引:243
作者
Jing, Yushi [1 ,2 ]
Baluja, Shumeet [2 ]
机构
[1] Georgia Inst Technol, Mountain View, CA 94043 USA
[2] Google Inc, Res Grp, Mountain View, CA 94043 USA
关键词
image ranking; content-based image retrieval; eigenvector centrality; graph theory;
D O I
10.1109/TPAMI.2008.121
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Because of the relative ease in understanding and processing text, commercial image-search systems often rely on techniques that are largely indistinguishable from text search. Recently, academic studies have demonstrated the effectiveness of employing image-based features to provide either alternative or additional signals to use in this process. However, it remains uncertain whether such techniques will generalize to a large number of popular Web queries and whether the potential improvement to search quality warrants the additional computational cost. In this work, we cast the image-ranking problem into the task of identifying "authority" nodes on an inferred visual similarity graph and propose VisualRank to analyze the visual link structures among images. The images found to be "authorities" are chosen as those that answer the image-queries well. To understand the performance of such an approach in a real system, we conducted a series of large-scale experiments based on the task of retrieving images for 2,000 of the most popular products queries. Our experimental results show significant improvement, in terms of user satisfaction and relevancy, in comparison to the most recent Google Image Search results. Maintaining modest computational cost is vital to ensuring that this procedure can be used in practice; we describe the techniques required to make this system practical for large-scale deployment in commercial search engines.
引用
收藏
页码:1877 / 1890
页数:14
相关论文
共 42 条
[1]  
[Anonymous], ACM MULTIMEDIA
[2]  
[Anonymous], P IEEE COMP SOC C CO
[3]  
[Anonymous], P C COMP VIS PATT RE
[4]  
[Anonymous], P C ADV NEUR INF PRO
[5]  
[Anonymous], 2007, Proceedings of the 15th In- ternational Conference on Multimedia
[6]  
BALUJA S, 2008, P 17 INT WORLD WID W
[7]   Speeded-Up Robust Features (SURF) [J].
Bay, Herbert ;
Ess, Andreas ;
Tuytelaars, Tinne ;
Van Gool, Luc .
COMPUTER VISION AND IMAGE UNDERSTANDING, 2008, 110 (03) :346-359
[8]   Shape matching and object recognition using shape contexts [J].
Belongie, S ;
Malik, J ;
Puzicha, J .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2002, 24 (04) :509-522
[9]   The anatomy of a large-scale hypertextual Web search engine [J].
Brin, S ;
Page, L .
COMPUTER NETWORKS AND ISDN SYSTEMS, 1998, 30 (1-7) :107-117
[10]   Blobworld: Image segmentation using expectation-maximization and its application to image querying [J].
Carson, C ;
Belongie, S ;
Greenspan, H ;
Malik, J .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2002, 24 (08) :1026-1038