iLike: Bridging the Semantic Gap in Vertical Image Search by Integrating Text and Visual Features

Cited by: 22
Authors
Chen, Yuxin [1]
Sampathkumar, Hariprasad [2]
Luo, Bo [2]
Chen, Xue-wen [3]
Affiliations
[1] ETH, Dept Comp Sci, CH-8092 Zurich, Switzerland
[2] Univ Kansas, Dept Elect Engn & Comp Sci, Lawrence, KS 66045 USA
[3] Wayne State Univ, Dept Comp Sci, Detroit, MI 48202 USA
Funding
US National Science Foundation
Keywords
CBIR; specialized search; vertical search engine; ANNOTATION; RETRIEVAL; COLOR;
DOI
10.1109/TKDE.2012.192
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
With the development of the Internet and Web 2.0, large volumes of multimedia content have been made available online. It is highly desirable to provide easy access to such content, i.e., efficient and precise retrieval of images that satisfy users' needs. Toward this goal, content-based image retrieval (CBIR) has been intensively studied in the research community, while text-based search is more widely adopted in industry. Both approaches have inherent disadvantages and limitations. Therefore, unlike text search, which has been highly successful, web image search engines are still immature. In this paper, we present iLike, a vertical image search engine that integrates both textual and visual features to improve retrieval performance. We bridge the semantic gap by capturing the meaning of each text term in the visual feature space, and reweight visual features according to their significance to the query terms. We also bridge the user intention gap because we are able to infer the "visual meanings" behind the textual queries. Last but not least, we provide a visual thesaurus, which is generated from the statistical similarity between the visual space representations of textual terms. Experimental results show that our approach improves both precision and recall, compared with content-based or text-based image retrieval techniques. More importantly, search results from iLike are more consistent with users' perception of the query terms.
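The idea of reweighting visual features by their significance to a query term can be illustrated with a small sketch. This is not the paper's actual algorithm, which the abstract does not specify. It assumes a hypothetical scoring rule (a feature matters more for a term when its values are more tightly concentrated among images tagged with that term than across the whole collection), and the names `term_feature_weights` and `rank_by_term` are invented for illustration.

```python
import numpy as np

def term_feature_weights(features, has_term, eps=1e-9):
    """Assumed scoring rule (not from the paper): a feature gets a larger
    weight when its spread among images tagged with the term is small
    relative to its spread over the whole collection."""
    background_std = features.std(axis=0)
    term_std = features[has_term].std(axis=0)
    raw = background_std / (term_std + eps)
    return raw / raw.sum()  # normalize so the weights sum to 1

def rank_by_term(features, has_term, weights):
    """Rank all images by weighted Euclidean distance to the centroid
    of the images tagged with the term (closest first)."""
    centroid = features[has_term].mean(axis=0)
    dist = np.sqrt((((features - centroid) ** 2) * weights).sum(axis=1))
    return np.argsort(dist)

# Toy collection: 200 images, 3 visual features (all values are synthetic).
rng = np.random.default_rng(0)
n = 200
features = rng.normal(size=(n, 3))
has_term = np.zeros(n, dtype=bool)
has_term[:50] = True  # the first 50 images carry the query term
# Make feature 0 tightly clustered for images carrying the term.
features[:50, 0] = 2.0 + 0.05 * rng.normal(size=50)

weights = term_feature_weights(features, has_term)
ranking = rank_by_term(features, has_term, weights)
```

In this toy setup, feature 0 dominates the weights because it is the only feature that separates tagged images from the rest, so the ranking is driven mostly by that feature. Comparing the weight vectors of two different terms (for example with a correlation measure) would be one way to approximate the "visual thesaurus" idea, again as an assumption rather than the published method.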
Pages: 2257-2270
Page count: 14