An Adaptive Teleportation Random Walk Model for Learning Social Tag Relevance

被引:28
作者
Zhu, Xiaofei [1 ]
Nejdl, Wolfgang [1 ]
Georgescu, Mihai [1 ]
机构
[1] Leibniz Univ Hannover, Res Ctr L3S, Hannover, Germany
来源
SIGIR'14: PROCEEDINGS OF THE 37TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL | 2014年
关键词
Social Tag Relevance; Neighbor Voting; Random Walk; IMAGE RETRIEVAL;
D O I
10.1145/2600428.2609556
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Social tags are known to be a valuable source of information for image retrieval and organization. However, contrary to the conventional document retrieval, rich tag frequency information in social sharing systems, such as Flickr, is not available, thus we cannot directly use the tag frequency (analogous to the term frequency in a document) to represent the relevance of tags. Many heuristic approaches have been proposed to address this problem, among which the well-known neighbor voting based approaches are the most effective methods. The basic assumption of these methods is that a tag is considered as relevant to the visual content of a target image if this tag is also used to annotate the visual neighbor images of the target image by lots of different users. The main limitation of these approaches is that they treat the voting power of each neighbor image either equally or simply based on its visual similarity. In this paper, we cast the social tag relevance learning problem as an adaptive teleportation random walk process on the voting graph. In particular, we model the relationships among images by constructing a voting graph, and then propose an adaptive teleportation random walk, in which a confidence factor is introduced to control the teleportation probability, on the voting graph. Through this process, direct and indirect relationships among images can be explored to cooperatively estimate the tag relevance. To quantify the performance of our approach, we compare it with state-of-the-art methods on two publicly available datasets (NUS-WIDE and MIR Flickr). The results indicate that our method achieves substantial performance gains on these datasets.
引用
收藏
页码:223 / 232
页数:10
相关论文
共 25 条
[1]  
[Anonymous], 2008, P 16 INT C MULTIMEDI, DOI DOI 10.1145/1459359.1459577
[2]  
[Anonymous], 2009, PROC INT C WORLD WID
[3]  
[Anonymous], 2008, P 17 INT C WORLD WID, DOI DOI 10.1145/1367497.1367540
[4]  
[Anonymous], 2002, Proceedings of the 11th international conference on World Wide Web, DOI DOI 10.1145/511446.511513
[5]  
[Anonymous], P 2 ACM INT C MULT R
[6]  
[Anonymous], 1998, Technical report, DOI DOI 10.1007/978-3-319-08789-4_10
[7]   Overview of the MPEG-7 standard [J].
Chang, SF ;
Sikora, T ;
Puri, A .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2001, 11 (06) :688-695
[8]  
Chatzichristofis Savvas A., 2008, 2008 Ninth International Workshop on Image Analysis for Multimedia Interactive Services (WIAMIS), P191, DOI 10.1109/WIAMIS.2008.24
[9]  
Chua T.-S., 2009, ACM INT C IM VID RET, P48
[10]   Random walks on the click graph [J].
Microsoft Research Cambridge, 7 JJ Thomson Ave, Cambridge, United Kingdom .
Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR'07, 2007, :239-246