The influence of personalization on tag query length in social media search

被引:9
作者
Clements, M. [1 ]
de Vries, A. P. [1 ,2 ]
Reinders, M. J. T. [1 ]
机构
[1] Delft Univ Technol, Fac Elect Engn Math & Comp Sci, ICT Grp, NL-2628 CD Delft, Netherlands
[2] Ctr Math & Comp Sci CWI, NL-1098 SJ Amsterdam, Netherlands
关键词
Social media; Personalized search; Random walk; Collaborative tagging; Query length;
D O I
10.1016/j.ipm.2009.03.006
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Social content systems contain enormous collections of unstructured user-generated content, annotated by the collaborative effort of regular Internet users. Tag-clouds have become popular interfaces that allow users to query the database of these systems by clicking relevant terms. However, these single click queries are often not expressive enough to effectively retrieve the desired content. Users have to use multiple clicks or type longer queries to satisfy their information need. To enhance the predicted content ranking we use a random walk model that effectively integrates the user's preference and semantically related query terms. We use the collaborative annotations from a popular on-line book catalog to create a social annotation graph and study the effect of personalization and smoothing for increasing query lengths. We show that personalization and smoothing allow the user to find equally relevant content with fewer query terms compared to a frequency based content ranking with TF-IDF weighing. As expected, we see that the influence of the random walk model disappears if users type more detailed queries. Finally, we discuss the observations with respect to synonyms and homographs which are well known to hamper the performance of information retrieval systems. (C) 2009 Elsevier Ltd. All rights reserved.
引用
收藏
页码:403 / 412
页数:10
相关论文
共 24 条
[1]  
[Anonymous], 2008, P 17 INT C WORLD WID
[2]  
[Anonymous], 2007, P 16 INT C WORLD WID
[3]  
[Anonymous], 2003, Proceedings from SIGIR 2003: The 26th Annual International ACM Conference on Research and Development in Information Retrieval, DOI DOI 10.1145/860435.860474
[4]  
ARAMPATZIS, 2008, SIGIR 08, P811
[5]  
BEGELMAN G, 2006, WWW2006
[6]  
CLEMENTS M, TASK DEPEND IN PRESS
[7]  
CRASWELL N, 2007, SIGIR, P239, DOI DOI 10.1145/1277741.1277784
[8]   THE VOCABULARY PROBLEM IN HUMAN SYSTEM COMMUNICATION [J].
FURNAS, GW ;
LANDAUER, TK ;
GOMEZ, LM ;
DUMAIS, ST .
COMMUNICATIONS OF THE ACM, 1987, 30 (11) :964-971
[9]   Usage patterns of collaborative tagging systems [J].
Golder, SA ;
Huberman, BA .
JOURNAL OF INFORMATION SCIENCE, 2006, 32 (02) :198-208
[10]  
Hotho A, 2006, LECT NOTES COMPUT SC, V4011, P411