Personalized and object-centered tag recommendation methods for Web 2.0 applications

被引:38
作者
Belem, Fabiano M. [1 ]
Martins, Eder F. [1 ]
Almeida, Jussara M. [1 ]
Goncalves, Marcos A. [1 ]
机构
[1] Univ Fed Minas Gerais, Dept Comp Sci, BR-31270010 Belo Horizonte, MG, Brazil
关键词
Tag recommendation; Relevance metrics; Personalization;
D O I
10.1016/j.ipm.2014.03.002
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Several Web 2.0 applications allow users to assign keywords (or tags) to provide better organization and description of the shared content. Tag recommendation methods may assist users in this task, improving the quality of the available information and, thus, the effectiveness of various tag-based information retrieval services, such as searching, content recommendation and classification. This work addresses the tag recommendation problem from two perspectives. The first perspective, centered at the object, aims at suggesting relevant tags to a target object, jointly exploiting the following three dimensions: (i) tag cooccurrences, (ii) terms extracted from multiple textual features (e.g., title, description), and (iii) various metrics to estimate tag relevance. The second perspective, centered at both object and user, aims at performing personalized tag recommendation to a target objectuser pair, exploiting, in addition to the three aforementioned dimensions, a metric that captures user interests. In particular, we propose new heuristic methods that extend state-of-the-art strategies by including new metrics that estimate how accurately a candidate tag describes the target object. We also exploit three learning-to-rank (L2R) based techniques, namely, RankSVM, Genetic Programming (GP) and Random Forest (RF), for generating ranking functions that exploit multiple metrics as attributes to estimate the relevance of a tag to a given object or object-user pair. We evaluate the proposed methods using data from four popular Web 2.0 applications, namely, Bibsonomy, LastFM, YouTube and YahooVideo. Our new heuristics for object-centered tag recommendation provide improvements in precision over the best state-of-the-art alternative of 12% on average (up to 20% in any single dataset), while our new heuristics for personalized tag recommendation produce average gains in precision of 121% over the baseline. Similar performance gains are also achieved in terms of other metrics, notably recall, Normalized Discounted Cumulative Gain (NDCG) and Mean-Reciprocal Rank (MRR). Further improvements, for both object-centered (up to 23% in precision) and personalized tag recommendation (up to 13% in precision), can also be achieved with our new L2R-based strategies, which are flexible and can be easily extended to exploit other aspects of the tag recommendation problem. Finally, we also quantify the benefits of personalized tag recommendation to provide better descriptions of the target object when compared to object-centered recommendation by focusing only on the relevance of the suggested tags to the object. We find that our best personalized method outperforms the best object-centered strategy, with average gains in precision of 10%. (C) 2014 Elsevier Ltd. All rights reserved.
引用
收藏
页码:524 / 553
页数:30
相关论文
共 61 条
[1]  
Agrawal R., 1994, P 20 INT C VER LARG, P487, DOI DOI 10.5555/645920.672836
[2]  
Almeida J., 2010, IEEE INTERNET COMPUT, V14
[3]  
[Anonymous], P ECML PKDD DISC CHA
[4]  
[Anonymous], 1998, Genetic programming: an introduction: on the automatic evolution of computer programs and its applications
[5]  
[Anonymous], 2003, Journal of machine learning research
[6]  
[Anonymous], 2007, ENCY MEASUREMENT STA
[7]  
[Anonymous], P 33 INT ACM SIGIR C
[8]  
[Anonymous], 2009, Proceedings of the 18th International Conference on World Wide Web, WWW '09, DOI 10.1145/1526709.1526758
[9]  
[Anonymous], 2009, P 3 ACM C RECOMMENDE, DOI DOI 10.1145/1639714.1639726
[10]  
[Anonymous], P 3 ACM INT C WEB SE, P81, DOI DOI 10.1145/1718487.1718498