The Role of Cores in Recommender Benchmarking for Social Bookmarking Systems

Cited by: 5

Authors:

Doerfel, Stephan [1]

Jaeschke, Robert [2]

Stumme, Gerd [1,2]

Affiliations:

[1] Univ Kassel, Knowledge & Data Engn Grp KDE, Interdisciplinary Res Ctr Informat Syst Design IT, Wilhelmshoher Allee 73, D-34121 Kassel, Germany

[2] L3S Res Ctr, Appelstr 4, D-30167 Hannover, Germany

Source:

ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY | 2016 / Vol. 7 / No. 03

Keywords:

Algorithms; Experimentation; Measurement; Reliability; Recommender; core; benchmarking; graph; evaluation; preprocessing; TAG RECOMMENDATIONS;

DOI:

10.1145/2700485

Chinese Library Classification (CLC) number:

TP18 [Artificial Intelligence Theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Social bookmarking systems have established themselves as an important part of today's Web. In such systems, tag recommender systems support users during the posting of a resource by suggesting suitable tags. Tag recommender algorithms have often been evaluated in offline benchmarking experiments. Yet, the particular setup of such experiments has rarely been analyzed. In particular, since the recommendation quality usually suffers from difficulties such as the sparsity of the data or the cold-start problem for new resources or users, datasets have often been pruned to so-called cores (specific subsets of the original datasets), without much consideration of the implications for the benchmarking results. In this article, we generalize the notion of a core by introducing the new notion of a set-core, which is independent of any graph structure, to overcome a structural drawback in the previous constructions of cores on tagging data. We show that problems caused by some types of cores can be eliminated using set-cores. Further, we present a thorough analysis of tag recommender benchmarking setups using cores. To that end, we conduct a large-scale experiment on four real-world datasets, in which we analyze the influence of different cores on the evaluation of recommendation algorithms. We show that the results of comparing different recommendation approaches depend on the selection of core type and level. For the benchmarking of tag recommender algorithms, our results suggest that the evaluation must be set up more carefully and should not be based on one arbitrarily chosen core type and level.
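To make the idea of pruning a tagging dataset to a core more concrete, the following is a minimal sketch of one commonly used pruning scheme, in which posts are removed until every remaining user, resource, and tag occurs in at least k posts. It is an illustration under stated assumptions, not the article's set-core construction, which the abstract does not define. The function name `prune_to_core`, the `(user, resource, tags)` post format, and the sample data are all hypothetical.

```python
from collections import Counter


def prune_to_core(posts, k):
    """Iteratively prune posts until every remaining user, resource and tag
    occurs in at least k posts.

    Hypothetical helper for illustration only; posts are (user, resource, tags)
    triples. This is NOT the set-core defined in the article.
    """
    posts = [(u, r, frozenset(tags)) for u, r, tags in posts]
    while True:
        # Count in how many posts each user, resource and tag occurs.
        users = Counter(u for u, _, _ in posts)
        resources = Counter(r for _, r, _ in posts)
        tags = Counter(t for _, _, ts in posts for t in ts)

        pruned = []
        for u, r, ts in posts:
            if users[u] < k or resources[r] < k:
                continue  # user or resource is too rare: drop the whole post
            kept = frozenset(t for t in ts if tags[t] >= k)
            if kept:  # drop posts that lose all of their tags
                pruned.append((u, r, kept))

        if pruned == posts:  # nothing changed, so the core is reached
            return pruned
        posts = pruned


# Made-up sample data, used only to exercise the sketch.
posts = [
    ("u1", "r1", {"web", "python"}),
    ("u1", "r2", {"web"}),
    ("u2", "r1", {"web", "python"}),
    ("u2", "r2", {"web", "rare"}),
    ("u3", "r3", {"web"}),
]
core = prune_to_core(posts, 2)
print(len(core))  # prints 4
```

Because removing one post can push another user, resource, or tag below the threshold, the pruning has to be repeated until nothing changes. The choice of k changes which data the recommenders are evaluated on, which is the kind of setup decision the abstract argues should not be made arbitrarily.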


Pages: 33
