Reproducibility of Experiments in Recommender Systems Evaluation

被引：6

作者：

Polatidis, Nikolaos ^{[1
]}

Kapetanakis, Stelios ^{[1
,2
]}

Pimenidis, Elias ^{[3
]}

Kosmidis, Konstantinos ^{[4
]}

机构：

[1] Univ Brighton, Sch Comp Engn & Math, Brighton BN2 4GJ, E Sussex, England

[2] Gluru, Gluru Res, London WC2B 4HN, England

[3] Univ West England, Dept Comp Sci & Creat Technol, Bristol BS16 1QY, Avon, England

[4] Univ West London, Sch Comp & Engn, London W5 5RF, England

来源：

ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS, AIAI 2018 | 2018年 / 519卷

关键词：

Recommender systems; Evaluation; Reproducibility; Replication;

D O I：

10.1007/978-3-319-92007-8_34

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Recommender systems evaluation is usually based on predictive accuracy metrics with better scores meaning recommendations of higher quality. However, the comparison of results is becoming increasingly difficult, since there are different recommendation frameworks and different settings in the design and implementation of the experiments. Furthermore, there might be minor differences on algorithm implementation among the different frameworks. In this paper, we compare well known recommendation algorithms, using the same dataset, metrics and overall settings, the results of which point to result differences across frameworks with the exact same settings. Hence, we propose the use of standards that should be followed as guidelines to ensure the replication of experiments and the reproducibility of the results.

引用

页码：401 / 409

页数：9

共 20 条

[1] A Hybrid CBR Approach for the Long Tail Problem in Recommender Systems
Alshammari, Gharbi
Jorro-Aragoneses, Jose L.
Kapetanakis, Stelios
Petridis, Miltos
Recio-Garcia, Juan A.
Diaz-Agudo, Belen
[J]. CASE-BASED REASONING RESEARCH AND DEVELOPMENT, ICCBR 2017, 2017, 10339 : 35 - 45
[2] [Anonymous], 2014, SIGIR FORUM
[3] Towards reproducibility in recommender-systems research
Beel, Joeran
Breitinger, Corinna
Langer, Stefan
Lommatzsch, Andreas
Gipp, Bela
[J]. USER MODELING AND USER-ADAPTED INTERACTION, 2016, 26 (01) : 69 - 101
[4] Bellogin A., 2013, P 7 ACM C RECOMMENDE, P485
[5] Evaluation of recommender systems: A new approach
del Olmo, Felix Hernandez
Gaudioso, Elena
[J]. EXPERT SYSTEMS WITH APPLICATIONS, 2008, 35 (03) : 790 - 804
[6] Ekstrand M. D., 2011, P 5 ACM C RECOMMENDE, P133, DOI DOI 10.1145/2043932.2043958
[7] Felfernig A, 2011, RECOMMENDER SYSTEMS HANDBOOK, P187, DOI 10.1007/978-0-387-85820-3_6
[8] Gantner Z., 2011, Proceedings of the Fifth ACM Conference on Recommender Systems. RecSys '11, P305
[9] The MovieLens Datasets: History and Context
Harper, F. Maxwell
Konstan, Joseph A.
[J]. ACM TRANSACTIONS ON INTERACTIVE INTELLIGENT SYSTEMS, 2016, 5 (04)
[10] Evaluating collaborative filtering recommender systems
Herlocker, JL
Konstan, JA
Terveen, K
Riedl, JT
[J]. ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2004, 22 (01) : 5 - 53

← 1 2 →