Reproducibility of Experiments in Recommender Systems Evaluation

被引:6
作者
Polatidis, Nikolaos [1 ]
Kapetanakis, Stelios [1 ,2 ]
Pimenidis, Elias [3 ]
Kosmidis, Konstantinos [4 ]
机构
[1] Univ Brighton, Sch Comp Engn & Math, Brighton BN2 4GJ, E Sussex, England
[2] Gluru, Gluru Res, London WC2B 4HN, England
[3] Univ West England, Dept Comp Sci & Creat Technol, Bristol BS16 1QY, Avon, England
[4] Univ West London, Sch Comp & Engn, London W5 5RF, England
来源
ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS, AIAI 2018 | 2018年 / 519卷
关键词
Recommender systems; Evaluation; Reproducibility; Replication;
D O I
10.1007/978-3-319-92007-8_34
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recommender systems evaluation is usually based on predictive accuracy metrics with better scores meaning recommendations of higher quality. However, the comparison of results is becoming increasingly difficult, since there are different recommendation frameworks and different settings in the design and implementation of the experiments. Furthermore, there might be minor differences on algorithm implementation among the different frameworks. In this paper, we compare well known recommendation algorithms, using the same dataset, metrics and overall settings, the results of which point to result differences across frameworks with the exact same settings. Hence, we propose the use of standards that should be followed as guidelines to ensure the replication of experiments and the reproducibility of the results.
引用
收藏
页码:401 / 409
页数:9
相关论文
共 20 条
  • [1] A Hybrid CBR Approach for the Long Tail Problem in Recommender Systems
    Alshammari, Gharbi
    Jorro-Aragoneses, Jose L.
    Kapetanakis, Stelios
    Petridis, Miltos
    Recio-Garcia, Juan A.
    Diaz-Agudo, Belen
    [J]. CASE-BASED REASONING RESEARCH AND DEVELOPMENT, ICCBR 2017, 2017, 10339 : 35 - 45
  • [2] [Anonymous], 2014, SIGIR FORUM
  • [3] Towards reproducibility in recommender-systems research
    Beel, Joeran
    Breitinger, Corinna
    Langer, Stefan
    Lommatzsch, Andreas
    Gipp, Bela
    [J]. USER MODELING AND USER-ADAPTED INTERACTION, 2016, 26 (01) : 69 - 101
  • [4] Bellogin A., 2013, P 7 ACM C RECOMMENDE, P485
  • [5] Evaluation of recommender systems: A new approach
    del Olmo, Felix Hernandez
    Gaudioso, Elena
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2008, 35 (03) : 790 - 804
  • [6] Ekstrand M. D., 2011, P 5 ACM C RECOMMENDE, P133, DOI DOI 10.1145/2043932.2043958
  • [7] Felfernig A, 2011, RECOMMENDER SYSTEMS HANDBOOK, P187, DOI 10.1007/978-0-387-85820-3_6
  • [8] Gantner Z., 2011, Proceedings of the Fifth ACM Conference on Recommender Systems. RecSys '11, P305
  • [9] The MovieLens Datasets: History and Context
    Harper, F. Maxwell
    Konstan, Joseph A.
    [J]. ACM TRANSACTIONS ON INTERACTIVE INTELLIGENT SYSTEMS, 2016, 5 (04)
  • [10] Evaluating collaborative filtering recommender systems
    Herlocker, JL
    Konstan, JA
    Terveen, K
    Riedl, JT
    [J]. ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2004, 22 (01) : 5 - 53