Non-binary evaluation of next-basket food recommendation

被引:0
作者
Liu, Yue [1 ]
Achananuparp, Palakorn [1 ]
Lim, Ee-Peng [1 ]
机构
[1] Singapore Management Univ, Singapore, Singapore
关键词
Next-basket recommendation; Food recommendation; Non-binary relevance; Evaluation metrics; User study; SENTENCE SIMILARITY; MUSIC;
D O I
10.1007/s11257-023-09369-8
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Next-basket recommendation (NBR) is a recommendation task that predicts a basket or a set of items a user is likely to adopt next based on his/her history of basket adoption sequences. It enables a wide range of novel applications and services from predicting next basket of items for grocery shopping to recommending food items a user is likely to consume together in the next meal. Even though much progress has been made in the algorithmic NBR research over the years, little research has been done to broaden knowledge about the evaluation of NBR methods, which is largely based on the offline evaluation experiments and binary relevance paradigm. Specifically, we argue that recommended baskets which are more similar to ground truth baskets are better recommendations than those that share little resemblance to the ground truth, and therefore, they should be granted some partial credits. Based on this notion of non-binary relevance assessment, we propose new evaluation metrics for NBR by adapting and extending similarity metrics from natural language processing (NLP) and text classification research. To validate the proposed metrics, we conducted two user studies on the next-meal food recommendation using numerous state-of-the-art NBR methods in both online and offline evaluation settings. Our findings show that the offline performance assessment based on the proposed non-binary evaluation metrics is more representative of the online evaluation performance than that of the standard evaluation metrics.
引用
收藏
页码:183 / 227
页数:45
相关论文
共 106 条
  • [1] Achananuparp P., 2016, ARXIV
  • [2] Achananuparp P, 2008, LECT NOTES COMPUT SC, V5182, P305, DOI 10.1007/978-3-540-85836-2_29
  • [3] Eat & Tell: A Randomized Trial of Random-Loss Incentive to Increase Dietary Self-Tracking Compliance
    Achananuparp, Palakorn
    Lim, Ee-Peng
    Abhishek, Vibhanshu
    Yun, Tianjiao
    [J]. DH '18: PROCEEDINGS OF THE 2018 INTERNATIONAL CONFERENCE ON DIGITAL HEALTH, 2018, : 45 - 54
  • [4] Achananuparp P, 2009, LECT NOTES ARTIF INT, V5476, P548, DOI 10.1007/978-3-642-01307-2_52
  • [5] [Anonymous], 2012, P 18 ACM SIGKDD INT, DOI [10.1145/2339530.2339643, DOI 10.1145/2339530.2339643]
  • [6] Arora S, 2017, 5 INT C LEARN REPR I
  • [7] Beel J., 2013, P INT WORKSHOP REPRO, P7, DOI 10.1145/2532508.2532511.
  • [8] Towards reproducibility in recommender-systems research
    Beel, Joeran
    Breitinger, Corinna
    Langer, Stefan
    Lommatzsch, Andreas
    Gipp, Bela
    [J]. USER MODELING AND USER-ADAPTED INTERACTION, 2016, 26 (01) : 69 - 101
  • [9] RecGAN: Recurrent Generative Adversarial Networks for Recommendation Systems
    Bharadhwaj, Homanga
    Park, Homin
    Lim, Brian Y.
    [J]. 12TH ACM CONFERENCE ON RECOMMENDER SYSTEMS (RECSYS), 2018, : 372 - 376
  • [10] Location-aware music recommendation
    Braunhofer, Matthias
    Kaminskas, Marius
    Ricci, Francesco
    [J]. INTERNATIONAL JOURNAL OF MULTIMEDIA INFORMATION RETRIEVAL, 2013, 2 (01) : 31 - 44