Non-binary evaluation of next-basket food recommendation

被引：0

作者：

Liu, Yue ^{[1
]}

Achananuparp, Palakorn ^{[1
]}

Lim, Ee-Peng ^{[1
]}

机构：

[1] Singapore Management Univ, Singapore, Singapore

来源：

USER MODELING AND USER-ADAPTED INTERACTION | 2024年 / 34卷 / 01期

关键词：

Next-basket recommendation; Food recommendation; Non-binary relevance; Evaluation metrics; User study; SENTENCE SIMILARITY; MUSIC;

D O I：

10.1007/s11257-023-09369-8

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Next-basket recommendation (NBR) is a recommendation task that predicts a basket or a set of items a user is likely to adopt next based on his/her history of basket adoption sequences. It enables a wide range of novel applications and services from predicting next basket of items for grocery shopping to recommending food items a user is likely to consume together in the next meal. Even though much progress has been made in the algorithmic NBR research over the years, little research has been done to broaden knowledge about the evaluation of NBR methods, which is largely based on the offline evaluation experiments and binary relevance paradigm. Specifically, we argue that recommended baskets which are more similar to ground truth baskets are better recommendations than those that share little resemblance to the ground truth, and therefore, they should be granted some partial credits. Based on this notion of non-binary relevance assessment, we propose new evaluation metrics for NBR by adapting and extending similarity metrics from natural language processing (NLP) and text classification research. To validate the proposed metrics, we conducted two user studies on the next-meal food recommendation using numerous state-of-the-art NBR methods in both online and offline evaluation settings. Our findings show that the offline performance assessment based on the proposed non-binary evaluation metrics is more representative of the online evaluation performance than that of the standard evaluation metrics.

引用

页码：183 / 227

页数：45

共 106 条

[1] Achananuparp P., 2016, ARXIV
[2] Achananuparp P, 2008, LECT NOTES COMPUT SC, V5182, P305, DOI 10.1007/978-3-540-85836-2_29
[3] Eat & Tell: A Randomized Trial of Random-Loss Incentive to Increase Dietary Self-Tracking Compliance
Achananuparp, Palakorn
Lim, Ee-Peng
Abhishek, Vibhanshu
Yun, Tianjiao
[J]. DH '18: PROCEEDINGS OF THE 2018 INTERNATIONAL CONFERENCE ON DIGITAL HEALTH, 2018, : 45 - 54
[4] Achananuparp P, 2009, LECT NOTES ARTIF INT, V5476, P548, DOI 10.1007/978-3-642-01307-2_52
[5] [Anonymous], 2012, P 18 ACM SIGKDD INT, DOI [10.1145/2339530.2339643, DOI 10.1145/2339530.2339643]
[6] Arora S, 2017, 5 INT C LEARN REPR I
[7] Beel J., 2013, P INT WORKSHOP REPRO, P7, DOI 10.1145/2532508.2532511.
[8] Towards reproducibility in recommender-systems research
Beel, Joeran
Breitinger, Corinna
Langer, Stefan
Lommatzsch, Andreas
Gipp, Bela
[J]. USER MODELING AND USER-ADAPTED INTERACTION, 2016, 26 (01) : 69 - 101
[9] RecGAN: Recurrent Generative Adversarial Networks for Recommendation Systems
Bharadhwaj, Homanga
Park, Homin
Lim, Brian Y.
[J]. 12TH ACM CONFERENCE ON RECOMMENDER SYSTEMS (RECSYS), 2018, : 372 - 376
[10] Location-aware music recommendation
Braunhofer, Matthias
Kaminskas, Marius
Ricci, Francesco
[J]. INTERNATIONAL JOURNAL OF MULTIMEDIA INFORMATION RETRIEVAL, 2013, 2 (01) : 31 - 44

← 1 2 3 4 5 6 7 8 9 10 →