On Sampled Metrics for Item Recommendation

被引:299
作者
Krichene, Walid [1 ]
Rendle, Steffen [1 ]
机构
[1] Google Res, Mountain View, CA 94043 USA
来源
KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING | 2020年
关键词
Item Recommendation; Evaluation; Metrics; Sampled Metric;
D O I
10.1145/3394486.340226
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The task of item recommendation requires ranking a large catalogue of items given a context. Item recommendation algorithms are evaluated using ranking metrics that depend on the positions of relevant items. To speed up the computation of metrics, recent work often uses sampled metrics where only a smaller set of random items and the relevant items are ranked. This paper investigates sampled metrics in more detail and shows that they are inconsistent with their exact version, in the sense that they do not persist relative statements, e.g., recommender A is better than B, not even in expectation. Moreover, the smaller the sampling size, the less difference there is between metrics, and for very small sampling size, all metrics collapse to the AUC metric. We show that it is possible to improve the quality of the sampled metrics by applying a correction, obtained by minimizing different criteria such as bias or mean squared error. We conclude with an empirical evaluation of the naive sampled metrics and their corrected variants. To summarize, our work suggests that sampling should be avoided for metric calculation, however if an experimental study needs to sample, the proposed corrections can improve the quality of the estimate.
引用
收藏
页码:1748 / 1757
页数:10
相关论文
共 18 条
[1]  
Aiolli F., 2013, P 7 ACM C REC SYST, P273
[2]  
Barlow R.E., 1972, Statistical inference under order restrictions
[3]   A Generic Coordinate Descent Framework for Learning from Implicit Feedback [J].
Bayer, Immanuel ;
He, Xiangnan ;
Kanagal, Bhargav ;
Rendle, Steffen .
PROCEEDINGS OF THE 26TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW'17), 2017, :1341-1350
[4]  
Bengio Y., 2003, P C ART INT STAT AIS
[5]   Adaptive importance sampling to accelerate training of a neural probabilistic language model [J].
Bengio, Yoshua ;
Senecal, Jean-Sebastien .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2008, 19 (04) :713-722
[6]  
Blanc G, 2018, PR MACH LEARN RES, V80
[7]   Collaborative Memory Network for Recommendation Systems [J].
Ebesu, Travis ;
Shen, Bin ;
Fang, Yi .
ACM/SIGIR PROCEEDINGS 2018, 2018, :515-524
[8]   The MovieLens Datasets: History and Context [J].
Harper, F. Maxwell ;
Konstan, Joseph A. .
ACM TRANSACTIONS ON INTERACTIVE INTELLIGENT SYSTEMS, 2016, 5 (04)
[9]   Neural Collaborative Filtering [J].
He, Xiangnan ;
Liao, Lizi ;
Zhang, Hanwang ;
Nie, Liqiang ;
Hu, Xia ;
Chua, Tat-Seng .
PROCEEDINGS OF THE 26TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW'17), 2017, :173-182
[10]   Leveraging Meta-path based Context for Top-N Recommendation with A Neural Co-Attention Model [J].
Hu, Binbin ;
Shi, Chuan ;
Zhao, Wayne Xin ;
Yu, Philip S. .
KDD'18: PROCEEDINGS OF THE 24TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2018, :1531-1540