Embedding-based News Recommendation for Millions of Users

被引:298
作者
Okura, Shumpei [1 ]
Tagami, Yukihiro [1 ]
Ono, Shingo [1 ]
Tajima, Akira [1 ]
机构
[1] Yahoo Japan Corp, Tokyo, Japan
来源
KDD'17: PROCEEDINGS OF THE 23RD ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING | 2017年
关键词
News recommendations; Neural networks; Distributed representations; Large-scale services;
D O I
10.1145/3097983.3098108
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
It is necessary to understand the content of articles and user preferences to make effective news recommendations. While ID-based methods, such as collaborative filtering and low-rank factorization, are well known for making recommendations, they are not suitable for news recommendations because candidate articles expire quickly and are replaced with new ones within short spans of time. Word-based methods, which are often used in information retrieval settings, are good candidates in terms of system performance but have issues such as their ability to cope with synonyms and orthographical variants and define "queries" from users' historical activities. This paper proposes an embedding-based method to use distributed representations in a three step end-to-end manner: (i) start with distributed representations of articles based on a variant of a denoising autoencoder, (ii) generate user representations by using a recurrent neural network (RNN) with browsing histories as input sequences, and (iii) match and list articles for users based on inner-product operations by taking system performance into consideration. The proposed method performed well in an experimental offline evaluation using past access data on Yahoo! JAPAN's home page. We implemented it on our actual news distribution system based on these experimental results and compared its online performance with a method that was conventionally incorporated into the system. As a result, the click-through rate (CTR) improved by 23% and the total duration improved by 10%, compared with the conventionally incorporated method. Services that incorporated the method we propose are already open to all users and provide recommendations to over ten million individual users per day who make billions of accesses per month.
引用
收藏
页码:1933 / 1942
页数:10
相关论文
共 22 条
  • [1] [Anonymous], 2007, Google news personalization: scalable online collaborative filtering, DOI DOI 10.1145/1242572.1242610
  • [2] [Anonymous], AC SPEECH SIGN PROC
  • [3] Cho K, 2014, ARXIV14061078, P1724, DOI [DOI 10.3115/V1/D14-1179, 10.3115/V1/D14-1179]
  • [4] Covington Jay Adams Paul, 2016, P 10 ACM C REC SYST
  • [5] Cramer Henriette, 2015, P 33 ANN ACM C HUM F
  • [6] Fan RE, 2008, J MACH LEARN RES, V9, P1871
  • [7] Hochreiter S, 1997, NEURAL COMPUT, V9, P1735, DOI [10.1162/neco.1997.9.1.1, 10.1007/978-3-642-24797-2]
  • [8] Hochreiter S., 1991, THESIS TU MUNCHEN
  • [9] Joachims Thorsten, 2002, P 8 ACM SIGKDD INT C
  • [10] Jozefowicz R, 2015, PR MACH LEARN RES, V37, P2342