Probabilistic Topic Modeling, Reinforcement Learning, and Crowdsourcing for Personalized Recommendations

被引:4
作者
Tripolitakis, Evangelos [1 ]
Chalkiadakis, Georgios [1 ]
机构
[1] Tech Univ Crete, Sch Elect & Comp Engn, Khania, Greece
来源
MULTI-AGENT SYSTEMS AND AGREEMENT TECHNOLOGIES, EUMAS 2016 | 2017年 / 10207卷
关键词
Recommender systems; Applications of reinforcement learning; Graphical models; Crowdsourcing;
D O I
10.1007/978-3-319-59294-7_14
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We put forward an innovative use of probabilistic topic modeling (PTM) intertwined with reinforcement learning (RL), to provide personalized recommendations. Specifically, we model items under recommendation as mixtures of latent topics following a distribution with Dirichlet priors; this can be achieved via the exploitation of crowd-sourced information for each item. Similarly, we model the user herself as an "evolving" document represented by its respective mixture of latent topics. The user's topic distribution is appropriately updated each time she consumes an item. Recommendations are subsequently based on the divergence between the topic distributions of the user and available items. However, to tackle the exploration versus exploitation dilemma, we apply RL to vary the user's topic distribution update rate. Our method is immune to the notorious "cold start" problem, and it can effectively cope with changing user preferences. Moreover, it is shown to be competitive against state-of-the-art algorithms, outperforming them in terms of sequential performance.
引用
收藏
页码:157 / 171
页数:15
相关论文
共 24 条
[1]  
Agarwal D., 2010, P 3 ACM INT C WEB SE, P91, DOI [DOI 10.1145/1718487.1718499, 10.1145/1718487.1718499.]
[2]  
[Anonymous], 2014, P 27 INT C NEURAL IN
[3]  
[Anonymous], 2002, MALLET: A machine learning for language toolkit
[4]  
[Anonymous], 2009, Netflix prize documentation
[5]  
[Anonymous], 2001, P INT JOINT C ART IN
[6]  
[Anonymous], 2020, Reinforcement Learning, An Introduction
[7]  
Babas K., 2013, Proceedings of the 7th ACM Conference on Recommender Systems, P221
[8]   Probabilistic Topic Models [J].
Blei, David M. .
COMMUNICATIONS OF THE ACM, 2012, 55 (04) :77-84
[9]   Latent Dirichlet allocation [J].
Blei, DM ;
Ng, AY ;
Jordan, MI .
JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) :993-1022
[10]   Multiagent learning using a variable learning rate [J].
Bowling, M ;
Veloso, M .
ARTIFICIAL INTELLIGENCE, 2002, 136 (02) :215-250