44 entries in total
[1] Bertsekas Dimitri P., 2017, DYNAMIC PROGRAMMING, V1
[2] Off-Policy Actor-critic for Recommender Systems. Proceedings of the 16th ACM Conference on Recommender Systems (RecSys 2022), 2022: 338-349.
[3] Top-K Off-Policy Correction for a REINFORCE Recommender System. Proceedings of the Twelfth ACM International Conference on Web Search and Data Mining (WSDM'19), 2019: 456-464.
[4] Chen XS, 2019, 36 INT C MACHINE LEA, V97
[5] Cheng Heng-Tze, 2016, P 1 WORKSHOP DEEP LE, P7, DOI 10.1145/2988450.2988454
[6] Deep Neural Networks for YouTube Recommendations. Proceedings of the 10th ACM Conference on Recommender Systems (RecSys'16), 2016: 191-198.
[8] Collaborative filtering recommender systems. Foundations and Trends in Human-Computer Interaction, 2010, 4(2): 81-173.
[9] Evans DS, 2008, REV NETW ECON, V7, P359