49 entries in total
[3]
Anschel Oron, 2017, P MACHINE LEARNING R, V70
[4]
Asadi K, 2017, PR MACH LEARN RES, V70
[6]
Chen HK, 2019, AAAI CONF ARTIF INTE, P3312
[7]
Off-Policy Actor-critic for Recommender Systems [J]. PROCEEDINGS OF THE 16TH ACM CONFERENCE ON RECOMMENDER SYSTEMS, RECSYS 2022, 2022: 338-349
[8]
Top-K Off-Policy Correction for a REINFORCE Recommender System [J]. PROCEEDINGS OF THE TWELFTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING (WSDM'19), 2019: 456-464
[10]
Deffayet R., 2023, ACM SIGIR FORUM, V56, P1