共 45 条
[1]
Afsar M Mehdi, 2021, ACM COMPUTING SURVEY
[2]
[Anonymous], 2016, INT C MACH LEARN
[3]
Bai XY, 2019, ADV NEUR IN, V32
[5]
Marketing Budget Allocation with Offline Constrained Deep Reinforcement Learning
[J].
PROCEEDINGS OF THE SIXTEENTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, WSDM 2023, VOL 1,
2023,
:186-194
[6]
Off-Policy Actor-critic for Recommender Systems
[J].
PROCEEDINGS OF THE 16TH ACM CONFERENCE ON RECOMMENDER SYSTEMS, RECSYS 2022,
2022,
:338-349
[7]
User Response Models to Improve a REINFORCE Recommender System
[J].
WSDM '21: PROCEEDINGS OF THE 14TH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING,
2021,
:121-129
[8]
Top-K Off-Policy Correction for a REINFORCE Recommender System
[J].
PROCEEDINGS OF THE TWELFTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING (WSDM'19),
2019,
:456-464
[9]
Chen Xiaocong, 2021, ARXIV210903540
[10]
CHEN XY, 2019, PR MACH LEARN RES, V97