共 45 条
- [1] Afsar M Mehdi, 2021, ACM COMPUTING SURVEY
- [2] [Anonymous], 2019, ARXIV190300374
- [3] [Anonymous], 2016, INT C MACH LEARN
- [4] Bai XY, 2019, ADV NEUR IN, V32
- [5] Optimization Methods for Large-Scale Machine Learning [J]. SIAM REVIEW, 2018, 60 (02) : 223 - 311
- [6] Cai Tianchi, 2023, WSDM '23: Proceedings of the Sixteenth ACM International Conference on Web Search and Data Mining, P186, DOI 10.1145/3539597.3570486
- [7] Off-Policy Actor-critic for Recommender Systems [J]. PROCEEDINGS OF THE 16TH ACM CONFERENCE ON RECOMMENDER SYSTEMS, RECSYS 2022, 2022, : 338 - 349
- [8] User Response Models to Improve a REINFORCE Recommender System [J]. WSDM '21: PROCEEDINGS OF THE 14TH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2021, : 121 - 129
- [9] Top-K Off-Policy Correction for a REINFORCE Recommender System [J]. PROCEEDINGS OF THE TWELFTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING (WSDM'19), 2019, : 456 - 464
- [10] Chen X., 2021, ARXIV210903540