46 entries in total
- [1] Agarwal R., 2019, STRIVING SIMPLICITY
- [2] Chen HK, 2019, AAAI CONF ARTIF INTE, P3312
- [3] Top-K Off-Policy Correction for a REINFORCE Recommender System [J]. PROCEEDINGS OF THE TWELFTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING (WSDM'19), 2019: 456-464
- [4] Chua K, 2018, ADV NEUR IN, V31
- [5] Deep Neural Networks for YouTube Recommendations [J]. PROCEEDINGS OF THE 10TH ACM CONFERENCE ON RECOMMENDER SYSTEMS (RECSYS'16), 2016: 191-198
- [6] Fu Justin, 2020, ARXIV200407219
- [7] Fujimoto S, 2019, PR MACH LEARN RES, V97
- [8] Gottesman O., 2018, EVALUATING REINFORCE
- [9] Gretton A, 2012, J MACH LEARN RES, V13, P723
- [10] Haarnoja T, 2018, PR MACH LEARN RES, V80