共 44 条
- [1] Agarwal Deepak, 2011, P 17 ACM SIGKDD INT, P132
- [2] Agarwal R., 2019, Striving for simplicity in off-policy deep reinforcement learning
- [3] [Anonymous], 2016, P 33 INT C MACH LEAR
- [4] Basu Kinjal, 2020, INT JOINT C ART INT
- [5] Brockman G., 2016, ARXIV PREPRINT ARXIV
- [6] Top-K Off-Policy Correction for a REINFORCE Recommender System [J]. PROCEEDINGS OF THE TWELFTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING (WSDM'19), 2019, : 456 - 464
- [7] Fujimoto S, 2019, PR MACH LEARN RES, V97
- [8] Near Real-time Optimization of Activity-based Notifications [J]. KDD'18: PROCEEDINGS OF THE 24TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2018, : 283 - 292
- [9] Email Volume Optimization at LinkedIn [J]. KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, : 97 - 106
- [10] Optimizing Email Volume For Sitewide Engagement [J]. CIKM'17: PROCEEDINGS OF THE 2017 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2017, : 1947 - 1955