50 items in total
- [1] SOAC: Supervised Off-Policy Actor-Critic for Recommender Systems 23RD IEEE INTERNATIONAL CONFERENCE ON DATA MINING, ICDM 2023, 2023, : 14121 - 14626
- [4] Noisy Importance Sampling Actor-Critic: An Off-Policy Actor-Critic With Experience Replay 2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
- [5] Neural Network Compatible Off-Policy Natural Actor-Critic Algorithm 2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
- [8] Supervised Advantage Actor-Critic for Recommender Systems WSDM'22: PROCEEDINGS OF THE FIFTEENTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2022, : 1186 - 1196
- [9] Finite-Sample Analysis of Off-Policy Natural Actor-Critic With Linear Function Approximation IEEE CONTROL SYSTEMS LETTERS, 2022, 6 : 2611 - 2616