共 50 条
- [22] Fast Link Scheduling in Wireless Networks Using Regularized Off-Policy Reinforcement Learning IEEE Networking Letters, 2023, 5 (02): : 86 - 90
- [23] Off-policy evaluation for tabular reinforcement learning with synthetic trajectories Statistics and Computing, 2024, 34
- [28] An Optimistic Approach to the Temporal Difference Error in Off-Policy Actor-Critic Algorithms 2022 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2022, : 875 - 883
- [29] Off-policy Learning for Multiple Loggers KDD'19: PROCEEDINGS OF THE 25TH ACM SIGKDD INTERNATIONAL CONFERENCCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2019, : 1184 - 1193