共 50 条
- [11] Optimal Control for Multi-agent Systems Using Off-Policy Reinforcement Learning 2022 4TH INTERNATIONAL CONFERENCE ON CONTROL AND ROBOTICS, ICCR, 2022, : 135 - 140
- [12] Research on Off-Policy Evaluation in Reinforcement Learning: A Survey Jisuanji Xuebao/Chinese Journal of Computers, 2022, 45 (09): : 1926 - 1945
- [14] Re-attentive experience replay in off-policy reinforcement learning Machine Learning, 2024, 113 : 2327 - 2349
- [18] Safe Off-policy Reinforcement Learning Using Barrier Functions 2020 AMERICAN CONTROL CONFERENCE (ACC), 2020, : 2176 - 2181
- [19] Off-policy evaluation for tabular reinforcement learning with synthetic trajectories Statistics and Computing, 2024, 34
- [20] Off-Policy Reinforcement Learning with Loss Function Weighted by Temporal Difference Error ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, ICIC 2023, PT V, 2023, 14090 : 600 - 613