50 entries in total
- [31] TBQ(σ): Improving Efficiency of Trace Utilization for Off-Policy Reinforcement Learning. AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019: 1025-1032
- [32] Fuzzy state aggregation and off-policy reinforcement learning for stochastic environments. PROCEEDINGS OF THE EIGHTH IASTED INTERNATIONAL CONFERENCE ON CONTROL AND APPLICATIONS, 2006: 133-+
- [34] Off-Policy Reinforcement Learning with Loss Function Weighted by Temporal Difference Error. ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, ICIC 2023, PT V, 2023, 14090: 600-613
- [35] Hyperparameter Tuning of an Off-Policy Reinforcement Learning Algorithm for H∞ Tracking Control. LEARNING FOR DYNAMICS AND CONTROL CONFERENCE, VOL 211, 2023, 211
- [37] Benchmarking Off-Policy Deep Reinforcement Learning Algorithms for UAV Path Planning. 2024 INTERNATIONAL CONFERENCE ON UNMANNED AIRCRAFT SYSTEMS, ICUAS, 2024: 317-323
- [40] HiER: Highlight Experience Replay for Boosting Off-Policy Reinforcement Learning Agents. IEEE ACCESS, 2024, 12: 100102-100119