共 50 条
- [42] TBQ(σ): Improving Efficiency of Trace Utilization for Off-Policy Reinforcement Learning AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 1025 - 1032
- [43] Fuzzy state aggregation and off-policy reinforcement learning for stochastic environments PROCEEDINGS OF THE EIGHTH IASTED INTERNATIONAL CONFERENCE ON CONTROL AND APPLICATIONS, 2006, : 133 - +
- [44] Off-Policy Meta-Reinforcement Learning With Belief-Based Task Inference IEEE ACCESS, 2022, 10 : 49494 - 49507
- [48] High-Value Prioritized Experience Replay for Off-policy Reinforcement Learning 2019 IEEE 31ST INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2019), 2019, : 1510 - 1514
- [50] Hyperparameter Tuning of an Off-Policy Reinforcement Learning Algorithm for H∞ Tracking Control LEARNING FOR DYNAMICS AND CONTROL CONFERENCE, VOL 211, 2023, 211