共 50 条
- [32] Continuous Parameter Control in Genetic Algorithms using Policy Gradient Reinforcement Learning PROCEEDINGS OF THE 13TH INTERNATIONAL JOINT CONFERENCE ON COMPUTATIONAL INTELLIGENCE (IJCCI), 2021, : 115 - 122
- [33] Control Randomisation Approach for Policy Gradient and Application to Reinforcement Learning in Optimal Switching APPLIED MATHEMATICS AND OPTIMIZATION, 2025, 91 (01):
- [34] Practical Critic Gradient based Actor Critic for On-Policy Reinforcement Learning LEARNING FOR DYNAMICS AND CONTROL CONFERENCE, VOL 211, 2023, 211
- [35] QSOD: Hybrid Policy Gradient for Deep Multi-agent Reinforcement Learning IEEE ACCESS, 2021, 9 : 129728 - 129741
- [36] Reinforcement Learning for Mobile Robot Obstacle Avoidance with Deep Deterministic Policy Gradient INTELLIGENT ROBOTICS AND APPLICATIONS (ICIRA 2022), PT III, 2022, 13457 : 197 - 204
- [38] Learning Heuristics for the TSP by Policy Gradient INTEGRATION OF CONSTRAINT PROGRAMMING, ARTIFICIAL INTELLIGENCE, AND OPERATIONS RESEARCH, CPAIOR 2018, 2018, 10848 : 170 - 181
- [39] An Information-Theoretic Analysis of Bayesian Reinforcement Learning 2022 58TH ANNUAL ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING (ALLERTON), 2022,
- [40] Bayesian reinforcement learning for navigation planning in unknown environments FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2024, 7