共 50 条
[41]
A Reinforcement Learning Method to Trajectory Design for Manned Lunar Mission via Reshaping Rewards
[J].
ADVANCES IN GUIDANCE, NAVIGATION AND CONTROL,
2023, 845
:5318-5329
[45]
Student-t policy in reinforcement learning to acquire global optimum of robot control
[J].
Applied Intelligence,
2019, 49
:4335-4347
[47]
Global Convergence of Localized Policy Iteration in Networked Multi-Agent Reinforcement Learning
[J].
Performance Evaluation Review,
2023, 51 (01)
:83-84
[50]
Applying and Verifying an Explainability Method Based on Policy Graphs in the Context of Reinforcement Learning
[J].
ARTIFICIAL INTELLIGENCE RESEARCH AND DEVELOPMENT,
2021, 339
:455-464