共 46 条
[2]
[Anonymous], 1967, SIAM J. Control, V5, P54
[5]
Bertsekas D. P., 2020, Rollout, Policy Iteration, and Distributed Reinforcement Learning
[7]
Bertsekas DP., 1995, DYNAMIC PROGRAMMING
[10]
Broussard J. R., 1983, Proceedings of the 1983 American Control Conference, P1026