共 30 条
- [1] Discrete-time nonlinear HJB solution using approximate dynamic programming: Convergence proof [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2008, 38 (04): : 943 - 949
- [2] Bellman RE., 1957, Dynamic Programming
- [3] Bhasin S., 2010, P IEEE C DEC CONTR
- [4] Campos J., 1999, P IEEE AM CONTR C, V4
- [5] Dobre C, 2014, INT J INNOV COMPUT I, V10, P417
- [6] Reinforcement learning in continuous time and space [J]. NEURAL COMPUTATION, 2000, 12 (01) : 219 - 245
- [7] Dreyfus SE, 1977, ART THEORY DYNAMIC P
- [10] Lewis F.L., 1986, OPTIMAL CONTROL