共 54 条
[1]
Discrete-time nonlinear HJB solution using approximate dynamic programming: Convergence proof
[J].
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS,
2008, 38 (04)
:943-949
[2]
[Anonymous], 1996, Neuro-dynamic programming
[3]
Bertsekas D. P., 2007, Dynamic Programming and Optimal Control
[9]
Reinforcement Learning and Feedback Control USING NATURAL DECISION METHODS TO DESIGN OPTIMAL ADAPTIVE CONTROLLERS
[J].
IEEE CONTROL SYSTEMS MAGAZINE,
2012, 32 (06)
:76-105
[10]
Train Rescheduling With Stochastic Recovery Time: A New Track-Backup Approach
[J].
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS,
2014, 44 (09)
:1216-1233