共 27 条
[1]
Discrete-time nonlinear HJB solution using approximate dynamic programming: Convergence proof
[J].
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS,
2008, 38 (04)
:943-949
[3]
[Anonymous], 1996, Neuro-dynamic programming
[4]
BRADTKE SJ, 1994, PROCEEDINGS OF THE 1994 AMERICAN CONTROL CONFERENCE, VOLS 1-3, P3475
[5]
Dierks T, 2010, P AMER CONTR CONF, P1568
[8]
Kiumarsi-Khomartash B, 2013, IEEE DECIS CONTR P, P3845, DOI 10.1109/CDC.2013.6760476
[9]
Lancaster P., 1995, Algebraic Riccati Equations
[10]
Reinforcement Learning for Partially Observable Dynamic Processes: Adaptive Dynamic Programming Using Measured Output Data
[J].
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS,
2011, 41 (01)
:14-25