共 38 条
[6]
Lee JM, 2004, INT J CONTROL AUTOM, V2, P263
[7]
Reinforcement Learning for Partially Observable Dynamic Processes: Adaptive Dynamic Programming Using Measured Output Data
[J].
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS,
2011, 41 (01)
:14-25
[8]
Lewis F. L., 2013, Reinforcement Learning and Approximate Dynamic Programming for Feedback Control
[10]
Online Synchronous Approximate Optimal Learning Algorithm for Multiplayer Nonzero-Sum Games With Unknown Dynamics
[J].
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS,
2014, 44 (08)
:1015-1027