33 entries in total
[2]
[Anonymous], 2012, REINFORCEMENT LEARNI
[3]
[Anonymous], 1996, Neuro-dynamic programming
[4]
[Anonymous], 2013, Optimal adaptive control and differential games by reinforcement learning principles
[6]
Dierks T, 2010, P AMER CONTR CONF, P1568
[7]
Optimal Tracking Control of Affine Nonlinear Discrete-time Systems with Unknown Internal Dynamics [J]. Proceedings of the 48th IEEE Conference on Decision and Control, 2009 Held Jointly with the 2009 28th Chinese Control Conference (CDC/CCC 2009), 2009: 6750-6755
[8]
Finlayson B.A., 1990, The method of weighted residuals and variational principles
[10]
Howard R. A., 1960, Dynamic programming and Markov processes