共 33 条
- [2] [Anonymous], 2012, REINFORCEMENT LEARNI
- [3] [Anonymous], 1996, Neuro-dynamic programming
- [4] [Anonymous], 2013, Optimal adaptive control and differential games by reinforcement learning principles
- [6] Dierks T, 2010, P AMER CONTR CONF, P1568
- [7] Optimal Tracking Control of Affine Nonlinear Discrete-time Systems with Unknown Internal Dynamics [J]. PROCEEDINGS OF THE 48TH IEEE CONFERENCE ON DECISION AND CONTROL, 2009 HELD JOINTLY WITH THE 2009 28TH CHINESE CONTROL CONFERENCE (CDC/CCC 2009), 2009, : 6750 - 6755
- [8] Finlayson B.A., 1990, The method of weighted residuals and variational principles
- [10] Howard R. A., 1960, Dynamic programming and Markov processes