共 19 条
- [2] [Anonymous], 2000, Dynamic programming and optimal control
- [3] Bellman R. E., 1957, Dynamic programming. Princeton landmarks in mathematics
- [4] BRADKTE S, 1994, THESIS U MASSACHUSET
- [5] BRADTKE SJ, 1994, PROCEEDINGS OF THE 1994 AMERICAN CONTROL CONFERENCE, VOLS 1-3, P3475
- [6] Christopher John Cornish Hellaby Watkins, 1989, Learning from Delayed Rewards
- [7] Goodwin G. C., 1984, Adaptive filtering prediction and control
- [9] HAGEN S, 2001, THESIS U AMSTERDAM
- [10] Adaptive critic methods for stochastic systems with input-dependent noise [J]. AUTOMATICA, 2007, 43 (08) : 1355 - 1362