共 9 条
- [1] Barbosa M., 2009, P INT C INT IN PRESS
- [2] Bertsekas Dimitri P, 2000, Dynamic programming and optimal control, V1
- [3] Choi M., 2007, CONTR AUT SYST 2007, P1222
- [4] Chung T., 2004, DEC CONTR 2004 CDC 4, V2, P1914
- [5] Park JJ, 2007, INT J CONTROL AUTOM, V5, P674
- [6] Roy N., 2000, ADV NEURAL INFORM PR, V12
- [7] SANFELIU A, 2006, P IEEE RSJ IROS WORK
- [8] SINGH A, 2007, INT JOINT C ART INT
- [9] Sutton R.S., 1998, Introduction to reinforcement learning, V2