共 34 条
[2]
Altman E., 1998, Constrained Markov decision processes
[4]
ANDREWS M, 2005, P 2005 IMA SUMM WORK
[5]
[Anonymous], 1996, Neuro-dynamic programming
[7]
Bertsekas D., 2001, Dynamic Programming and Optimal Control, Two Volume Set
[10]
Chen H. F., 2002, STOCHASTIC APPROXIMA