共 26 条
[2]
ARAR AR, 1995, OPTIM CONTR APPL MET, V16, P149
[3]
Barto A. G., 1990, LEARNING COMPUTATION, P539
[4]
Bellman R., 1957, DYNAMIC PROGRAMMING
[5]
Christopher John Cornish Hellaby Watkins, 1989, LEARNING DELAYED REW
[6]
COLLINS E, 1999, P IEEE C DEC CONTR, V4, P4044
[8]
De Jong K. A., 1975, ANAL BEHAV CLASS GEN
[9]
DIXON K, 2002, 1 CARN MELL U
[10]
Howard R.A, 1960, Dynamic Programming and Markov Processes