共 9 条
[3]
Howard R. A., 1960, Dynamic programming and Markov processes
[5]
Lozano M., 2008, J NETWORKS COMP APP
[6]
Reynolds C.W., 1987, ACM ANN C COMP GRAPH, P25, DOI [10.1145/37402.37406, DOI 10.1145/37402.37406]
[7]
Stone P., 2005, ADAPTIVE BEHAV, V13
[8]
Taylor M. E., 2005, 4 IJC AUTONOMOUS AGE
[9]
WATKINS CJCH, 1992, MACH LEARN, V8, P279, DOI 10.1007/BF00992698