共 53 条
[2]
Anderson C.W., 1987, P 4 INT WORKSH MACH
[3]
[Anonymous], 1989, LEARNING DELAYED REW
[4]
[Anonymous], 1993, P 1993 CONN MOD SUMM
[7]
Baird L. C., 1995, ICML 95 P 12 INT C M
[8]
Barto A. G., 1990, P 1990 CONN MOD SUMM
[9]
NEURONLIKE ADAPTIVE ELEMENTS THAT CAN SOLVE DIFFICULT LEARNING CONTROL-PROBLEMS
[J].
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS,
1983, 13 (05)
:834-846
[10]
Bellman R., 1957, DYNAMIC PROGRAMMING, V1st