63 entries in total
[51]
Puterman M., 1994, MARKOV DECISION PROC
[52]
Rummery G. A., 1994, CUEDFINFENGTR166
[53]
LEARNING APPLIED TO SUCCESSIVE APPROXIMATION ALGORITHMS [J]. IEEE TRANSACTIONS ON SYSTEMS SCIENCE AND CYBERNETICS, 1970, SSC6 (02): 97-&
[54]
Singh SP, 1996, MACH LEARN, V22, P123, DOI 10.1007/BF00114726
[55]
Sutton R. S., 1988, Machine Learning, V3, P9, DOI 10.1023/A:1022633531479
[56]
Sutton R.S., 1990, P 7 INT C MACHINE LE, P216
[57]
Sutton R.S., 1984, Temporal Credit Assignment in Reinforcement Learning
[58]
SUTTON RS, 1996, ADV NEURAL INFORMATI, V8
[59]
FUZZY IDENTIFICATION OF SYSTEMS AND ITS APPLICATIONS TO MODELING AND CONTROL [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1985, 15 (01): 116-132
[60]
TESAURO G, 1992, MACH LEARN, V8, P257, DOI 10.1007/BF00992697