共 22 条
[11]
PARR R, 1998, ADV NEURAL INFORMATI, V10
[12]
PRECUP D, 1997, 1997 AAAI FALL S MOD
[13]
RUMMERY GA, 1994, 16L CUEDFINFENGTR
[14]
SINGER B, 1999, CMUCS99122
[15]
Sutton R. S., 1988, Machine Learning, V3, P9, DOI 10.1023/A:1022633531479
[16]
Sutton R.S., 1984, Temporal Credit Assignment in Reinforcement Learning
[17]
Sutton R. S., 1998, Reinforcement Learning: An Introduction, V22447
[18]
Sutton RS, 1996, ADV NEUR IN, V8, P1038
[19]
Sutton-Tyrrell K, 1999, CIRCULATION, V99, P1105
[20]
TADEPALLI P, 1997, P INT C MACH LEARN S, P358