共 31 条
- [1] BARTO AG, 1989, COINS8995 U MASS TEC
- [2] BARTO AG, 1991, COINS9157 U MASS TEC
- [3] BARTO AG, 1990, CONNECTIONIST MODELS, P35
- [4] Bellman R. E., 1957, DYNAMIC PROGRAMMING
- [5] Berry DA., 1985, BANDIT PROBLEMS SEQU, DOI DOI 10.1007/978-94-015-3711-7
- [6] Bertsekas D.P., 1989, PARALLEL DISTRIBUTED
- [7] CHAPMAN D, 1990, TR9011 TEL RES TECHN
- [8] CHRISTIANSEN AD, 1990, IEEE C ROBOTICS AUTO, P1224
- [9] THE CONVERGENCE OF TD(LAMBDA) FOR GENERAL LAMBDA [J]. MACHINE LEARNING, 1992, 8 (3-4) : 341 - 362
- [10] KAELBLING LP, 1990, TR9004 STANF U DEP C